Voice features — transcription and text-to-speech — are easy to add and easy to overpay for. Here is how to get speech AI for the lowest cost in 2026, starting with the providers that give you credits for free.
Start with free voice-AI credits
The fastest way to ship voice features for $0 is to use the signup credits these providers hand out:
- Deepgram — $200 in credits, usable across speech-to-text, text-to-speech and voice-agent APIs. The largest signup credit in the voice space.
- AssemblyAI — $50 in credits for transcription and speech understanding.
- ElevenLabs — a free monthly text-to-speech allowance (plus speech-to-text and voice design) on the free plan.
Those three alone cover a lot of an early product. See the always-current list in the catalog.
How to think about cost
Speech APIs are usually billed per minute (for STT) or per character (for TTS). When you compare, normalize to the same unit and watch for:
- Streaming vs batch — real-time streaming often costs more than pre-recorded batch transcription.
- Model tier — a smaller, faster model is frequently accurate enough and much cheaper.
- Free-tier limits — generous monthly allowances can cover low-volume apps indefinitely.
A low-cost voice stack
- Transcription: spend Deepgram's $200 credit first, then AssemblyAI's $50; both are strong and let you start with no card.
- Text-to-speech: ElevenLabs free tier for early volume, then compare per-character pricing as you scale.
- The LLM in the middle: keep it cheap by routing to the lowest-cost provider per model — see our rankings.
When you outgrow the free credits
Once you are past the credits, the same rule applies as with LLMs: the cheapest provider for the exact model and mode you need can be far below the default. Re-compare periodically, because voice pricing moves.
Grab the voice credits above with a free account, and pair them with the rest of your AI stack from the catalog. Related: free AI API credits.