What's the cheapest speech-to-text API?

Start with free credits — Deepgram ($200) and AssemblyAI ($50) — then compare per-minute pricing. A smaller, faster model is often accurate enough and noticeably cheaper.

Is there a free text-to-speech API?

ElevenLabs offers a free monthly text-to-speech allowance, and Deepgram's $200 signup credit also covers text-to-speech.

Streaming or batch transcription — which is cheaper?

Pre-recorded (batch) transcription is usually cheaper than real-time streaming, so use batch wherever latency isn't critical.

How do I avoid overpaying for voice AI?

Spend the signup credits first, pick the right model tier for your accuracy needs, and re-compare providers periodically — voice pricing changes often.

The Cheapest (and Free) Speech-to-Text and TTS APIs in 2026

Compare the cheapest speech-to-text and text-to-speech APIs in 2026, plus the providers that give you free voice-AI credits on signup.

Voice features — transcription and text-to-speech — are easy to add and easy to overpay for. Here is how to get speech AI for the lowest cost in 2026, starting with the providers that give you credits for free.

Start with free voice-AI credits

The fastest way to ship voice features for $0 is to use the signup credits these providers hand out:

Deepgram — $200 in credits, usable across speech-to-text, text-to-speech and voice-agent APIs. The largest signup credit in the voice space.
AssemblyAI — $50 in credits for transcription and speech understanding.
ElevenLabs — a free monthly text-to-speech allowance (plus speech-to-text and voice design) on the free plan.

Those three alone cover a lot of an early product. See the always-current list in the catalog.

How to think about cost

Speech APIs are usually billed per minute (for STT) or per character (for TTS). When you compare, normalize to the same unit and watch for:

Streaming vs batch — real-time streaming often costs more than pre-recorded batch transcription.
Model tier — a smaller, faster model is frequently accurate enough and much cheaper.
Free-tier limits — generous monthly allowances can cover low-volume apps indefinitely.

A low-cost voice stack

Transcription: spend Deepgram's $200 credit first, then AssemblyAI's $50; both are strong and let you start with no card.
Text-to-speech: ElevenLabs free tier for early volume, then compare per-character pricing as you scale.
The LLM in the middle: keep it cheap by routing to the lowest-cost provider per model — see our rankings.

When you outgrow the free credits

Once you are past the credits, the same rule applies as with LLMs: the cheapest provider for the exact model and mode you need can be far below the default. Re-compare periodically, because voice pricing moves.

Grab the voice credits above with an account, and pair them with the rest of your AI stack from the catalog. Related: free AI API credits.

The Cheapest (and Free) Speech-to-Text and TTS APIs in 2026

Start with free voice-AI credits

How to think about cost

A low-cost voice stack

When you outgrow the free credits

Frequently asked questions

Keep reading