price tracker · updated 2026-05-14
Cheapest Llama 3.3 70B API
llm · per 1M output tokens
Llama 3.3 70B inference ranges from $0.30 to $0.90 per 1M output tokens depending on provider — Groq is the cheapest verified endpoint.
Llama 3.3 70B
llm · per 1M output tokens
save up to 62%
Cheapest verified endpoint: groq at $0.30 (llm · per 1M output tokens). Prices normalized per unit, ex. egress.
How we track this
We re-pull every known provider that serves Llama 3.3 70B weekly, normalize to a single unit (llm · per 1M output tokens), and pin the price the moment a provider posts it — we don't average across stale snapshots. Switching providers is usually a one-line base-URL change. Spot a stale price? tell us.
Other models we track
Cheapest Seedance 2.0 API
video gen · 5s @ 1080p · save up to 84%
Cheapest Nano Banana Pro API
image gen · 1024² · per image · save up to 55%
Cheapest Whisper Large v3 API
transcription · per minute · save up to 73%
Building on AI? Don't pay full price.
Perkstack also tracks 200+ verified AI tool credits and startup grants — free with an account.