perkstack
100% free, just login to see

price tracker · updated 2026-05-14

Cheapest Llama 3.3 70B API

llm · per 1M output tokens

Llama 3.3 70B inference ranges from $0.30 to $0.90 per 1M output tokens depending on provider — Groq is the cheapest verified endpoint.

provider rankingsave up to 62%

Llama 3.3 70B

llm · per 1M output tokens

save up to 62%

01
groq
$0.30
02
together.ai
$0.54
03
fireworks
$0.90
04
openrouteravg
$0.79

Cheapest verified endpoint: groq at $0.30 (llm · per 1M output tokens). Prices normalized per unit, ex. egress.

How we track this

We re-pull every known provider that serves Llama 3.3 70B weekly, normalize to a single unit (llm · per 1M output tokens), and pin the price the moment a provider posts it — we don't average across stale snapshots. Switching providers is usually a one-line base-URL change. Spot a stale price? tell us.

Other models we track

Building on AI? Don't pay full price.

Perkstack also tracks 200+ verified AI tool credits and startup grants — free with an account.