Skip to main content
aibizhub

Comparison · 8 min · 2 citations

Deepgram vs AssemblyAI Pricing 2026: Speech-to-Text Cost

Deepgram vs AssemblyAI pricing 2026: Deepgram Nova-3 is $0.0077/min pre-recorded, AssemblyAI is $0.15/hour. Per-minute vs per-hour billing compared.

By AI Biz Hub · Published May 25, 2026

Education · General business information, not legal, tax, or financial advice. Editorial standards Sponsor disclosure Corrections

TL;DR

Deepgram bills per minute, AssemblyAI per hour, so normalize before comparing. Deepgram Nova-3 pre-recorded is $0.0077/min, about $0.462/hour[1]. AssemblyAI Universal-2 async is $0.15/hour and Universal-3 Pro is $0.21/hour[2].

On its lower-cost async model, AssemblyAI is cheaper per hour. On its flagship Pro model the two converge. Both ship real free credits (Deepgram $200; AssemblyAI a generous hour allowance), so validate accuracy on your own audio before the rate decides anything.

Deepgram and AssemblyAI are the two speech-to-text APIs a builder compares when adding transcription, captions, or voice features to a 2026 product. Their prices look an order of magnitude apart at a glance, but only because one quotes per minute and the other per hour. This article converts both to the same unit, separates pre-recorded from streaming pricing, and notes the free credits that let you test accuracy before the rate matters.

1. Per-minute vs per-hour: normalize first

The two vendors quote in different units, which is the first thing to fix. Verified against each pricing page as of May 25, 2026.

  • Deepgram quotes per minute. Nova-3 monolingual pre-recorded is $0.0077/min on pay-as-you-go[1].
  • AssemblyAI quotes per hour. Universal-2 async is $0.15/hour; Universal-3 Pro async is $0.21/hour[2].

To compare, convert Deepgram's per-minute rate to per hour by multiplying by 60: $0.0077 × 60 = $0.462 per hour. Now the two are in the same unit and the comparison is real. Quoting one vendor per minute and the other per hour is the single most common source of error in speech-to-text cost analysis, because $0.0077 looks far smaller than $0.15 until you realize they measure different amounts of audio.

2. Normalized rates compared

All rates converted to cost per hour of audio, pay-as-you-go, as of May 25, 2026.

ModelQuoted ratePer hour
Deepgram Nova-3 (pre-recorded)$0.0077/min[1]~$0.462/hr
AssemblyAI Universal-2 (async)$0.15/hr[2]$0.15/hr
AssemblyAI Universal-3 Pro (async)$0.21/hr[2]$0.21/hr

On the normalized per-hour basis, AssemblyAI's lower-cost Universal-2 async model at $0.15/hour is below Deepgram's Nova-3 flagship at about $0.462/hour, and even AssemblyAI's Universal-3 Pro at $0.21/hour comes in under it. The comparison is not apples-to-apples on capability, since these are each vendor's different model tiers, but on published rate for pre-recorded transcription AssemblyAI's lower tiers are cheaper per hour. Deepgram offers further savings on its Growth tier ($0.0065/min for Nova-3 pre-recorded), which narrows the gap for committed-volume customers[1].

3. Streaming is priced separately on both

Real-time streaming transcription is priced separately and generally differs from pre-recorded on both platforms:

  • Deepgram Nova-3 streaming: $0.0048/min pay-as-you-go, about $0.288/hour, which is actually lower than its pre-recorded rate[1].
  • AssemblyAI streaming: Universal-Streaming is $0.15/hour, while Universal-3 Pro streaming is $0.45/hour[2].

The streaming picture inverts part of the pre-recorded ranking: Deepgram's streaming Nova-3 at about $0.288/hour undercuts AssemblyAI's Universal-3 Pro streaming at $0.45/hour, while AssemblyAI's lower-cost Universal-Streaming stays at $0.15/hour. The lesson is that the cheaper vendor depends on both the workload (pre-recorded vs streaming) and which model tier you choose. Decide your use case first, then read the matching rate; do not assume the pre-recorded ranking carries over to streaming. To fold transcription spend into your full monthly stack budget, use the AI stack cost calculator.

4. Free credits and trial volume

Both vendors let you test on real audio before paying, which matters because accuracy on your specific audio (accents, domain terms, noise) decides quality more than the rate does:

  • Deepgram: $200 free credit on new pay-as-you-go accounts, no credit card required, no expiration[1]. At $0.0077/min that funds roughly 26,000 minutes of Nova-3 pre-recorded transcription to test with.
  • AssemblyAI: a free tier with no credit card required and a substantial included allowance of pre-recorded and streaming hours for new accounts[2].

The planning move is to run both free credits on a representative sample of your actual audio and compare transcription accuracy, not just price. A model that is a fraction of a cent cheaper per minute but produces more errors costs you far more in correction and bad user experience than the rate difference saves. Use the free credits to make accuracy the deciding factor, with price as the tiebreaker.

5. Decision guidance

  • Cheapest published pre-recorded rate: AssemblyAI Universal-2 async at $0.15/hour, below Deepgram Nova-3 at about $0.462/hour.
  • Low-cost streaming: Deepgram Nova-3 streaming at about $0.288/hour undercuts AssemblyAI Universal-3 Pro streaming ($0.45/hour); AssemblyAI Universal-Streaming is $0.15/hour.
  • Committed volume: Deepgram's Growth tier ($0.0065/min Nova-3 pre-recorded) narrows the gap with pre-paid annual credits.
  • Before deciding on price: test both free credits on your real audio. Accuracy on your domain beats a fractional rate difference.
  • Always: normalize to the same unit (per hour) before comparing any two quotes.

Re-verify each pricing page before committing; speech-to-text rates and model tiers change frequently. For the broader AI-vendor cost picture, see the cheapest LLM API ranking and the 2026 AI solopreneur stack.

All rate figures verified against official pricing pages as of 2026-05-25.

Frequently asked questions

Is Deepgram or AssemblyAI cheaper for transcription in 2026?

It is close once you normalize the units. Deepgram bills per minute and AssemblyAI per hour. Deepgram Nova-3 pre-recorded is $0.0077 per minute, which is about $0.462 per hour, while AssemblyAI Universal-2 async is $0.15 per hour and Universal-3 Pro is $0.21 per hour, verified on each vendor's pricing as of May 2026. On its lower-cost async model AssemblyAI is cheaper per hour; on its flagship Pro model the two are closer. The decisive step is converting both to the same unit before comparing.

Why can't I compare Deepgram and AssemblyAI prices directly?

Because Deepgram publishes per-minute rates and AssemblyAI publishes per-hour rates. Deepgram's $0.0077 per minute looks tiny next to AssemblyAI's $0.15 per hour, but they are not in the same unit. Multiply Deepgram's per-minute rate by 60 to get a per-hour figure of about $0.462, then compare. Without that conversion the comparison is meaningless, which is the single most common mistake in speech-to-text cost analysis.

Do Deepgram and AssemblyAI offer free credits?

Yes, both. Deepgram gives new accounts a $200 free credit with no credit card required and no expiration on its pay-as-you-go tier. AssemblyAI provides a free tier with no credit card required that includes a substantial allowance of pre-recorded and streaming transcription hours for new accounts. Both let you validate transcription quality on real audio before paying, which matters more than the headline rate when accuracy on your specific audio is the real decision.

References

Sources

Primary sources only. No vendor-marketing blogs or aggregated secondary claims.

  1. 1 Deepgram — Pricing (Nova-3 pre-recorded $0.0077/min PAYG, streaming $0.0048/min; $200 free credit) — accessed 2026-05-25
  2. 2 AssemblyAI — Pricing (Universal-2 async $0.15/hour, Universal-3 Pro $0.21/hour; Universal-Streaming $0.15/hour) — accessed 2026-05-25

Tools referenced in this article

Business planning estimates — not legal, tax, or accounting advice.