Using Deepgram?
‍Pay 88% less for transcription with higher accuracy

Modulate's Transcription API is 10x lower cost than Deepgram. Trained on 500M hours or real-world, noisy data, Modulate’s API is more accurate across 3 independent benchmarks.

Best-in-class accuracy on real-world speech

88% less expensive ($0.03 / hr vs. $0.26 / hr)

Best-in-class accuracy on real-world speech (AMI Corpus and IHM)

Detects 20+ emotions and 20+ accents

Supports 70+ languages, PII redaction and more

Get Immediate API Access
400 Hours Free

No sales conversation needed

Transcription Accuracy (Word Error Rate)

Word Error Rate across benchmark datasets — lower is better

Word Error Rate

30%

7.5%

Modulate

velma-transcribe

15.7%

Deepgram

nova-3

Earnings-22

8.0%

Modulate

velma-transcribe

8.2%

Deepgram

nova-3

VoxPopuli

14.9%

Modulate

velma-transcribe

28.1%

Deepgram

nova-2

AMI Meeting Corpus

Transcription Cost ($/hr)

Cost per hour ($)

0.2$

0.4$

$0.03

Modulate

transcribe

$0.31

Deepgram

nova-3

Batch Transcription

$0.06

Modulate

transcribe

$0.35

Deepgram

nova-3

Streaming Transcription

Compare Modulate vs. Deepgram

Explore cost and accuracy comparison tool

Modulate vs. Deepgram

A side-by-side comparison for teams evaluating transcription providers

Feature

Batch Cost:

$0.03 / hour

$0.31 / hour (nova-3)

Streaming Cost:

$0.06 / hour

$0.35 / hour (nova-3)

Additional features:

All free

$0.12 / hour

Accent detection

20+ accents

None

Language Support

57+ languages

50+ languages

PII / PHI redaction

Yes, $0.02/hr

Yes, $0.12/hr

Diarization

Yes

Yes, $0.12/hr

Streaming Support

Real-time streaming

Real-World Accuracy

14.9% WER on AMI Corpus

28.1% WER on AMI Corpus

Accuracy on Earnings-22

7.8% WER

15.7% WER

Training Data

500M+ hours of conversations

Primarily curated / structured datasets

Overlapping speakers

Handles naturally

Underperforms in complex multi-speaker audio

#1 Accuracy in independent
benchmarks

10x lower cost than leading competitors

Built for real production
workloads

Enterprise-ready performance

Drop-In API. No Friction.

Integrate in minutes, not weeks.

Simple REST API

Clean documentation

Works with your existing stack

Built for real-time and batch transcription

Transcription is Just the Beginning

Teams switching to Modulate experience unique features such as emotion detection, accent detection. Modulate also offers Deepfake Detection at 100x lower cost and Conversation Understanding

Start with transcription. Gain capabilities with Velma

The #1 Transcription API — Try It Free

Try The API

Cookie consent notice

Preferences Dashboard

Using Deepgram?‍Pay 88% less for transcription with higher accuracy

Compare Modulate vs. Deepgram

Modulate vs. Deepgram

Drop-In API. No Friction.

Transcription is Just the Beginning

Transcription

Emotion Detection

Deepfake Detection

Conversation Understanding

Using Deepgram?
‍Pay 88% less for transcription with higher accuracy