Using

Half the Error Rate. 578x Lower Cost.

Modulate ranks #1 on the 🤗 Hugging Face Speech Deepfake Arena leaderboard. 5x more accurate than Resemble AI at 578x lower cost.

1.1% EER vs 2.5% - less than half the error rate
$0.25/hr vs $144/hr - 578x cheaper
98.9% accuracy - 11 misses per 1K real voice calls vs 26
Detection in under 2.5s
Get Immediate API Access - 40 Free Hours

No sales conversation needed

Hugging Face’s Deepfake Speech Leaderboard

Modulate is the top ranked deepfake detection model on Hugging Face's Speak Deepfake Arena , the leading independent benchmark. View it here.

Modulate is #1 on 🤗 Hugging Face

Modulate is the top ranked deepfake detection model on Hugging Face's Speech Arena Leaderboard, the leading independent benchmark. Just 1.1% Equal Error Rate, Modulate catches 133% more deepfakes than the next best.
System Date Added Num Params (M) Pooled EER Average EER ↓
🥇Modulate-VELMA-2-Syntheti
🥇Modulate-VELMA-2-Syntheti 11/03/2026 316.000 1.586 1.104
🥈Resemble-Detect-3B-Omni
🥈Resemble-Detect-3B-Omni 14/10/2025 3000.000 2.099 2.570
🥉Hiya-Authenticity-Verific
🥉Hiya-Authenticity-Verific 13/02/2026 1000.000 2.324 2.113
DLMSL-SpeakSure-v0.1
DLMSL-SpeakSure-v0.1 27/10/2025 658.630 6.142 3.954
Whispeak
Whispeak 20/08/2025 98.900 8.060 3.049
EER (Equal Error Rate) is the foundation performance metric used to evaluate how accurately a model can distinguish between genuine human speech and AI-generated audio.

Modulate Catches 99% of all Deepfakes

Catch 2x more deepfakes and flag 48% fewer false positives vs. next-best. 🤗 Hugging Face Leaderboard.
Accuracy
92
94
96
98
100%
98.9%
Modulate
velma-deepfake-detect
97.9%
Hiya
authenticity-verific
97.4%
Resemble AI
resemble-detect-3b
96.9%
Whispeak
whispeak
96.0%
Deep Learning
dlmsl-speaksure-v0.1
94.2%
DF Arena
df-arena-500m-v1
94.1%
DF Arena
df-arena-1b-v1
93.9%
Syntra
syntra-detector
92.9%
Momenta
momenta

Detect Deepfakes for just $0.25 / hr

Fraud protection at scale, at a price that levels the playing field vs. scammers.
Modulate Deepfake-Detect
$0.25 / hr
Resemble AI Enterprise
$29 / hr
Other Providers
$30 — $120 / hr
Resemble AI Self-Serve
$144 / hr

Modulate vs. Resemble AI
Let the Data Decide

Feature
Accuracy
98.90%
97.4%
Equal Error Rate
1.10%
2.57%
Cost (audio detection)
$0.25/hr
$144/hr ($0.04/sec)
Model parameters
316M
>1B
Deepfakes missed per 1K
11
26
False positives per 1K
11
26
Optimized for
Noise resilience
Clean recordings only
Core expertise
Voice intelligence & detection
Voice generation & cloning
Transcription
$0.03/hr (available now)
Not available
Additional Models
Deepfake only

Their Specialty Is Voice Generation.
Ours Is Voice Intelligence.

#1 on 🤗 Hugging Face. They're Not.

Lower is better. Modulate: 1.1% vs Resemble AI: 2.5%

Detect in Under 2.5 Seconds.

Resemble needs up to 30 seconds of audio to detect a deepfake.

$0.25/hr. Not $144.

Resemble charges $144/hr. Modulate is 578x cheaper

Drop-In API. No Friction.

Integrate in minutes, not weeks.

Simple REST API

Clean documentation

Works with your existing stack

Built for real-time and batch detection

Deepfake Detection Is Just the Beginning

Teams switching to Modulate also get transcription, emotion, and accent detection; capabilities Resemble AI doesn't offer.

Deepfake Detection
Available Now AT $0.25/hr
Emotion Detection
Available Now
Conversation Understanding
Coming Soon

1.1% EER. $0.25/hr.
98.9% Accuracy.
Switch Now.

See why teams are switching from Resemble AI. 40 free hours.
No credit card. No sales conversation required.

© 2026 Modulate. The Voice Intelligence Company.

The #1 Deepfake Detection API - Try It Free
Try The API