Voice intelligence for games & social platforms

Real-time voice safety for communities where conversation moves fast

Protect players and users—without killing the conversation

Voice is where communities come alive—and where harm, abuse, and manipulation escalate fastest. Modulate helps gaming studios and social platforms understand what's happening in voice conversations as they happen, so they can protect users, enforce policies fairly, and preserve authentic connection.

Powered by Velma, the world's first production Ensemble Listening Model, the Modulate platform understands how something was said—not just what—making it far more effective than text-first or LLM-based moderation approaches.

Protect players and users—without killing the conversation

Voice is where communities come alive—and where harm, abuse, and manipulation escalate fastest.

Modulate helps gaming studios and social platforms understand what’s happening in voice conversations as they happen, so they can protect users, enforce policies fairly, and preserve authentic connection.

Powered by Velma, the world’s first production Ensemble Listening Model, the Modulate platform understands how something was said—not just what—making it far more effective than text-first or LLM-based moderation approaches.

ToxMod: Real-time voice moderation at scale

ToxMod is Modulate’s voice moderation product for gaming and social platforms. It monitors live voice conversations to detect harassment, hate, threats, grooming, and other harmful behaviors—in real time, with low latency and high precision.

Unlike legacy moderation tools that rely on transcripts or keyword matching, ToxMod listens directly to voice and behavioral signals such as tone, intensity, escalation patterns, and interaction dynamics.

Why voice moderation needs a new approach

Most moderation systems were built for text.
Voice is different.

Two conversations with identical transcripts can have completely different meanings depending on tone, pacing, sarcasm, interruption, or emotional escalation. That nuance is exactly where harm—and intent—often lives.

No brittle keyword rules

No reliance on text-only LLM pipelines

No black-box outputs without evidence

Instead, ToxMod is powered by Velma, Modulate's flagship Ensemble Listening Model—designed specifically to understand real conversations in adversarial, fast-moving environments, and provide a clear explanation for each of its conclusions.

What ToxMod detects in real time

ToxMod continuously analyzes live voice channels and produces real-time signals and events, including:

Harassment, hate, and abusive behavior

Escalation and conflict patterns

Threats and intimidation

Grooming and exploitation indicators

Coordinated or repeated toxic behavior

Contextual misuse (e.g., sarcasm, baiting, dogwhistles)

Built for gaming and social platforms

ToxMod is trusted in some of the most adversarial voice environments in the world and is designed to scale across:

Multiplayer games

Protect players in competitive, cooperative, and social play—without disrupting gameplay or adding friction.

Social audio & voice-first platforms

Moderate live rooms, group chats, and creator communities where tone and interaction dynamics matter as much as words.

UGC-driven communities

Support large-scale, user-generated voice interactions with consistent, policy-aligned enforcement.

We're focused on delivering a welcoming community for all players. Through our collaboration with Modulate, we're continuing to unlock new ways for technology to strengthen the player experience and encourage more positive moments of play across our games. Voice chat is a powerful part of how players communicate and compete in our games. By using proactive machine learning to identify disruptive behavior in real time, we're building a foundation that supports enforcement while sustaining the fun and fairness that define great multiplayer experiences.
Natasha Tatarchuk

Senior Vice President, Chief Technology Officer

Activision

How it fits into your platform

ToxMod integrates directly into your existing voice infrastructure and moderation workflows.

Live monitoring: Detect harmful behavior as it happens—enabling warnings, interventions, or automated actions in real time

Human-in-the-loop review: Surface high-confidence events for moderators with audio clips, timestamps, and behavioral context

Flexible enforcement: Route signals into your existing trust & safety systems, moderation tools, or custom workflows

Why Modulate?

Uncontested Voice Intelligence, powered by Velma

Modulate's platform builds on years of experience processing hundreds of millions of complex audio conversations - the culmination of which is our unique analysis engine, Velma. Velma is the world's first voice-native Ensemble Listening Model (ELM), designed to understand how something was said—not just what was said.

Velma consists of hundreds of component models, each focused on a specific aspect of speech or behavior, and orchestrated carefully to build up a complete understanding of any conversation.

Baked into an Enterprise Platform designed to Solve Real Problems

Despite our leading role in AI research, Modulate's core focus is delivering usable, practical tools for enterprise needs. That's why we're not just handing you an AI model, but an enterprise platform complete with expert support, compliant design for CCPA, GDPR, ISO 27001, HIPAA, and more, and integrations into telephony, CCaaS, VoIP, and other critical systems used by enterprises for calls today.

Velma's architecture ties into this mission as well. The unique ELM design allows Modulate to:

  • Fine-tune individual detectors without retraining a monolithic model

  • Provide transparent and clear audit trails explaining how we arrived at each decision

  • Adapt quickly as adversarial tactics evolve

  • Deliver consistent performance across languages, accents, and environments

Insights that fit your workflow.

Pairing Velma’s industry-leading intelligence with the reliability and control required for enterprise.

Dashboards and Review Console

Explore conversations and escalations in a UI designed for operations teams—fraud, trust & safety, and contact center leadership.

APIs and Webhooks

Bring voice intelligence into your stack: route signals into case management, risk engines, agent coaching tools, or moderation workflows.

Integrations

Deploy without ripping and replacing. Connect into the voice infrastructure you already use.

Compatible with

Built for trust, transparency, and scale

Designed for real-time operation with low latency

Enterprise-grade security and privacy practices

Transparent, reviewable outputs for moderators and stakeholders

Proven at scale in live, adversarial voice environments

Keep voice social—without letting it turn harmful

Voice should bring people together in games and on social platforms. With ToxMod, you can protect users, enforce standards fairly, and preserve what makes your community engaging. Talk to our team today.