Comparison · Comms

Superhuman vs Deepgram

Side-by-side trajectory, velocity, and editorial themes.

COMMS

6.3

Inbox becomes an MCP endpoint — agents now drive Superhuman alongside humans, in your voice.

◆ Current state

Superhuman ships at very high cadence, mixing mobile polish (Quick Reply from notifications, calendar widget, Split Inbox reorder/hide) with category-shifting AI work. The April MCP launch turned Superhuman Mail into a callable surface for Claude, ChatGPT, and other assistants, with 'uniquely Superhuman' actions (Smart Send, Read Statuses, Split Inbox triage) exposed as tools. Draft Sync with Gmail/Outlook bridges the agent ecosystem further: assistants can draft anywhere, you review and send in Superhuman.

◆ Where it's heading

The product is moving from 'fast email for power users' to 'AI-and-humans share the inbox.' Personalization, Write with Voice, and MCP form a clear stack — voice in, agent action, voice out — with the original power-user keyboard-shortcut audience preserved through continued Split Inbox refinement. Mobile gets weekly polish to keep that surface from rotting while the AI direction takes the headlines.

◆ Prediction

Next likely move is delegated-inbox MCP actions for executive assistants (act-as-on-behalf permissions) and recurring agent tasks tied to Personalization rules. A cross-app demo — Superhuman + Granola + a calendar tool, all via MCP — is the obvious narrative the May 21st virtual event has been set up to deliver.

Deepgram

COMMS

6.3

Deepgram pairs a real diarization quality jump with voice-agent platform breadth.

◆ Current state

Deepgram is shipping on two tracks at once. The speech-recognition core is getting model-quality work — diarization v2 is the headline, with profanity filtering and numerals expanding across long tails of languages. In parallel, the Voice Agent API is being built out as a multi-vendor orchestration layer, with managed Gemini, GPT, and Cartesia options sitting next to Deepgram's own Aura-2 TTS and Flux ASR.

◆ Where it's heading

The arc is two products converging: a best-in-class speech stack and an opinionated voice-agent runtime that abstracts the LLM/TTS choice. Diarization v2 — preferred 3.3× over v1 in human eval, with ~80% median CER reduction on contact-center audio — is the kind of underlying model win that pulls call-center workloads onto the platform. Meanwhile, runtime controls like Aura-2 speed and pronunciation, plus managed third-party LLMs, position Deepgram as a single integration target rather than a single component vendor.

◆ Prediction

Expect Diarization v2 to become the default behind diarize=true once the opt-in window closes, and expect the Voice Agent API to keep adding tier-priced managed providers — that's the obvious monetization layer. Multilingual feature parity (numerals, profanity, Flux) will continue to fill in tail languages, narrowing the gap between English-only buyers and global deployments.

See more alternatives to Superhuman →
See more alternatives to Deepgram →