Superhuman vs Deepgram
Side-by-side trajectory, velocity, and editorial themes.
Inbox becomes an MCP endpoint — agents now drive Superhuman alongside humans, in your voice.
Superhuman ships at a very high cadence, mixing mobile polish (Quick Reply from notifications, a calendar widget, Split Inbox reorder/hide) with category-shifting AI work. The April MCP (Model Context Protocol) launch turned Superhuman Mail into a callable surface for Claude, ChatGPT, and other assistants, with 'uniquely Superhuman' actions (Smart Send, Read Statuses, Split Inbox triage) exposed as tools. Draft Sync with Gmail/Outlook extends that bridge to the agent ecosystem: assistants can draft anywhere, and you review and send in Superhuman.
The product is moving from 'fast email for power users' to an inbox shared by AI agents and humans. Personalization, Write with Voice, and MCP form a clear stack (voice in, agent action, voice out), while continued Split Inbox refinement keeps the original keyboard-shortcut power users served. Mobile gets weekly polish so that surface does not decay while the AI direction takes the headlines.
The next likely move is delegated-inbox MCP actions for executive assistants (permissions to act on behalf of another user), plus recurring agent tasks tied to Personalization rules. A cross-app demo (Superhuman, Granola, and a calendar tool, all connected via MCP) is the obvious narrative the May 21st virtual event has been set up to deliver.
Deepgram pairs a real diarization quality jump with voice-agent platform breadth.
Deepgram is shipping on two tracks at once. The speech-recognition core is getting model-quality work: diarization v2 is the headline, with profanity filtering and numerals support expanding across a long tail of languages. In parallel, the Voice Agent API is being built out as a multi-vendor orchestration layer, with managed Gemini, GPT, and Cartesia options sitting next to Deepgram's own Aura-2 TTS and Flux ASR.
The arc is two products converging: a best-in-class speech stack and an opinionated voice-agent runtime that abstracts the LLM/TTS choice. Diarization v2 — preferred 3.3× over v1 in human eval, with ~80% median CER reduction on contact-center audio — is the kind of underlying model win that pulls call-center workloads onto the platform. Meanwhile, runtime controls like Aura-2 speed and pronunciation, plus managed third-party LLMs, position Deepgram as a single integration target rather than a single component vendor.
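To make the two headline figures concrete, here is a small worked calculation. It rests on two assumptions that do not come from Deepgram: that "preferred 3.3×" means raters picked v2 3.3 times as often as v1 (ties ignored), and a made-up 10% baseline error rate used purely for illustration.

```python
def preference_share(ratio: float) -> float:
    """Share of pairwise votes won, given a win ratio of `ratio`:1 (ties ignored)."""
    return ratio / (ratio + 1.0)


def reduced_error(baseline: float, relative_reduction: float) -> float:
    """Error rate left after applying a relative reduction to a baseline."""
    return baseline * (1.0 - relative_reduction)


# Assumption: 3.3x preference read as a 3.3:1 win ratio for v2 over v1.
share = preference_share(3.3)

# Assumption: hypothetical 10% baseline error rate, not a published figure.
after = reduced_error(0.10, 0.80)

print(f"v2 preferred in {share:.1%} of comparisons")   # 76.7%
print(f"10.0% baseline error becomes {after:.1%}")     # 2.0%
```

Read this way, a 3.3× preference corresponds to v2 winning roughly three of every four head-to-head comparisons, and an 80% relative reduction leaves one fifth of the original errors.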
Expect Diarization v2 to become the default behind diarize=true once the opt-in window closes, and expect the Voice Agent API to keep adding tier-priced managed providers — that's the obvious monetization layer. Multilingual feature parity (numerals, profanity, Flux) will continue to fill in tail languages, narrowing the gap between English-only buyers and global deployments.
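For readers unfamiliar with the diarize=true flag mentioned above, a minimal sketch of where it sits in a request URL follows. The endpoint and the punctuate parameter are taken from Deepgram's public pre-recorded API as commonly documented and should be verified against the current docs; the helper name build_listen_url is hypothetical. The opt-in switch for v2 is not named in this piece, so it is not shown.

```python
from urllib.parse import urlencode

# Assumed endpoint for pre-recorded transcription; verify against current docs.
DEEPGRAM_LISTEN_URL = "https://api.deepgram.com/v1/listen"


def build_listen_url(diarize: bool = True, **extra: str) -> str:
    """Build a request URL with diarization toggled via the query string.

    Hypothetical helper for illustration only; it sends no request.
    """
    params = {"diarize": str(diarize).lower(), **extra}
    return f"{DEEPGRAM_LISTEN_URL}?{urlencode(params)}"


url = build_listen_url(punctuate="true")
print(url)  # https://api.deepgram.com/v1/listen?diarize=true&punctuate=true
```

If v2 does become the default, callers who already pass diarize=true would pick it up without changing this URL, which is why the default switch matters more than the opt-in.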