LiveKit Agents
Framework for building real-time voice and multimodal AI agents over WebRTC.
LiveKit Agents added Answering Machine Detection — voice agents are becoming a serious telephony runtime.
◆Recent moves
- 12h ago
Agents 1.5.12: deprecate mcp_servers; Perplexity Responses; AvatarMetrics
1.5.12 deprecates mcp_servers on Agent and AgentSession, adds Perplexity Responses LLM, gpt-realtime-whisper STT, multi-message generation in realtime, UserTurnLimitOptions for cutting off long user speech, and AvatarMetrics for join/playback latency. Provider breadth plus API-surface cleanup — fits the contact-center hardening arc.
View source ↗ - 1d ago
Agents 1.5.11: routine version bump
1.5.11 is a bot-cut version bump with no published release notes. Routine release in the high-cadence 1.5.x line.
View source ↗ - 3d ago
Agents 1.5.10: live LLM swaps and Speechmatics STT in inference
1.5.10 adds inference.LLM.update_options for live model swaps, Speechmatics STT to inference, Rime Coda model, and shutdown-callback handling. The live-swap mechanism is the standout — it lets running sessions change model without tearing down the agent.
View source ↗ - 7d ago
Agents 1.5.9: Answering Machine Detection for outbound calls
⚡ SPARK1.5.9 introduces Answering Machine Detection — the feature that turns LiveKit Agents into a credible outbound-calling platform, not just a realtime meeting agent. Bundled with WarmTransferTask additions, Perplexity LLM, and Rime WebSocket TTS.
View source ↗ - 15d ago
Agents 1.5.8: barge-in cooldown and observability retry
1.5.8 adds barge-in cooldown for corrections, switches Fish Audio TTS to WebSocket for faster inference, retries session-recording upload, and adds Soniox TTS. The barge-in cooldown is the meaningful one — handling the case where a user corrects themselves without re-triggering the interruption pipeline.
View source ↗ - 20d ago
Agents 1.5.7: dynamic endpointing, playback metric, Runway and Cerebras plugins
1.5.7 is a heavy-throughput release: dynamic endpointing alpha param, playback_latency metric, Runway Characters avatar plugin, Cerebras LLM plugin, expanded MCP header updates, double-encoded JSON tool-arg handling. Lots of provider plumbing rather than a single headline, consistent with the breadth-over-depth phase.
View source ↗