LiveKit Agents
Open-source framework for building real-time voice and multimodal AI agents over WebRTC
LiveKit Agents is a Python and Node.js framework for building real-time voice AI agents that join calls as participants. It handles audio streaming, turn detection, and interruption handling over WebRTC, and wires speech-to-text (STT), large language model (LLM), and text-to-speech (TTS) providers together into conversational agents. Integrations include the OpenAI Realtime API, Deepgram, Cartesia, ElevenLabs, Google Gemini Live, and others. The framework supports tool use, multi-agent handoffs, vision input, and SIP telephony. It powers ChatGPT's Advanced Voice Mode and runs on LiveKit's open-source WebRTC server or LiveKit Cloud.
Pricing: Usage-based
LiveKit Agents Alternatives
Explore 17 products in the Audio category.
ElevenLabs
Natural Text to Speech & AI Voice Generator.
LemonFox
Affordable speech-to-text and text-to-speech API with 100+ language support
OpenAI
API access to GPT, o-series reasoning, DALL-E, and Whisper models
Speechmatics
Enterprise speech-to-text API supporting 55+ languages with high accuracy