Cartesia (Voice AI) MCP Connector for Claude
A+Generate lifelike AI voices, clone speech, and transcribe audio with Cartesia's state-of-the-art Sonic models directly from your AI agent.
Connect Cartesia to your AI agent to unlock high-performance voice synthesis and speech recognition. Cartesia's Sonic models provide industry-leading latency and quality for real-time applications.
What you can do
- Text-to-Speech (TTS) — Generate high-fidelity audio bytes or stream via SSE using models like Sonic 3.5 and Sonic 3.
- Speech-to-Text (STT) — Transcribe audio files into text using the Ink Whisper model with multi-language support.
- Voice Cloning — Create custom voice models from as little as 5 seconds of audio input.
- Voice Management — List, retrieve, and update voices, or use the Voice Changer to transform existing audio.
- Pronunciation Control — Manage custom pronunciation dictionaries for specialized terminology or accents.
- Agent Orchestration — List and manage AI agents and monitor call logs and usage credits.
How it works
- Subscribe to this server
- Enter your Cartesia API Key
- Start generating audio or transcribing speech from Claude, Cursor, or any MCP-compatible client
Who is this for?
- Developers — integrate real-time voice synthesis into applications without managing complex infrastructure.
- Content Creators — automate voiceovers and audio localization using high-quality cloned voices.
- Product Teams — build conversational AI agents that sound human and respond with sub-second latency.
Related Connectors
Fathom MCP
Privacy-first website analytics — track visitors, monitor real-time traffic, and manage sites and events directly from your AI agent.
Exa MCP
Find exactly the web content you need with semantic search that understands context and returns high-quality curated results.
Pixabay MCP
Search and retrieve royalty-free stock images, vectors, illustrations, and videos via AI directly from Pixabay.
AlisQI MCP
Quality management orchestration — manage analysis sets, results, and QMS data via AI.