Can my agent transcribe an audio file from a public URL?

Yes. Use the 'transcribe_url' tool. Provide the public URL of the audio file (WAV, MP3, etc.) and specify the model (e.g., 'nova-2'). The agent will dispatch the request to Deepgram and return the transcribed text instantly.

How do I generate speech from text using the agent?

Use the 'speak_text' tool. Provide the text script and the target voice model (e.g., 'aura-asteria-en'). Your agent will trigger the high-fidelity Aura voice engine and return the binary audio stream data.

Can I monitor my remaining project balance via chat?

Absolutely. Use the 'get_balances' tool with your project ID. The agent will retrieve your current wallet thresholds and funding limits directly from Deepgram to ensure your audio pipelines stay active.

Deepgram MCP Connector for Claude

A+

Power audio AI via Deepgram — perform high-speed speech-to-text, generate lifelike text-to-speech, track usage, and manage API keys directly from any AI agent.

10 tools Official Updated Jun 28, 2026 Official Vinkius Partner

More Details Connect to Claude

Connect your Deepgram account to any AI agent and take full control of your speech-to-text (STT) and text-to-speech (TTS) workflows through natural conversation.

What you can do

Speech-to-Text (STT) — Dispatch automated transcription requests for remote audio URLs using the lightning-fast Nova-2 model to consume explicit WAV/MP3 web streams
Text-to-Speech (TTS) — Generate high-fidelity audio from raw text using Aura voices, outputting the exact binary stream footprint natively from your chat
Usage Monitoring — Analyze specific global bounds hitting /usage to map literally terabytes of exact API transcription times and TTS byte usage
Project & Key Management — List and create ephemeral Deepgram access boundaries (API keys) and isolate organizational tenants where Audio AI billing is enforced
Wallet Oversight — Retrieve explicit cloud logging tracing explicit Vault limits and verify direct wallet thresholds to ensure pipelines never drop
Identity & Invites — Manage developer limits by listing members and sending team invites to specific project UUIDs strictly

How it works

Subscribe to this server
Enter your Deepgram API Key (found in the Deepgram Console under Settings > API Keys)
Start managing your audio AI workflows from Claude, Cursor, or any MCP-compatible client

Who is this for?

AI Developers — test STT/TTS models and manage API keys without leaving the development environment
Product Teams — monitor audio AI usage and verify transcription accuracy in real-time
Data Engineers — audit transcription volumes and manage project-wide audio pipelines using natural language
Ops Teams — track wallet balances and manage team access across multiple Deepgram projects

speech-to-texttext-to-speechtranscriptionvoice-ainatural-language-processingaudio-processing

Related Connectors

Shopify MCP

23 tools Official

Manage your Shopify store via AI — list products, process orders, search customers, track inventory, and manage discounts from any agent.

A+ View details →

USDA FoodData Central MCP

2 tools Official

Access the gold standard in nutrition data — 300,000+ foods with scientific-grade nutrient profiles from the U.S. Department of Agriculture.

A+ View details →

HelloAsso MCP

11 tools Official

Automate association management via HelloAsso — manage payments, forms, and orders for French non-profits directly from any AI agent.

A+ View details →

tl;dv MCP

12 tools Official

Record, transcribe, and clip key moments from Google Meet and Zoom calls so your team never misses important meeting insights.

A+ View details →