Deepgram

Deepgram MCP Connector for Claude

A+

Power audio AI via Deepgram — perform high-speed speech-to-text, generate lifelike text-to-speech, track usage, and manage API keys directly from any AI agent.

10 tools Official Updated Jun 28, 2026 Official Vinkius Partner

Connect your Deepgram account to any AI agent and take full control of your speech-to-text (STT) and text-to-speech (TTS) workflows through natural conversation.

What you can do

  • Speech-to-Text (STT) — Dispatch automated transcription requests for remote audio URLs using the lightning-fast Nova-2 model to consume explicit WAV/MP3 web streams
  • Text-to-Speech (TTS) — Generate high-fidelity audio from raw text using Aura voices, outputting the exact binary stream footprint natively from your chat
  • Usage Monitoring — Analyze specific global bounds hitting /usage to map literally terabytes of exact API transcription times and TTS byte usage
  • Project & Key Management — List and create ephemeral Deepgram access boundaries (API keys) and isolate organizational tenants where Audio AI billing is enforced
  • Wallet Oversight — Retrieve explicit cloud logging tracing explicit Vault limits and verify direct wallet thresholds to ensure pipelines never drop
  • Identity & Invites — Manage developer limits by listing members and sending team invites to specific project UUIDs strictly

How it works

  1. Subscribe to this server
  2. Enter your Deepgram API Key (found in the Deepgram Console under Settings > API Keys)
  3. Start managing your audio AI workflows from Claude, Cursor, or any MCP-compatible client

Who is this for?

  • AI Developers — test STT/TTS models and manage API keys without leaving the development environment
  • Product Teams — monitor audio AI usage and verify transcription accuracy in real-time
  • Data Engineers — audit transcription volumes and manage project-wide audio pipelines using natural language
  • Ops Teams — track wallet balances and manage team access across multiple Deepgram projects
speech-to-texttext-to-speechtranscriptionvoice-ainatural-language-processingaudio-processing

10 tools expose this connector's capabilities to your AI agent.

send_invite

Invite a team member to a project

list_keys

List API keys for a project

get_balances

Get credit balances for a project

create_key

Payload should include comment, scopes, and optional expiration. Create a new API key for a project

delete_key

Revoke and delete an API key

list_members

List team members in a project

list_projects

List all Deepgram projects in your account

transcribe_url

Supports WAV, MP3, FLAC, and other formats. Transcribe audio from a URL using Deepgram Nova-2

speak_text

Returns metadata about the generated audio. Convert text to speech using Deepgram Aura voices

get_usage

Use query_string for date filters (e.g. "start=2026-01-01&end=2026-06-01"). Get API usage statistics for a project

See how to talk to your AI agent using Deepgram.

Transcribe this audio: https://example.com/recording.mp3 using nova-2

Transcription complete! Using the Nova-2 model, I've processed the audio stream. Here is the text: 'Welcome to the deep dive into AI orchestration...'. I can also provide the confidence scores for each word.

Generate speech for: 'The future of AI is agentic' using aura-asteria-en

Audio stream generated! I've synthesized the text using the Aura Asteria voice. You can download or play the resulting MP3 binary data. The sound is high-fidelity and naturally paced.

Show me my Deepgram usage for this month

Retrieving usage data... You have transcribed 450 minutes of audio and generated 12MB of TTS content this month. Your project limits are well within the current thresholds.

Yes. Use the 'transcribe_url' tool. Provide the public URL of the audio file (WAV, MP3, etc.) and specify the model (e.g., 'nova-2'). The agent will dispatch the request to Deepgram and return the transcribed text instantly.

Related Connectors