What languages are supported for transcription?

Parakeel models support 50+ languages including English, Portuguese, Spanish, French, German, Mandarin, Japanese, and many more. Specify the language for best results.

Can I clone a specific voice?

Yes! Use the `clone_voice` tool with a reference audio sample (a few seconds is enough) and the text you want the cloned voice to speak.

What is speaker diarization?

Speaker diarization identifies 'who spoke when' in an audio recording. It segments the audio by speaker and returns timestamps for each speaker's turns.

What audio formats are supported?

The API supports WAV, MP3, FLAC, OGG, and most common audio formats. For best transcription accuracy, use high-quality WAV or FLAC files at 16kHz or higher sample rate.

NVIDIA Audio MCP Connector for Claude

A+

Transcribe speech, generate voices, translate audio, and clone voices via NVIDIA Audio APIs.

10 tools Official Updated Jun 28, 2026 Official Vinkius Partner

More Details Connect to Claude

Connect NVIDIA Audio to any AI agent and unlock professional-grade audio processing — transcribe speech to text, generate natural voices, translate audio across languages, perform speaker diarization, and clone voices through natural conversation.

What you can do

Speech-to-Text — Transcribe audio files with high accuracy using Parakeel models
Text-to-Speech — Convert text to natural-sounding speech
Audio Translation — Translate spoken audio directly to another language
Speaker Diarization — Identify and separate different speakers in audio
Voice Cloning — Clone a voice from a sample and generate new speech
Noise Cancellation — Remove background noise from recordings
Audio Classification — Classify audio as speech, music, noise, etc.
Punctuation Restoration — Add punctuation to raw speech-to-text output

How it works

Subscribe to this server
Enter your NVIDIA API Key (from build.nvidia.com)
Start processing audio from Claude, Cursor, or any MCP-compatible client

Who is this for?

Transcription Services — Automate meeting notes, podcast transcripts, and subtitles
Content Creators — Generate voiceovers and clone voices for multilingual content
Customer Support — Analyze call recordings with diarization and sentiment analysis

speech-to-texttext-to-speechaudio-processingspeaker-diarizationvoice-cloningtranscription

Related Connectors

PrecisionConvert Unit Engine MCP

2 tools Official

Universal unit conversion intelligence — transform physical values via AI.

A+ View details →

Product Hunt MCP

3 tools Official

Discover the best new products in tech daily — check the leaderboard, search for specific tools, and get detailed product insights via your AI agent.

A+ View details →

Glofox MCP

8 tools Official

Manage members, classes, trainers, bookings, and purchases for your Glofox-powered gym or fitness studio through natural conversation.

A+ View details →

Broadage Sports MCP

10 tools Official

Access real-time sports data via Broadage — track scores, matches, and lineups directly from any AI agent.

A+ View details →