How do I get a Cohere API Key?

Log in to the [**Cohere Dashboard**](https://dashboard.cohere.com/api-keys), go to **API Keys** and click **Create API Key**. Copy the key immediately — it starts with a random string and won't be shown again. Free tier includes trial access with rate limits.

What models are available?

Use the `list_models` tool to see all available Cohere models. Key models include command-r-plus (most capable, 128K context), command-r (efficient, 128K context), command-r7b (lightweight, 128K context), embed-v4 (embeddings) and rerank-v3.5 (reranking).

Can I send multi-turn conversations?

Yes! Pass a messages array with alternating 'user', 'assistant' and 'system' roles. Each message has a 'role' and 'content' field. Command models support function calling and will return tool_calls when appropriate.

What is reranking and when should I use it?

Reranking reorders a set of documents by their relevance to a query. Use it after an initial search to improve result quality. The rerank tool takes a query, list of documents and returns them ranked by relevance score. Cohere's rerank models are industry-leading for search applications.

Cohere MCP Connector for Claude

A+

Access Cohere AI models via API — chat with Command models, generate embeddings, rerank documents and tokenize text from any AI agent.

6 tools Official Updated Jun 28, 2026 Official Vinkius Partner

More Details Connect to Claude

Connect your Cohere account to any AI agent and leverage enterprise-grade AI models through natural conversation.

What you can do

Model Discovery — List all available Cohere models with their names, capabilities and context lengths
Chat API — Send conversations to Command models (command-r-plus, command-r, command-r7b) and receive responses with citations and tool call support
Embeddings — Generate vector embeddings for semantic search with multiple embedding types (float, int8, uint8, binary)
Reranking — Rerank documents by relevance to a search query using Cohere's industry-leading reranking models
Tokenization — Tokenize and detokenize text for estimating token counts and debugging

How it works

Subscribe to this server
Enter your Cohere API Key
Start using Cohere models from Claude, Cursor, or any MCP-compatible client

No more switching between API tools to interact with Cohere. Your AI acts as an LLM orchestration layer.

Who is this for?

Developers — quickly send messages to Command models, generate embeddings and rerank search results without writing HTTP code
ML Engineers — discover available models, compare capabilities and generate embeddings with multiple types (float, int8, binary)
Search Teams — rerank documents by relevance, tokenize text and generate embeddings for search index building

llmembeddingsrerankingnatural-language-processingtokenizationchat-api

Related Connectors

Commerce.js MCP

10 tools Official

Manage your e-commerce store via Commerce.js — list products, manage carts, and handle orders directly from any AI agent.

A+ View details →

T-Test Statistics Engine MCP

1 tools Official

Run exact Student's, Welch's, and Paired t-tests local. Get CPU-guaranteed p-values instead of LLM-hallucinated guesses.

A+ View details →

FreeAgent MCP

12 tools Official

Manage accounting, track invoices, and oversee bank transactions via AI agents with FreeAgent.

A+ View details →

PaperQuotes MCP

4 tools Official

Access a vast library of quotes, search by author or tags, and get the quote of the day directly in your AI agent.

F View details →