Can I check my total AI token costs through my agent?

Yes. Use the `query_llm_costs` tool. Your agent will execute a NRQL aggregation summing the `tokenSpanCost` property from your LLM events over the last 24 hours, faceted by model, to provide a clear financial breakdown.

How do I monitor the p95 latency of my LLM generations?

The `query_llm_latency` tool retrieves the average duration and latency matrices for your AI providers. Your agent will report the results as a timesheet or summary, helping you identify performance bottlenecks instantly.

Can my agent run custom NRQL queries against my telemetry data?

Absolutely. Use the `custom_nrql` tool to provide any valid read-only NRQL string. Your agent will query New Relic's NerdGraph API and return the resulting dataset, allowing for complete flexibility in how you analyze your AI operations.

New Relic AI (LLM Observability) MCP Connector for Claude

A+

Monitor and audit LLM telemetry via New Relic AI — track token costs, p95 latency, and user feedback.

10 tools Official Updated Jun 28, 2026 Official Vinkius Partner

More Details Connect to Claude

Connect your New Relic AI account to any AI agent and take full control of your LLM observability, token cost tracking, and performance analytics through natural conversation.

What you can do

LLM Telemetry Audit — Retrieve detailed LLM chat completion messages and prompt inputs directly from your agent to understand literal model behavior in real-time
Token Cost Tracking — Execute structural extraction of model costs to calculate exact USD token consumption across your entire AI infrastructure securely
Performance Monitoring — Extract p95 latency matrices and average response times to ensure your LLM text generation remains performant and sub-second
User Feedback Loop — Retrieve chronological feedback messages and 1-5 rating scores dumped by human supervisors to identify quality regressions natively
Custom NRQL Execution — Run sophisticated read-only queries using the New Relic Query Language (NRQL) to extract rich insights from multi-tenant AI datasets instantly
Custom Event Injection — Post atomic generic telemetry rows to track internal agent states and custom behavioral markers across your observability pipeline
Resource Discovery — Enumerate active APM apps, dashboards, and alert policies to audit your AI environment's structural health and PagerDuty configurations

How it works

Subscribe to this server
Enter your New Relic API Key and Account ID
Start monitoring your AI stack from Claude, Cursor, or any MCP-compatible client

Who is this for?

AI Engineers — monitor LLM prompt performance and verify model accuracy through natural conversation without manual dashboard navigation
Observability Leads — track global AI token costs and p95 latency benchmarks directly from your workspace to optimize infrastructure spend
DevOps Teams — audit APM app health and verify alert policy triggers across multiple AI environments efficiently

llm-monitoringtoken-cost-trackingperformance-analyticsai-observabilitylatency-tracking

Connect to Claude

Subscribe on Vinkius, then add this connector to Claude.ai or Claude Code.

① Claude.ai (web app)

Go to Settings → Connectors → Add custom connector
Paste the MCP endpoint URL below

https://edge.vinkius.com/vk_preview_oqdaAroeFoXBv9yPI4WsHZZZZZuzqhwVoSn1YCyq/mcp

② Claude Code (terminal)

claude mcp add --transport http new-relic-ai-llm-observability https://edge.vinkius.com/vk_preview_oqdaAroeFoXBv9yPI4WsHZZZZZuzqhwVoSn1YCyq/mcp

③ Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "new-relic-ai-llm-observability": {
      "url": "https://edge.vinkius.com/vk_preview_oqdaAroeFoXBv9yPI4WsHZZZZZuzqhwVoSn1YCyq/mcp"
    }
  }
}

Get full access on Vinkius

The preview token above works for testing. Powered by Vinkius.

Details

Tools: 10
Grade: A+
Score: 100/100
Updated: Jun 28, 2026

Related Connectors

Swan MCP

9 tools Official

Empowers algorithmic control over European Bank Accounts. Execute SEPA transfers and manage Virtual Corporate Cards programmatically.

A+ View details →

Liftoff MCP

7 tools Official

Access mobile advertising performance reports and metadata via the Liftoff REST API.

A+ View details →

CoderPad MCP

8 tools Official

Manage technical interviews and assessments via CoderPad — create pads, track interview events, and audit the question bank directly from any AI agent.

F View details →

DingConnect MCP

10 tools Official

Equip your AI agent to manage mobile top-ups, track operators, and monitor account balance via the DingConnect API.

A+ View details →