AI that knows your data
Your AI tools should know about your Braintrust experiments. They should understand your log schemas, help debug failed evaluations, and answer questions about your model performance.
Today, we're introducing Braintrust's Model Context Protocol (MCP) server. MCP is Anthropic's open standard that lets AI tools securely access external data sources. We've built an MCP server that exposes your Braintrust data and works seamlessly with popular AI coding tools to provide better access and help you discover insights about your app's structure and performance.
The problem with context switching
Building and evaluating AI applications means constantly jumping between tools like your code editor, the Braintrust UI, various documentation, and more.
Without proper tooling, each context switch breaks your flow. The tools that should be helping you debug and improve your product might have no idea what experiments you're running in Braintrust or how they're performing. This means building custom evaluation infrastructure from scratch.
What you can do with MCP
With our MCP integration, your AI tools of choice become aware of your Braintrust projects, experiments, and data. You can now:
Query experiments naturally: "What were the accuracy scores for my recent sentiment analysis experiments?" Get instant answers from your data.
Debug failures in context: "Show me examples where my model failed on edge cases." See specific data points and understand what went wrong.
Get contextual documentation: "How do I create a custom scorer?" Find relevant examples based on your current project.
Compare model performance: "Compare GPT-5 vs Claude 4 Sonnet performance on my customer support dataset." Run analysis and get explanations automatically.
Supported AI assistants
We support the most popular AI coding tools:
- Cursor: One-click setup via our documentation
- Claude Code:
claude mcp add --transport http braintrust https://api.braintrust.dev/mcp
- VS Code: Works with GitHub Copilot
- Windsurf: Simple JSON configuration
Authentication is handled via OAuth 2.0 with your existing Braintrust account. If you're using SSO, it works with that too. For self-hosted instances, the MCP server runs within your environment, so your data never leaves your infrastructure.
The tools your AI gets
Once connected, your AI assistant gains access to the following tools:
- Documentation search: Semantic search across all Braintrust docs
- Object resolution: Convert between names and IDs for experiments, datasets, and projects
- Recent objects: Browse your latest experiments and datasets
- Schema analysis: Understand your data structure with
infer_schema
- BTQL queries: Execute complex analyses using Braintrust query language
- Experiment summaries: Get high-level performance overviews
- Permalink generation: Create shareable links to results
What this means for your workflow
With Braintrust's MCP, your AI assistant finally understands your context. It knows your projects and data without explanations, accesses experiment results directly, and runs performance comparisons automatically.
Get started with MCP and let your AI assistant join your evaluation workflow.