Monitor AI and catch issues before users
Trace every agent, score live traffic, and get alerted when AI model performance drops.
Free to start · No credit card · 5-min setup

Trusted by AI teams at
See traces in minutes
One decorator. Automatic tracing. Quality scores on live traffic.
from braintrust import traced @traced async def my_agent(query): docs = await retrieve(query) return await llm(query, docs)
from braintrust import current_span
from autoevals import Factuality
score = Factuality()(
output=result, input=query
)
current_span().log(
scores={"factuality": score.score}
)- name: Quality alert condition: factuality < 0.80 notify: slack:#ai-alerts action: add_to_dataset
What changes when Braintrust is part of your workflow
10x
Faster issue resolution
<10 min
Eval turnaround
25%
Accuracy improvement
45x
More feedback
Notion, Dropbox, Zapier, and Coursera use Braintrust to ship better AI. Get started free →
Works how your team works
Engineers instrument in code. Everyone watches quality, latency, and cost in one dashboard. Problems surface before users report them.
For engineers
from braintrust import traced, current_span
from autoevals import Factuality
@traced
async def my_agent(query):
docs = await retrieve(query)
result = await llm(query, docs)
score = Factuality()(output=result, input=query)
current_span().log(
scores={"factuality": score.score}
)
return resultOne decorator to trace. Score any span inline with autoevals. Works with any AI provider or framework.
For SMEs & PMs

Quality scores, latency, and cost in one view. Set thresholds and get notified in Slack.
Built for AI monitoring from the start
Real-time AI performance metrics
Quality scores, latency percentiles, token usage, and cost tracked per request. Spot regressions the moment they appear.
AI agent monitoring
Every tool call, retrieval step, and LLM call in a multi-step agent is captured as a span. See the full decision path for any request and know exactly where it went wrong.
Production traces become eval datasets
Tag a failing trace and see it land in your dataset. The same data you monitor in production becomes the test suite you run in CI.
What our customers say
“There are some problems we wouldn't know were problems without Braintrust.”
Sarah Sachs, AI Lead at Notion

Stop shipping on vibes
See your first trace in minutes
Free to start · No credit card required
Get started free