Skip to main content
After instrumenting your application, Braintrust captures every request as traces that you can view, filter, and analyze in real time. This observability enables you to monitor production behavior, identify issues, and gather data for improving your application.

Why observe in Braintrust

Observability in Braintrust creates a feedback loop between production and evaluation. Logs use the same data structure as experiments, which means:
  • Instrumentation code works for both logging and evaluation
  • Traces capture identical data in production and testing
  • Scores and feedback apply to both logs and experiments
  • Production data seamlessly becomes evaluation datasets
This unified structure lets you iterate faster and maintain consistency across your development workflow.

View your logs

The Logs page displays all traces from your application in a searchable, filterable table. Each row represents a complete trace with its root span. Logging Screenshot You can:
  • Browse traces and individual spans
  • Group related traces by metadata or tags
  • Create custom columns to surface important values
  • Extract prompts to iterate in playgrounds
  • Apply tags to organize traces
Find specific traces using multiple approaches:
  • Filter menu: Quick filters and SQL queries for precise matching
  • Deep search: Semantic search to find traces by meaning, not just keywords
  • Loop: Ask natural language questions about your logs
  • API: Programmatic access for automation

Score online

Run scorers on production traces automatically or on-demand to evaluate quality in real time. Online scoring helps you:
  • Catch regressions immediately
  • Monitor quality metrics continuously
  • Identify patterns and edge cases
  • Trigger alerts when scores drop

Monitor with dashboards

Custom dashboards aggregate metrics across your logs and experiments. Track request counts, latency, token usage, costs, scores, and custom metrics over time. Dashboards help you:
  • Visualize trends and anomalies
  • Compare performance across time periods
  • Drill into specific data points
  • Share insights with your team

Use Loop

Loop is an AI-powered assistant that understands your data structure and helps you explore logs through natural language. Ask questions, identify patterns, and get insights without writing queries.

Next steps