Skip to main content

Braintrust home page

Request demo
Dashboard
Dashboard

Cookbook

Recipes

Building a deep research agent with Temporal
Evaluating and iterating on AI apps with Lovable
Using Loop in AI product development
Observability for Strands Agents on Amazon Bedrock
Building reliable AI agents
Using PDF attachments in playgrounds
Tracing Vercel AI SDK applications
Evaluating video QA with Twelve Labs
Evaluating a web agent
Prompt versioning and deployment
Evaluating video QA
Evaluating a voice agent
Classifying spam using structured outputs
Evaluating a prompt chaining agent
Evaluating the precision and recall of an emotion classifier
Evaluating audio with the OpenAI Realtime API
Evaluating SimpleQA
Evaluating voice AI agents with Evalion
Using Python functions to extract text from images
Using OpenTelemetry for LLM observability
Using functions to build a RAG agent
Evaluating multimodal receipt extraction
Unreleased AI: A full stack Next.js app for generating changelogs
An agent that runs OpenAPI commands
Benchmarking inference providers
Tool calls in LLaMa 3.1
Evaluating a chat assistant
LLM Eval For Text2SQL
Optimizing Ragas to evaluate a RAG pipeline
Comparing evals across multiple AI models
Detecting Prompt Injections
AI Search Bar
How Zapier uses assertions to evaluate tool usage in chatbots
Generating release notes and hill-climbing to improve them
Generating beautiful HTML components
Coda's Help Desk with and without RAG
Improving Github issue titles using their contents
Classifying news articles
Text-to-SQL

404

Page Not Found

We couldn't find the page. Maybe you were looking for one of these pages below?

Cookbook OpenAI Improving Github issue titles using their contents

⌘I