Projects organize AI features in your application. Each project contains logs, experiments, datasets, prompts, and other functions. Configure project-specific settings to customize behavior for your use case.

Create a project

  1. Navigate to your organization’s project list
  2. Click + Project
  3. Enter a project name
  4. Optionally add a description
  5. Click Create
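Projects can also be created from the SDK: initializing a logger or experiment with a new project name creates the project automatically. A minimal sketch, assuming the Python SDK and that BRAINTRUST_API_KEY is set in your environment; the project name is a placeholder.

```python
# Minimal sketch: logging to a project name that doesn't exist yet creates it.
# Assumes `pip install braintrust` and BRAINTRUST_API_KEY in the environment.
import braintrust

logger = braintrust.init_logger(project="my-new-project")  # project is created on first use
logger.log(input="Hello", output="Hi there!")
```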

Configure AI providers

Project-level AI provider keys override organization-level keys. Use project-level keys when:
  • Different projects need separate billing or rate limits
  • You want to isolate API usage by project
  • Projects require different provider accounts or credentials
To configure project-level AI providers:
  1. Navigate to your project.
  2. Go to Configuration.
  3. Under Project, select Project AI providers.
  4. Click the provider you want to configure.
  5. Enter your API key for that provider.
  6. Click Save.
Project-level providers are available in a project’s playgrounds and experiments, and when using the AI Proxy with the project’s context.
Secrets are encrypted at rest using transparent data encryption with a unique 256-bit key and nonce.
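Once a provider key is stored, requests routed through the AI Proxy can authenticate with your Braintrust API key, and the proxy resolves the stored provider secret. A minimal sketch, assuming the Python OpenAI client and the proxy endpoint https://api.braintrust.dev/v1/proxy; the model name is a placeholder.

```python
# Minimal sketch of calling a model through the Braintrust AI Proxy so that
# provider keys stored in Braintrust are used instead of a local key.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.braintrust.dev/v1/proxy",
    api_key="<your Braintrust API key>",  # the proxy resolves the configured provider secret
)

response = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model name
    messages=[{"role": "user", "content": "Say hello"}],
)
print(response.choices[0].message.content)
```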

Add custom providers

Braintrust supports custom AI providers at both the organization and project level. See Custom providers for details on configuring custom endpoints.

Add tags

Tags help organize and filter logs, datasets, and experiments:
  1. Go to Settings.
  2. Under Project, select Tags.
  3. Click Add tag.
  4. Enter tag details:
    • Name: Tag identifier.
    • Color: Visual indicator.
    • Description: Optional explanation.
  5. Click Save.
Use tags to track data by user type, feature, environment, or any custom category. Filter by tags in logs, experiments, and datasets. For more information about using tags, see View logs.
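Tags defined here can also be attached programmatically when logging. A minimal sketch, assuming the Python SDK and a tag named production already configured in the project; the input and output values are placeholders.

```python
# Minimal sketch: attach a project tag to a log so it can be filtered later.
# Assumes the SDK's tags parameter is available in your SDK version.
import braintrust

logger = braintrust.init_logger(project="my-project")
logger.log(
    input={"question": "What is the return policy?"},
    output={"answer": "Returns are accepted within 30 days."},
    tags=["production"],  # must match a tag defined in project settings to filter by it in the UI
)
```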

Configure human review

Define scores for manual review by users or your team:
  1. Go to Settings.
  2. Under Project, select Human review.
  3. Click + Human review score.
  4. Enter a name and description for your score. Descriptions support Markdown.
  5. Select a score type:
    • Categorical score: Predefined options with assigned scores. Each option gets a unique percentage value between 0% and 100% (stored as 0 to 1). Use for classification tasks like sentiment or correctness categories. Also supports writing to the expected field instead of creating a score.
    • Continuous score: Numeric values between 0% and 100% with a slider input control. Use for subjective quality assessments like helpfulness or tone.
    • Free-form input: String values written to the metadata field at a specified path. Use for explanations, corrections, or structured feedback.
  6. Click Save.
Review scores appear in all logs and experiments in the project. Use them for quality control, data labeling, or feedback collection. For more information, see Collect human feedback.
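Human review scores can also be written programmatically, for example when importing labels from an external labeling tool. A minimal sketch, assuming the Python SDK's log_feedback method and a human review score named helpfulness; the span ID is a placeholder.

```python
# Minimal sketch: write a human review score to an existing logged row.
# Assumes a "helpfulness" human review score is defined in project settings.
import braintrust

logger = braintrust.init_logger(project="my-project")
logger.log_feedback(
    id="<span id of the logged row>",     # placeholder span ID
    scores={"helpfulness": 0.75},         # value between 0 and 1
    comment="Answer was correct but verbose",
)
```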

Create aggregate scores

Combine multiple scores into a single metric:
  1. Go to Settings.
  2. Under Project, select Aggregate scores.
  3. Click Add aggregate score.
  4. Define the aggregation:
    • Name: Score identifier.
    • Type: Weighted average, minimum, or maximum.
    • Selected scores: Scores to aggregate.
    • Weights: For weighted averages, set score weights.
    • Description: Optional explanation.
  5. Click Save.
Aggregate scores appear in experiment summaries and comparisons. Use them to create composite quality metrics or overall performance indicators. For more information, see Interpret evaluation results.
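For reference, a weighted average aggregate is the sum of each selected score multiplied by its weight, divided by the total weight. The sketch below is illustrative only; the score names and weights are hypothetical.

```python
# Illustrative only: how a weighted-average aggregate score is computed.
scores = {"factuality": 0.9, "tone": 0.6}
weights = {"factuality": 2.0, "tone": 1.0}

aggregate = sum(weights[k] * scores[k] for k in scores) / sum(weights.values())
print(round(aggregate, 3))  # 0.8
```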

Set up online scoring

Define project-level scoring rules that automatically evaluate production logs as they arrive. You can create and manage rules here or directly from an individual scorer's configuration.
  1. Go to Settings.
  2. Under Project, select Online scoring.
  3. Click Add rule.
  4. Configure the rule:
    • Name: Rule identifier.
    • Scorers: Select which scorers to run.
    • Sampling rate: Percentage of logs to evaluate (1-100%).
    • Filter: Optional SQL query to select specific logs.
    • Span type: Apply to root spans or all spans.
  5. Click Save.
Online scoring rules run asynchronously in the background. View results in the logs page alongside other scores. For more information, see Create scoring rules.
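The sampling rate controls what fraction of incoming logs are selected for scoring. The sketch below is illustrative only and is not how Braintrust implements sampling; it just shows that a 10% rate means roughly one in ten logs is evaluated.

```python
# Illustrative only: what a 10% sampling rate means conceptually.
import random

def should_score(sampling_rate: float) -> bool:
    """Return True for roughly sampling_rate fraction of incoming logs."""
    return random.random() < sampling_rate

scored = sum(should_score(0.10) for _ in range(10_000))
print(scored)  # roughly 1,000 of 10,000 logs selected for scoring
```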

Configure span iframes

Customize how specific span fields render in the UI:
  1. Go to Settings.
  2. Under Project, select Span iframes.
  3. Click Add iframe.
  4. Configure rendering:
    • Field path: Which field to render (e.g., output.html).
    • iframe URL: Template for the iframe src attribute.
  5. Click Save.
Use span iframes to render HTML, charts, or custom visualizations directly in trace views. For more information, see Extend traces.

Set comparison key

Customize how experiments match test cases:
  1. Go to Settings.
  2. Under Project, select Advanced.
  3. Enter a SQL expression (default: input).
  4. Click Save.
Examples:
  • input.question - Match by question field only.
  • input.user_id - Match by user.
  • [input.query, metadata.category] - Match by multiple fields.
The comparison key determines which test cases are considered the same across experiments. For more information, see Compare experiments.
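Conceptually, two rows from different experiments are treated as the same test case when their comparison-key values are equal. The sketch below uses hypothetical data to show how a key of input.question pairs rows even when other input fields differ.

```python
# Illustrative only: pairing rows from two experiments by a comparison key.
exp_a = [{"input": {"question": "Q1", "user_id": "u1"}, "score": 0.7}]
exp_b = [{"input": {"question": "Q1", "user_id": "u2"}, "score": 0.9}]

key = lambda row: row["input"]["question"]  # comparison key: input.question
pairs = [(a, b) for a in exp_a for b in exp_b if key(a) == key(b)]
print(len(pairs))  # 1 -- the rows match even though user_id differs
```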

Edit project details

Update project name and description:
  1. Navigate to your project.
  2. Click Edit project in the top-right.
  3. Modify name and description.
  4. Click Save.

Delete a project

Deleting a project permanently removes all logs, experiments, datasets, and functions. This cannot be undone.
  1. Navigate to Configuration.
  2. Scroll to the bottom of the page.
  3. Click Delete project.
  4. Confirm by typing the project name.
  5. Click Delete.

Next steps