Encyclopedia Evalica / Evaluation / Playground

Playground illustration

Playground

/'play.grownd/An interactive environment for rapid prompt iteration, model comparison, and experimentation against live systems or datasets. It's designed to help teams test changes quickly before committing them. (noun)

In the playground, we tested three prompt variants against the same examples.

Related Evaluation terms

From the docs

Get started with Evals

Braintrust is the AI observability and eval platform for production AI. By connecting evals and observability in one workflow, teams at Notion, Stripe, Zapier, Vercel, and Ramp ship quality AI products at scale.

Start building