Encyclopedia Evalica / Evaluation / Score distribution

Score distribution

/skawr dih.struh'byoo.shuhn/The statistical spread of scores across a set of examples. The shape of the distribution often matters more than the mean. (noun)

The mean score stayed flat, but the score distribution shifted with more low-end failures.

Related Evaluation terms

From the docs

Get started with Evals

Braintrust is the AI observability and eval platform for production AI. By connecting evals and observability in one workflow, teams at Notion, Stripe, Zapier, Vercel, and Ramp ship quality AI products at scale.

Start building