Encyclopedia Evalica / Evaluation / Failure mode

Failure mode illustration

Failure mode

/'fay.lyer mohd/A specific, recurring pattern of incorrect or undesired output. Identifying failure modes is the starting point for targeted improvement. (noun)

The main failure mode is confidently answering without citing the retrieved docs.

Related Evaluation terms

From the docs

Get started with Evals

Braintrust is the AI observability and eval platform for production AI. By connecting evals and observability in one workflow, teams at Notion, Stripe, Zapier, Vercel, and Ramp ship quality AI products at scale.

Start building