Encyclopedia Evalica / Datasets / Expected output (ground truth)

Expected output (ground truth)
/ih'kspeh.ktuhd 'ow.tpuut grownd trooth/The reference answer/behavior a dataset record defines as correct (when doing reference-based evals). Ground truth can be human-written, programmatically derived, or imported from production. (noun)
“The expected output includes the exact policy language and a citation.”
Related Datasets terms
From the docs
Get started with Evals
Braintrust is the AI observability and eval platform for production AI. By connecting evals and observability in one workflow, teams at Notion, Stripe, Zapier, Vercel, and Ramp ship quality AI products at scale.
Start building