Encyclopedia Evalica / Evaluation / Safety

Safety

/'sayf.tee/A scoring dimension that measures the absence of toxicity, bias, or policy violations in a model output. Safety scoring is often used as a release criterion. (noun)

Our safety scorer flags responses that include harassment or other aggressive language.

Related Evaluation terms

From the docs

Get started with Evals

Braintrust is the AI observability and eval platform for production AI. By connecting evals and observability in one workflow, teams at Notion, Stripe, Zapier, Vercel, and Ramp ship quality AI products at scale.

Start building