Encyclopedia Evalica / Datasets / Adversarial examples

Adversarial examples
/a.dver'seh.ree.uhl ih'gza.mpuhlz/Test cases designed to break the system (prompt injection, tricky edge cases, ambiguous inputs) to reveal failure modes. They often complement "happy path" test sets. (noun)
“We added adversarial examples to make sure the agent resists prompt injection.”
Related Datasets terms
From the docs
Get started with Evals
Braintrust is the AI observability and eval platform for production AI. By connecting evals and observability in one workflow, teams at Notion, Stripe, Zapier, Vercel, and Ramp ship quality AI products at scale.
Start building