Adversarial examples

/a.dver'seh.ree.uhl ih'gza.mpuhlz/Test cases designed to break the system (prompt injection, tricky edge cases, ambiguous inputs) to reveal failure modes. They often complement "happy path" test sets. (noun)

“We added adversarial examples to make sure the agent resists prompt injection.”

Related Datasets terms

Coverage

•

Dataset

•

Dataset record

•

Edge case

•

Expected output (ground truth)

•

flush()

•

Golden dataset

•

Input

•

Metadata

•

Multimodal dataset

From the docs

Build datasets

•

Create experiments

•

Evaluate systematically

•

Glossary

Get started with Evals

Braintrust is the AI observability and eval platform for production AI. By connecting evals and observability in one workflow, teams at Notion, Stripe, Zapier, Vercel, and Ramp ship quality AI products at scale.