Remote evaluation

/rih'moht ih.va.lyoo'ay.shuhn/ (noun)

Running evals in an environment or service separate from the main app, often asynchronously. Because scoring happens off the request path, remote evals can keep user-facing latency low while still measuring quality.

Remote evals let us score production traces without blocking the user.
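
As a rough illustration of the idea, the sketch below separates a user-facing request handler from a background worker that scores traces later. It is a minimal, in-process stand-in for a separate eval service, and it is not any platform's actual API: `handle_request`, `eval_worker`, `exact_match`, `trace_queue`, and `scores` are all hypothetical names, and the "model call" is a placeholder.

```python
import queue
import threading

# Hypothetical names throughout; this is not any particular platform's API.
trace_queue = queue.Queue()
scores = {}  # trace_id -> score, filled in by the background worker


def exact_match(output, expected):
    """Toy scorer: 1.0 if output matches expected (ignoring case/whitespace)."""
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0


def handle_request(trace_id, question, expected):
    """User-facing path: produce an answer, enqueue the trace, return at once."""
    answer = question.upper()  # placeholder for the real model call
    trace_queue.put((trace_id, answer, expected))
    return answer


def eval_worker():
    """Remote side: drain the queue and score traces off the request path."""
    while True:
        item = trace_queue.get()
        if item is None:  # sentinel value: stop the worker
            trace_queue.task_done()
            break
        trace_id, output, expected = item
        scores[trace_id] = exact_match(output, expected)
        trace_queue.task_done()


worker = threading.Thread(target=eval_worker, daemon=True)
worker.start()

handle_request("t1", "hello", "HELLO")
handle_request("t2", "hello", "goodbye")

trace_queue.join()     # demo only: wait until both traces have been scored
trace_queue.put(None)  # ask the worker to shut down
worker.join()
print(scores)
```

In a real deployment the queue would typically be replaced by a log or trace store, and the worker by a separate service, but the shape is the same: the handler returns without waiting for any score.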
