Encyclopedia Evalica / Deployment / Streaming

Streaming illustration

Streaming

/'stree.mihng/A response delivery mode where output tokens are sent to the client incrementally as they are generated. Streaming improves perceived latency even when total completion time is unchanged. (noun)

Streaming made the assistant feel faster even though the full answer took several seconds.

Related Deployment terms

From the docs

Get started with Evals

Braintrust is the AI observability and eval platform for production AI. By connecting evals and observability in one workflow, teams at Notion, Stripe, Zapier, Vercel, and Ramp ship quality AI products at scale.

Start building