Trace Evaluation
Trace evaluation lets you assess the quality of individual LLM interactions captured by PandaProbe tracing. Compare trace outputs against expected results, run automated quality checks, and track metrics over time.This page is under construction. Detailed trace evaluation documentation is coming soon.
Topics to be covered
- Setting up trace evaluation datasets
- Defining evaluation criteria
- Running evaluations against historical traces
- Interpreting evaluation results
- Connecting evaluation scores to traces
