Trace Evaluation Metrics
Metrics provide quantitative measures of trace quality across dimensions like accuracy, relevance, and faithfulness.This page is under construction. The metrics catalog and custom metric documentation are coming soon.
Topics to be covered
- Built-in metric catalog (accuracy, relevance, faithfulness, toxicity, etc.)
- Custom metric definitions
- LLM-as-judge metrics
- Deterministic metrics (exact match, BLEU, ROUGE, etc.)
- Metric configuration and thresholds
