Skip to main content

Trace Evaluation Metrics

Metrics provide quantitative measures of trace quality across dimensions like accuracy, relevance, and faithfulness.
This page is under construction. The metrics catalog and custom metric documentation are coming soon.

Topics to be covered

  • Built-in metric catalog (accuracy, relevance, faithfulness, toxicity, etc.)
  • Custom metric definitions
  • LLM-as-judge metrics
  • Deterministic metrics (exact match, BLEU, ROUGE, etc.)
  • Metric configuration and thresholds