Agent Evaluation Metrics

Metrics tailored for evaluating agent-level behavior, including task completion, tool selection accuracy, and multi-step reasoning quality.
This page is under construction. The agent metrics catalog is coming soon.

Topics to be covered

  • Task completion rate
  • Tool selection accuracy
  • Step efficiency (optimal vs actual step count)
  • Reasoning quality metrics
  • Cost and latency metrics
  • Custom agent metrics
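
As a rough illustration of the first three topics, the sketch below computes a task completion rate, a tool selection accuracy, and a step efficiency score from recorded agent runs. The catalog's own definitions are not published yet, so everything here is an assumption: the `EpisodeRecord` type, its field names, the position-by-position tool comparison, and the choice to cap step efficiency at 1.0 are all hypothetical and may differ from what the final catalog specifies.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class EpisodeRecord:
    """One recorded agent run. Hypothetical structure, for illustration only."""
    completed: bool              # whether the agent finished the task
    expected_tools: List[str]    # reference tool for each step
    chosen_tools: List[str]      # tool the agent actually called at each step
    optimal_steps: int           # fewest steps known to solve the task
    actual_steps: int            # steps the agent actually took


def task_completion_rate(episodes: List[EpisodeRecord]) -> float:
    """Fraction of episodes in which the task was completed."""
    if not episodes:
        return 0.0
    return sum(e.completed for e in episodes) / len(episodes)


def tool_selection_accuracy(episodes: List[EpisodeRecord]) -> float:
    """Fraction of reference steps where the chosen tool matches the expected one.

    Assumption: tools are compared position by position, and a reference
    step the agent never reached counts as a miss.
    """
    total = sum(len(e.expected_tools) for e in episodes)
    if total == 0:
        return 0.0
    correct = sum(
        expected == chosen
        for e in episodes
        for expected, chosen in zip(e.expected_tools, e.chosen_tools)
    )
    return correct / total


def step_efficiency(episode: EpisodeRecord) -> float:
    """Ratio of optimal to actual step count, capped at 1.0 (assumed definition)."""
    if episode.actual_steps <= 0:
        return 0.0
    return min(1.0, episode.optimal_steps / episode.actual_steps)


# Two made-up episodes: one efficient success, one failed run with extra steps.
episodes = [
    EpisodeRecord(True, ["search", "calc"], ["search", "calc"], 2, 2),
    EpisodeRecord(False, ["search", "calc"], ["search", "search", "calc"], 2, 4),
]

print("completion rate:", task_completion_rate(episodes))
print("tool accuracy:", tool_selection_accuracy(episodes))
print("step efficiency:", [step_efficiency(e) for e in episodes])
```

Position-wise matching is the simplest option but penalizes an agent that inserts one extra call early in a run; an alignment-based or set-based comparison may be a better fit depending on how strictly tool order matters for the task.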