Agent Evaluation Metrics
Metrics tailored for evaluating agent-level behavior, including task completion, tool selection accuracy, and multi-step reasoning quality.This page is under construction. The agent metrics catalog is coming soon.
Topics to be covered
- Task completion rate
- Tool selection accuracy
- Step efficiency (optimal vs actual step count)
- Reasoning quality metrics
- Cost and latency metrics
- Custom agent metrics
