Create Batch Eval Run
Create an eval run for an explicit list of trace IDs.
Evaluates exactly the provided traces with all requested metrics. All metrics for all traces are processed in a single sequential Celery task — no race conditions on concurrent writes.
Auth: Bearer + X-Project-ID | X-API-Key + X-Project-Name
Rate limit: 50/min
Documentation Index
Fetch the complete documentation index at: https://docs.pandaprobe.com/llms.txt
Use this file to discover all available pages before exploring further.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
Create an eval run for an explicit list of trace IDs.
Use this when the user has manually selected specific traces in the dashboard rather than using filter-based selection.
List of trace UUIDs to evaluate. Duplicates are removed automatically.
1List of metric names to run on each trace. Example: ['task_completion', 'step_efficiency'].
1Optional human-readable label for this run.
LLM model string override for the judge. Null uses the system default.
Response
Successful Response
Full eval run representation used by both list and detail endpoints.
Lifecycle status of an evaluation job.
PENDING, RUNNING, COMPLETED, FAILED 
