Eval
synth-ai eval
Execute evaluation rollouts against a task app.
This is the main CLI entry point for the synth-ai eval command.
Execution Modes:
- Direct Mode: If
--backendis not provided, calls task app directly - Backend Mode: If
--backendis provided, creates eval job on backend
synth_ai.cli.commands.eval.runner.run_eval(): Execution logicsynth_ai.cli.commands.eval.config.resolve_eval_config(): Config resolution
Execution Modes
- Direct Mode: If
--backendis not provided, calls task app directly- Backend Mode: If
--backendis provided, creates eval job on backend
- Backend Mode: If
Example
This documentation is auto-generated from source code docstrings.
Arguments
- APP_ID (optional)
Options
| Option | Type | Default | Description |
|---|---|---|---|
--model | TEXT | "" | - |
--config | TEXT | "" | - |
--trace-db | TEXT | "" | - |
--metadata | TEXT | - | - |
--seeds | TEXT | "" | - |
--url | TEXT | "" | - |
--backend | TEXT | "" | - |
--env-file | TEXT | "" | - |
--ops | TEXT | "" | - |
--return-trace | flag | false | - |
--concurrency | TEXT | "" | - |
--seed-set | Choice(seeds, validation_seeds, test_pool) | seeds | - |
--wait | flag | false | - |
--poll | TEXT | "" | - |
--output | TEXT | "" | - |
--traces-dir | TEXT | "" | - |
--output-txt | TEXT | "" | - |
--output-json | TEXT | "" | - |