synth_ai.cli.commands.eval.core
Alpha
Eval command CLI entry point for task app rollouts.
This module provides the Click command-line interface for the synth-ai eval command.
Command Overview:
The eval command executes rollouts against a task app and summarizes results.
It supports two execution modes:
- Direct Mode: Calls task app directly (no backend required)
- Backend Mode: Routes through backend for trace capture and cost tracking
- TOML config file (
--config) - Command-line arguments (override config)
- Environment variables (for API keys, etc.)
- Prints results table to stdout
- Optionally writes report to
--output-txt - Optionally writes JSON to
--output-json - Optionally saves traces to
--traces-dir
synth_ai.cli.commands.eval.runner: Evaluation execution logicsynth_ai.cli.commands.eval.config: Configuration loadingmonorepo/docs/cli/eval.mdx: Full CLI documentation
Functions
eval_command
synth-ai eval command.
Execution Modes:
- Direct Mode: If
--backendis not provided, calls task app directly - Backend Mode: If
--backendis provided, creates eval job on backend
synth_ai.cli.commands.eval.runner.run_eval(): Execution logicsynth_ai.cli.commands.eval.config.resolve_eval_config(): Config resolution