uvx synth-ai demo.
Prerequisites
uvx synth-ai setuphas been run in the current directory.- Modal CLI installed and authenticated (
modal token new), unless you are staying on local uvicorn. - Task app registered (the demo registers 
grpo-crafter-demoautomatically). 
1. Deploy the task app
ENVIRONMENT_API_KEY, builds a Modal image with your code, and stores the resulting TASK_APP_BASE_URL in .env. For local testing swap --runtime modal with --runtime local.
2. Run smoke tests
3. Launch the RL job
--dry-runis deprecated. Run the command for real; the trainer will perform/rl/verify_task_app,/health, and/task_infochecks before submitting work.- The CLI streams job events until completion. Press 
Ctrl+Cif you prefer to monitor viasynth-ai status jobs …later. 
4. Monitor jobs
5. Iterate
- Adjust rewards and hyperparameters in 
configs/rl_from_base_qwen4b.toml. - Reference the latest checkpoint in 
[model].sourceonce you have a good run. - Combine with the Rejection Loop to feed curated traces into SFT jobs.