examples/warming_up_to_rl/.
1. Initialise the demo
.env template.
2. Pair the CLI
.env with SYNTH_API_KEY and ENVIRONMENT_API_KEY.
3. Start the task app locally
traces/v3.
4. Collect traced rollouts
In a second shell, run the bundled rollout script:5. Export JSONL
6. Launch the SFT job
ft:…) suitable for future RL or eval jobs.
7. Evaluate the checkpoint
Tips
- Keep 
ft_data/under version control so you can track dataset revisions. - Use 
uvx synth-ai train --no-pollif you prefer to submit jobs and monitor them later withsynth-ai status jobs …. - The same trace database can feed multiple filtered JSONL exports—experiment with different achievement filters before retraining.