examples/warming_up_to_rl/ in the SDK.
1. Run a traced RL job
TASKAPP_TRACING_ENABLED=1). When the job completes, download the trace database (Turso/libSQL or the .db file) referenced by the deployment.
2. Filter traces into JSONL
--require-achievement for each condition you want to enforce. The exporter writes JSONL in Synth’s SFT schema.
3. Launch the SFT job
4. Evaluate the tuned checkpoint
model field for your fine-tuned checkpoint to compare metrics against the baseline.
Tips
- Use 
uvx synth-ai status jobs list --status succeeded --jsonto pull job IDs for audit trails. - Keep datasets versioned; each tweak to the filtering criteria should produce a new JSONL for reproducibility.
 - Once you settle on a fine-tuned checkpoint, reference it under 
[model].sourcein your RL configs so future runs start from the refined policy.