synth-ai/examples/rl/ and your backend.
0) Prereqs
- Backend URL + API key
- Modal CLI logged in
- uv installed
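A quick preflight for the list above can be sketched in Python. This is a minimal sketch, not part of the example repo: the environment variable names `BACKEND_URL` and `BACKEND_API_KEY` are hypothetical (the guide does not say how the backend URL and key are supplied), and the check only confirms that the `modal` and `uv` executables are on PATH. It does not verify that the Modal CLI is actually logged in.

```python
import os
import shutil

# Hypothetical variable names: the guide does not specify how the backend
# URL and API key are provided, so adjust these to your setup.
REQUIRED_ENV = ("BACKEND_URL", "BACKEND_API_KEY")
# Executables the prerequisites mention. Presence on PATH only; this does
# not confirm that the Modal CLI is logged in.
REQUIRED_TOOLS = ("modal", "uv")


def missing_prereqs(env=None, which=shutil.which):
    """Return a list of human-readable problems; an empty list means ready."""
    env = os.environ if env is None else env
    problems = [
        f"environment variable {name} is not set"
        for name in REQUIRED_ENV
        if not env.get(name)
    ]
    problems += [
        f"'{tool}' not found on PATH"
        for tool in REQUIRED_TOOLS
        if which(tool) is None
    ]
    return problems


if __name__ == "__main__":
    for problem in missing_prereqs():
        print("missing:", problem)
```

The `env` and `which` parameters exist only so the check can be exercised without touching the real environment.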
1) Create the Task App (environment service)
Deploy the example Task App to Modal (uses Crafter helpers).
2) Mint & upload ENVIRONMENT_API_KEY
Use the helper to mint the key and store it in a Modal secret used by the Task App: the crafter-environment-sdk secret, which holds ENVIRONMENT_API_KEY and optional pass‑throughs.
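To make the mint-and-store step concrete, here is a rough sketch of what such a helper might do. It is not the repo's helper: the key format (64 hex characters) is an assumption, and `OPENAI_API_KEY` in the usage note is only an illustrative pass-through. The one external call it relies on is the Modal CLI's `modal secret create <name> KEY=VALUE` form, and the sketch only builds that command rather than running it.

```python
import secrets
import shlex

SECRET_NAME = "crafter-environment-sdk"  # secret name given in the guide


def mint_environment_api_key(nbytes=32):
    """Return a random hex key (assumed format; the real helper may differ)."""
    return secrets.token_hex(nbytes)


def modal_secret_command(key, passthroughs=None):
    """Build, but do not run, a `modal secret create` command as an argv list."""
    pairs = {"ENVIRONMENT_API_KEY": key}
    pairs.update(passthroughs or {})  # optional extra KEY=VALUE entries
    return [
        "modal", "secret", "create", SECRET_NAME,
        *[f"{name}={value}" for name, value in pairs.items()],
    ]


if __name__ == "__main__":
    cmd = modal_secret_command(mint_environment_api_key())
    # Note: this prints the freshly minted key; avoid doing so in shared logs.
    print(shlex.join(cmd))
```

Passing a dictionary such as `{"OPENAI_API_KEY": "..."}` as `passthroughs` appends further `KEY=VALUE` pairs to the same secret. The resulting list can be handed to `subprocess.run` once you have confirmed the command matches your Modal setup.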
3) Health & wiring check
Run diagnostics against backend + Task App to verify headers/auth and routes.
4) OpenAI in the Task App (smoke test)
Call OpenAI from inside the Task App to ensure outbound provider access works.
5) Run full on-policy RL (backend‑orchestrated)
Kick off a full RL job via the backend (uses trainer id server‑side).
6) Inference with RL weights
synth-laboratories/synth-ai.