Accurate for Stack
0.1.0 and later unless a section cites a newer release.
See Stack Changelog.What GEPA does
GEPA iteratively proposes and evaluates prompt variants against a task’s training and held-out sets, keeping changes that improve the score. Stack drives the optimizer service, runs the rollouts, and records the result as a StackEval packet you can replay.Run a built-in task
Stack ships StackEval GEPA tasks you can run end-to-end:| Task | What it optimizes |
|---|---|
banking77-local-gepa | intent classification on the Banking77 set |
crafter-local-gepa | an agent policy prompt on Crafter |
stackd and the Stack TUI, scaffolds a StackEval packet
(initial_prompt.txt, metadata.json, trace pointers), and starts the optimization
loop. Progress, candidate scores, and the final improved prompt are written to the
packet’s trace directory and shown in the cockpit.
Inspect the run
Every optimizer run is grounded in stackd receipts:Hosted optimizers
The same loop can run on Synth’s hosted optimizer service when you are signed in. See thehosted-gepa Codex skill for connecting a Stack session to hosted runs.