Supported Tasks
| Task | Description | Algorithm | Config |
|---|---|---|---|
| Banking77 | Intent classification (77 banking intents) | GEPA | banking77_gepa_local.toml |
| Banking77 Pipeline | Two-stage pipeline (classifier → calibrator) | GEPA | banking77_pipeline_gepa_local.toml |
| HotpotQA | Multi-hop question answering | GEPA | hotpotqa_gepa_local.toml |
| IFBench | Instruction following benchmark | GEPA | ifbench_gepa_local.toml |
| HoVer | Claim verification against Wikipedia | GEPA | hover_gepa_local.toml |
| PUPA | Privacy-aware task delegation | GEPA | pupa_gepa_local.toml |
Banking77: Single-Stage vs Multi-Stage
Banking77 has both single-stage and multi-stage (pipeline) variants:-
Single-Stage: Direct intent classification (
banking77_gepa_local.toml)- One LLM call per query
- Simpler configuration
- Faster optimization
-
Multi-Stage Pipeline: Sequential processing (
banking77_pipeline_gepa_local.toml)- Two stages: classifier → calibrator OR query_analyzer → classifier
- Per-stage prompt optimization
- More complex but allows refinement
Quick Start: Banking77
The Banking77 example provides a complete walkthrough:-
Deploy task app:
-
Run optimization:
-
Query results:
Example Configurations
All example configs are available insynth-ai/examples/blog_posts/gepa/configs/:
banking77_gepa_local.toml– Single-stage intent classificationbanking77_pipeline_gepa_local.toml– Multi-stage pipeline (classifier → calibrator)banking77_pipeline_gepa_test.toml– Multi-stage pipeline (query_analyzer → classifier)hotpotqa_gepa_local.toml– Multi-hop QAifbench_gepa_local.toml– Instruction followinghover_gepa_local.toml– Claim verificationpupa_gepa_local.toml– Privacy-aware delegation
- Initial prompt template with
{query}placeholder - Training and validation seed splits
- Algorithm parameters (population size, generations, mutation rate)
Integration Tests
Run the integration test to verify the full workflow:- Deploys the Banking77 task app to Modal
- Runs a GEPA optimization job
- Validates job completion and result structure
Next Steps
- Banking77 Guide – Complete single-stage walkthrough
- Banking77 Comparison – Single-stage vs multi-stage
- Configuration Reference – All parameters
- Training Guide – How to launch jobs