GEPA Examples

These examples demonstrate GEPA's evolutionary prompt optimization workflows across several task types: intent classification, multi-hop QA, instruction following, claim verification, and privacy-aware task delegation.

Supported Tasks

Task	Description	Algorithm	Config
Banking77	Intent classification (77 banking intents)	GEPA	`banking77_gepa_local.toml`
Banking77 Pipeline	Two-stage pipeline (classifier → calibrator)	GEPA	`banking77_pipeline_gepa_local.toml`
HotpotQA	Multi-hop question answering	GEPA	`hotpotqa_gepa_local.toml`
IFBench	Instruction following benchmark	GEPA	`ifbench_gepa_local.toml`
HoVer	Claim verification against Wikipedia	GEPA	`hover_gepa_local.toml`
PUPA	Privacy-aware task delegation	GEPA	`pupa_gepa_local.toml`

Banking77: Single-Stage vs Multi-Stage

Banking77 has both single-stage and multi-stage (pipeline) variants:

Single-Stage: Direct intent classification (`banking77_gepa_local.toml`)
- One LLM call per query
- Simpler configuration
- Faster optimization

Multi-Stage Pipeline: Sequential processing (`banking77_pipeline_gepa_local.toml`)
- Two stages: classifier → calibrator, or query_analyzer → classifier (`banking77_pipeline_gepa_test.toml`)
- Per-stage prompt optimization
- More complex, but a later stage can refine an earlier stage's output

See Banking77 Comparison for detailed differences and when to use each.
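
To make the difference concrete, here is a minimal sketch of the two variants using the classifier → calibrator ordering. Everything in it is an assumption for illustration: the `LLM` callable, the `STAGE_PROMPTS` templates, and the stubbed `fake_llm` are hypothetical and are not taken from the synth-ai task app. The point is only that the pipeline makes two calls and gives each stage its own prompt, which is what per-stage optimization would operate on.

```python
from typing import Callable

# Hypothetical stand-in for an LLM call: takes a rendered prompt, returns text.
LLM = Callable[[str], str]

# Hypothetical per-stage templates; each one could be optimized separately.
STAGE_PROMPTS = {
    "classifier": "Classify the banking query into one intent label.\nQuery: {query}",
    "calibrator": (
        "Query: {query}\nProposed intent: {draft}\n"
        "Return the final intent label, correcting the proposal if needed."
    ),
}


def run_single_stage(query: str, llm: LLM) -> str:
    """Single-stage variant: one LLM call per query."""
    return llm(STAGE_PROMPTS["classifier"].format(query=query)).strip()


def run_pipeline(query: str, llm: LLM) -> str:
    """Two-stage variant: the calibrator sees the classifier's draft label."""
    draft = run_single_stage(query, llm)
    final = llm(STAGE_PROMPTS["calibrator"].format(query=query, draft=draft))
    return final.strip()


def fake_llm(prompt: str) -> str:
    """Deterministic stub so the sketch runs without a model."""
    if "Proposed intent:" in prompt:
        return "card_arrival"
    return "card_delivery_estimate "


if __name__ == "__main__":
    question = "Where is my card?"
    print(run_single_stage(question, fake_llm))
    print(run_pipeline(question, fake_llm))
```

In this toy setup the second stage overrides the first stage's draft, which is the kind of refinement the pipeline variant allows at the cost of an extra call per query.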

Quick Start: Banking77

The Banking77 example provides a complete walkthrough:

Deploy task app:

uvx synth-ai deploy banking77 --runtime uvicorn --port 8102

Run optimization:

uvx synth-ai train \
  --config examples/blog_posts/gepa/configs/banking77_gepa_local.toml \
  --poll

Query results:

from synth_ai.learning import get_prompt_text

# "pl_abc123" is a placeholder; use the job ID from your own run.
best_prompt = get_prompt_text(job_id="pl_abc123", rank=1)

See the Banking77 guide for detailed steps, helper scripts, and troubleshooting.
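
If the optimization step is launched from a script right after the deploy step, it may help to confirm that the task app is accepting connections first. The helper below is a sketch using only the Python standard library. `wait_for_port` is a hypothetical name, not part of synth-ai, and the host `127.0.0.1` is an assumption about a local uvicorn deployment; the port 8102 comes from the deploy command above.

```python
import socket
import time


def wait_for_port(host: str = "127.0.0.1", port: int = 8102, timeout_s: float = 30.0) -> bool:
    """Return True once a TCP connection to host:port succeeds, False on timeout."""
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        try:
            # A successful connect only shows that something is listening on the port.
            with socket.create_connection((host, port), timeout=1.0):
                return True
        except OSError:
            time.sleep(0.5)
    return False


if __name__ == "__main__":
    if wait_for_port():
        print("Task app port is open; start the optimization step.")
    else:
        print("Nothing is listening on port 8102 yet.")
```

This checks reachability only. It does not verify that the process on that port is the Banking77 task app or that it is healthy.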

Example Configurations

All example configs are available in synth-ai/examples/blog_posts/gepa/configs/:

banking77_gepa_local.toml – Single-stage intent classification
banking77_pipeline_gepa_local.toml – Multi-stage pipeline (classifier → calibrator)
banking77_pipeline_gepa_test.toml – Multi-stage pipeline (query_analyzer → classifier)
hotpotqa_gepa_local.toml – Multi-hop QA
ifbench_gepa_local.toml – Instruction following
hover_gepa_local.toml – Claim verification
pupa_gepa_local.toml – Privacy-aware delegation

Each config includes:

Initial prompt template with {query} placeholder
Training and validation seed splits
Algorithm parameters (population size, generations, mutation rate)
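
The three ingredients above can be illustrated with a toy loop. This is a generic evolutionary sketch under stated assumptions, not GEPA's actual algorithm and not the synth-ai config schema. The template text, the seed ranges, the `FRAGMENTS` list, and `toy_score` are all hypothetical, and a real run would score candidates by evaluating them against the task app instead of counting instruction fragments.

```python
import random

# Hypothetical initial prompt template with the {query} placeholder.
TEMPLATE = "Classify this banking query into an intent.\nQuery: {query}"

# Hypothetical seed split: seeds index dataset examples and must not overlap.
TRAIN_SEEDS = list(range(0, 20))
VAL_SEEDS = list(range(20, 30))
assert not set(TRAIN_SEEDS) & set(VAL_SEEDS)

# Hypothetical instruction fragments that a mutation may add.
FRAGMENTS = ["Answer with the label only.", "Think about the user's goal.", "Be concise."]


def mutate(template: str, rng: random.Random, mutation_rate: float) -> str:
    """With probability mutation_rate, add one fragment before the query line."""
    if rng.random() >= mutation_rate:
        return template
    head, tail = template.rsplit("\nQuery: {query}", 1)
    return f"{head} {rng.choice(FRAGMENTS)}\nQuery: {{query}}{tail}"


def toy_score(template: str, seeds: list[int]) -> float:
    """Placeholder for mean task reward over seeds (seeds are unused in this toy)."""
    return float(sum(fragment in template for fragment in FRAGMENTS))


def evolve(population_size: int = 4, generations: int = 3,
           mutation_rate: float = 0.5, seed: int = 0) -> str:
    rng = random.Random(seed)
    population = [TEMPLATE] * population_size
    for _ in range(generations):
        children = [mutate(p, rng, mutation_rate) for p in population]
        # Rank parents and children together on the training seeds, keep the best.
        ranked = sorted(population + children,
                        key=lambda t: toy_score(t, TRAIN_SEEDS), reverse=True)
        population = ranked[:population_size]
    # Choose the final prompt using the held-out validation seeds.
    return max(population, key=lambda t: toy_score(t, VAL_SEEDS))


if __name__ == "__main__":
    best = evolve()
    print(best.format(query="Where is my card?"))
```

Because parents compete with their children, the best candidate's score never decreases across generations in this sketch, and every candidate keeps the `{query}` placeholder so it can still be rendered.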

Integration Tests

Run the integration test to verify the full workflow:

cd synth-ai
uv run pytest tests/integration/cli/test_cli_train_gepa_banking77.py -v

This test:

Deploys the Banking77 task app to Modal
Runs a GEPA optimization job
Validates job completion and result structure

Next Steps

Banking77 Guide – Complete single-stage walkthrough
Banking77 Comparison – Single-stage vs multi-stage
Configuration Reference – All parameters
Training Guide – How to launch jobs
