Complete reference for prompt optimization configuration files, including all algorithm parameters, model requirements, and best practices.

Configuration Structure

[prompt_learning]
algorithm = "gepa"  # or "mipro"
task_app_url = "http://127.0.0.1:8102"
task_app_id = "banking77"
evaluation_seeds = [50, 51, 52, ...]
validation_seeds = [0, 1, 2, ...]

[prompt_learning.initial_prompt]
messages = [
  { role = "system", content = "..." },
  { role = "user", pattern = "Query: {query}" }
]

[prompt_learning.gepa]
initial_population_size = 20
num_generations = 15
mutation_rate = 0.3
crossover_rate = 0.5
rollout_budget = 1000
max_concurrent_rollouts = 20
pareto_set_size = 20

Core Settings

[prompt_learning]

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| algorithm | "gepa" \| "mipro" | Yes | Optimization algorithm |
| task_app_url | string | Yes | Task app endpoint URL |
| task_app_id | string | No | Task app identifier |
| evaluation_seeds | array[int] | Yes | Training seed indices |
| validation_seeds | array[int] | Yes | Validation seed indices |

[prompt_learning.initial_prompt]

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| messages | array[object] | Yes | Initial prompt template |

Each message object:
  • role: "system", "user", or "assistant"
  • content: static message text (string)
  • pattern: template containing a {query} placeholder (string)
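The content/pattern distinction can be sketched in Python. `render_messages` below is a hypothetical helper (not part of the optimizer's API) showing how a pattern message is filled at rollout time while content messages pass through unchanged:

```python
# Sketch: how a `pattern` message might be rendered at rollout time.
# `render_messages` is a hypothetical helper for illustration only.
def render_messages(messages, query):
    rendered = []
    for msg in messages:
        if "pattern" in msg:
            # Template message: substitute the {query} placeholder.
            rendered.append({"role": msg["role"],
                             "content": msg["pattern"].format(query=query)})
        else:
            # Static message: passed through unchanged.
            rendered.append({"role": msg["role"], "content": msg["content"]})
    return rendered

msgs = [
    {"role": "system", "content": "You are a banking intent classification assistant."},
    {"role": "user", "pattern": "Query: {query}"},
]
print(render_messages(msgs, "card declined abroad")[1]["content"])
# Query: card declined abroad
```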

GEPA Parameters

[prompt_learning.gepa]

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| initial_population_size | int | 20 | Starting number of prompt variants |
| num_generations | int | 15 | Evolutionary cycles to run |
| mutation_rate | float | 0.3 | Probability of mutation (0-1) |
| crossover_rate | float | 0.5 | Probability of crossover (0-1) |
| rollout_budget | int | 1000 | Total task evaluations allowed |
| max_concurrent_rollouts | int | 20 | Parallel rollout limit |
| pareto_set_size | int | 20 | Pareto front size |
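The budget parameters interact: the rollout budget is spread across generations and variants. This back-of-the-envelope check is an assumption about how the budget is spent, not the optimizer's exact accounting; the numbers match the defaults above.

```python
# Rough budget arithmetic for the default GEPA settings.
rollout_budget = 1000
num_generations = 15
initial_population_size = 20

per_generation = rollout_budget / num_generations
per_variant = per_generation / initial_population_size
print(f"~{per_generation:.0f} rollouts/generation, ~{per_variant:.1f} per variant")
# ~67 rollouts/generation, ~3.3 per variant
```

With only a few rollouts per variant per generation, small budgets favor small populations.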

MIPRO Parameters

[prompt_learning.mipro]

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| num_iterations | int | 16 | Number of optimization iterations |
| num_evaluations_per_iteration | int | 6 | Prompt variants evaluated per iteration |
| batch_size | int | 6 | Concurrent evaluations per iteration |
| max_concurrent | int | 20 | Maximum concurrent rollouts |
| bootstrap_train_seeds | array[int] | Required | Seeds for bootstrap phase (few-shot collection) |
| online_pool | array[int] | Required | Seeds for mini-batch evaluation during optimization |
| test_pool | array[int] | Required | Seeds for final held-out evaluation |
| reference_pool | array[int] | Optional | Seeds for reference corpus (up to 50k tokens) |
| meta_model | string | "gpt-4o-mini" | Meta-model for instruction proposals |
| meta_model_provider | string | "openai" | Provider for meta-model ("openai", "groq", "google") |
| meta_model_inference_url | string | Provider default | Inference URL for meta-model |
| few_shot_score_threshold | float | 0.85 | Minimum score for bootstrap examples |
| max_token_limit | int | Optional | Maximum tokens per prompt |
| max_spend_usd | float | Optional | Maximum spend in USD |
| token_counting_model | string | Optional | Model for token counting |
| enforce_token_limit | bool | false | Enforce token limits strictly |
| spec_path | string | Optional | Path to system spec JSON file |
| spec_max_tokens | int | 5000 | Max tokens from spec to include |
| spec_include_examples | bool | true | Include examples from spec |
| spec_priority_threshold | int | 8 | Minimum priority for spec rules |
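The bootstrap phase can be pictured as a filter over scored traces. This sketch shows the selection rule implied by `few_shot_score_threshold`; the data shapes are illustrative, not the optimizer's internal format.

```python
# Sketch: keep only bootstrap traces scoring at or above the threshold
# as few-shot demos (record shapes are illustrative).
few_shot_score_threshold = 0.85

bootstrap_results = [
    {"seed": 0, "score": 0.92, "demo": "..."},
    {"seed": 1, "score": 0.60, "demo": "..."},
    {"seed": 2, "score": 0.88, "demo": "..."},
]

demos = [r for r in bootstrap_results if r["score"] >= few_shot_score_threshold]
print([r["seed"] for r in demos])  # [0, 2]
```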

Example Configurations

Banking77 (GEPA)

[prompt_learning]
algorithm = "gepa"
task_app_url = "http://127.0.0.1:8102"
task_app_id = "banking77"
evaluation_seeds = [50, 51, 52, ..., 79]
validation_seeds = [0, 1, 2, ..., 49]

[prompt_learning.initial_prompt]
messages = [
  { role = "system", content = "You are a banking intent classification assistant." },
  { role = "user", pattern = "Customer Query: {query}\n\nClassify this query into one of 77 banking intents." }
]

[prompt_learning.gepa]
initial_population_size = 20
num_generations = 15
mutation_rate = 0.3
crossover_rate = 0.5
rollout_budget = 1000
max_concurrent_rollouts = 20
pareto_set_size = 20

HotpotQA (GEPA)

[prompt_learning]
algorithm = "gepa"
task_app_url = "http://127.0.0.1:8103"
task_app_id = "hotpotqa"
evaluation_seeds = [0, 1, 2, ..., 29]
validation_seeds = [30, 31, 32, ..., 79]

[prompt_learning.initial_prompt]
messages = [
  { role = "system", content = "You are a question-answering assistant." },
  { role = "user", pattern = "Question: {query}\n\nAnswer this multi-hop question using reasoning." }
]

[prompt_learning.gepa]
initial_population_size = 30
num_generations = 20
mutation_rate = 0.25
crossover_rate = 0.6
rollout_budget = 1500
max_concurrent_rollouts = 25
pareto_set_size = 25

Supported Models

Policy Models (Task Execution)

Both GEPA and MIPRO support policy models from three providers:

OpenAI Models

  • gpt-4o
  • gpt-4o-mini
  • gpt-4.1
  • gpt-4.1-mini
  • gpt-4.1-nano
  • gpt-5
  • gpt-5-mini
  • gpt-5-nano
Explicitly REJECTED: gpt-5-pro (too expensive: 15/15/120 per 1M tokens)

Groq Models

  • gpt-oss-Xb pattern (e.g., gpt-oss-20b, openai/gpt-oss-120b)
  • llama-3.3-70b and variants (e.g., llama-3.3-70b-versatile)
  • qwen-32b, qwen3-32b, groq/qwen3-32b

Google/Gemini Models

  • gemini-2.5-pro
  • gemini-2.5-pro-gt200k
  • gemini-2.5-flash
  • gemini-2.5-flash-lite

Mutation Models (GEPA Only)

Used to generate prompt mutations/variations:
| Model | Provider | Common Usage |
| --- | --- | --- |
| openai/gpt-oss-120b | Groq | Most common |
| openai/gpt-oss-20b | Groq | Alternative |
| llama-3.3-70b-versatile | Groq | Alternative |
| llama3-groq-70b-8192-tool-use-preview | Groq | Alternative |
Nano models are REJECTED (too small for generation tasks)

Meta Models (MIPRO Only)

Used to generate instruction proposals:
| Model | Provider | Common Usage |
| --- | --- | --- |
| gpt-4o-mini | OpenAI | Most common default |
| gpt-4.1-mini | OpenAI | Alternative |
| gpt-4o | OpenAI | Higher quality, more expensive |
Nano models are REJECTED (too small for generation tasks)

Model Configuration

# Policy model (both algorithms)
[prompt_learning.policy]
model = "openai/gpt-oss-20b"
provider = "groq"
inference_url = "https://api.groq.com/openai/v1"

# Mutation model (GEPA only)
[prompt_learning.gepa.mutation]
llm_model = "openai/gpt-oss-120b"
llm_provider = "groq"
llm_inference_url = "https://api.groq.com/openai/v1"

# Meta model (MIPRO only)
[prompt_learning.mipro]
meta_model = "gpt-4o-mini"
meta_model_provider = "openai"
meta_model_inference_url = "https://api.openai.com/v1"

Policy Configuration

[prompt_learning.policy]

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| model | string | Yes | Policy model identifier |
| provider | string | Yes | Provider ("openai", "groq", "google") |
| inference_url | string | Yes | Inference endpoint URL |
| inference_mode | string | Optional | "synth_hosted" or custom |
| temperature | float | Optional | Sampling temperature (default: 0.0) |
| max_completion_tokens | int | Optional | Maximum tokens (default: 512) |

GEPA Mutation Configuration

[prompt_learning.gepa.mutation]

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| rate | float | 0.3 | Probability of mutation (0-1) |
| llm_model | string | Optional | LLM for guided mutations |
| llm_provider | string | Optional | Provider for mutation LLM |
| llm_inference_url | string | Optional | Inference URL for mutation LLM |
| proposer_type | string | "dspy" | "dspy" or "spec" |

System Spec Configuration

Both GEPA and MIPRO support system specifications (specs) for constraint-aware optimization.

[prompt_learning.gepa] Spec Parameters

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| proposer_type | string | "dspy" | "dspy" or "spec" (requires spec_path) |
| spec_path | string | Optional | Path to system spec JSON file (required if proposer_type = "spec") |
| spec_max_tokens | int | 5000 | Max tokens for spec context in mutation prompts |
| spec_include_examples | bool | true | Include examples from spec |
| spec_priority_threshold | int | Optional | Only include rules with priority >= threshold |

[prompt_learning.mipro] Spec Parameters

| Parameter | Type | Default | Description |
| --- | --- | --- | --- |
| spec_path | string | Optional | Path to system spec JSON file |
| spec_max_tokens | int | 5000 | Max tokens for spec context in meta-prompt |
| spec_include_examples | bool | true | Include examples from spec |
| spec_priority_threshold | int | Optional | Only include rules with priority >= threshold |
See System Specifications for complete details on creating and using specs.
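As a sketch of what `spec_priority_threshold` does: rules below the threshold are dropped before the spec context is built. The spec JSON shape here is a simplified guess for illustration; see the System Specifications page for the real schema.

```python
# Sketch of priority filtering: only rules with priority >= threshold
# reach the mutation/meta prompt. Spec shape is illustrative only.
spec = {
    "rules": [
        {"text": "Never reveal account numbers.", "priority": 10},
        {"text": "Prefer concise answers.", "priority": 5},
    ]
}
spec_priority_threshold = 8

kept = [r for r in spec["rules"] if r["priority"] >= spec_priority_threshold]
print(len(kept))  # 1
```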

Multi-Stage Pipeline Configuration

Both algorithms support multi-stage pipelines:

GEPA Multi-Stage

[[prompt_learning.gepa.modules]]
module_id = "classifier"
max_instruction_slots = 3
max_tokens = 512

[[prompt_learning.gepa.modules]]
module_id = "calibrator"
max_instruction_slots = 2
max_tokens = 256

MIPRO Multi-Stage

[[prompt_learning.mipro.modules]]
module_id = "classifier"
max_instruction_slots = 3
max_demo_slots = 5

[[prompt_learning.mipro.modules]]
module_id = "calibrator"
max_instruction_slots = 3
max_demo_slots = 5

Complete Example Configurations

Banking77 (GEPA)

[prompt_learning]
algorithm = "gepa"
task_app_url = "http://127.0.0.1:8102"
task_app_id = "banking77"
evaluation_seeds = [50, 51, 52, ..., 79]
validation_seeds = [0, 1, 2, ..., 49]

[prompt_learning.initial_prompt]
messages = [
  { role = "system", content = "You are a banking intent classification assistant." },
  { role = "user", pattern = "Customer Query: {query}\n\nClassify this query into one of 77 banking intents." }
]

[prompt_learning.policy]
model = "openai/gpt-oss-20b"
provider = "groq"
inference_url = "https://api.groq.com/openai/v1"
temperature = 0.0
max_completion_tokens = 128

[prompt_learning.gepa]
initial_population_size = 20
num_generations = 15
mutation_rate = 0.3
crossover_rate = 0.5
rollout_budget = 1000
max_concurrent_rollouts = 20
pareto_set_size = 20

[prompt_learning.gepa.mutation]
llm_model = "openai/gpt-oss-120b"
llm_provider = "groq"
llm_inference_url = "https://api.groq.com/openai/v1"

Banking77 (MIPRO)

[prompt_learning]
algorithm = "mipro"
task_app_url = "https://synth-laboratories-dev--synth-banking77-web-web.modal.run"
task_app_id = "banking77"

[prompt_learning.initial_prompt]
messages = [
  { role = "system", content = "You are a banking intent classification assistant." },
  { role = "user", pattern = "Customer Query: {query}\n\nClassify this query into one of 77 banking intents." }
]

[prompt_learning.policy]
model = "openai/gpt-oss-20b"
provider = "groq"
inference_url = "https://api.groq.com/openai/v1"
temperature = 0.0
max_completion_tokens = 128

[prompt_learning.mipro]
num_iterations = 16
num_evaluations_per_iteration = 6
batch_size = 6
max_concurrent = 20
bootstrap_train_seeds = [0, 1, 2, 3, 4]
online_pool = [5, 6, 7, 8, 9]
test_pool = [20, 21, 22, 23, 24]
meta_model = "gpt-4o-mini"
meta_model_provider = "openai"
meta_model_inference_url = "https://api.openai.com/v1"
few_shot_score_threshold = 0.85

Best Practices

GEPA Best Practices

  1. Population Size: Start with 20-30 variants for most tasks; increase for complex tasks.
  2. Generations: 10-15 generations are usually sufficient; run more for complex optimization problems.
  3. Mutation Rate: 0.2-0.4 works well. Higher values favor exploration; lower values favor exploitation.
  4. Rollout Budget: Allocate 50-100 rollouts per generation for stable score estimates.
  5. Concurrency: Set max_concurrent_rollouts based on task app capacity (typically 10-50).
  6. Mutation Model: Use gpt-oss-120b for the best-quality mutations, or gpt-oss-20b for faster, cheaper runs.

MIPRO Best Practices

  1. Bootstrap Seeds: Use 5-15 seeds for the bootstrap phase. A higher few_shot_score_threshold yields fewer but better examples.
  2. Iterations: 10-20 iterations are usually sufficient; run more for complex tasks.
  3. Evaluations per Iteration: 4-6 variants per iteration balances exploration against cost.
  4. Meta Model: gpt-4o-mini is the sweet spot for quality and cost; use gpt-4o for higher quality.
  5. Reference Pool: Optional but recommended. 50-100 seeds provide rich context (up to 50k tokens).
  6. Token Budget: Set max_token_limit and max_spend_usd to control costs.
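The token-budget rule amounts to a running cost cap. This sketch shows the idea behind `max_spend_usd` with placeholder numbers; the real enforcement is handled by the optimizer, and the price figure is not real provider pricing.

```python
# Illustrative spend guard: stop launching rollouts once the estimated
# cost would cross the cap. Pricing is a placeholder.
max_spend_usd = 2.0
price_per_1m_tokens = 0.15  # placeholder $/1M tokens

spent = 0.0
completed = 0
for tokens_used in [200_000] * 100:  # simulated per-rollout token counts
    cost = tokens_used / 1_000_000 * price_per_1m_tokens
    if spent + cost > max_spend_usd:
        break  # budget exhausted; stop scheduling rollouts
    spent += cost
    completed += 1

print(completed)  # 66
```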

General Best Practices

  1. Seed Splitting: Keep training, validation, and test seeds separate; never let them overlap.
  2. Baseline Prompt: Start with a clear, task-specific baseline. A better baseline yields better optimization.
  3. Model Selection: Use Groq models (e.g., gpt-oss-20b) for cost-effective policy execution.
  4. Concurrency: Match max_concurrent to your task app's capacity; setting it too high triggers rate limits.
  5. Monitoring: Track accuracy, token count, and cost throughout optimization.
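A quick way to enforce the first rule is a disjointness check over the seed pools before launching a run. `check_disjoint` is a hypothetical helper, not part of the toolkit:

```python
# Hypothetical helper enforcing seed-pool separation: raises if any
# seed appears in more than one pool.
def check_disjoint(**pools):
    seen = {}
    for name, seeds in pools.items():
        for s in seeds:
            if s in seen:
                raise ValueError(f"seed {s} appears in both {seen[s]} and {name}")
            seen[s] = name

check_disjoint(
    evaluation_seeds=list(range(50, 80)),
    validation_seeds=list(range(0, 50)),
)
print("pools are disjoint")
```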