Banking77 Prompt Optimization

Run the Notebook

1. Install dependencies

The first cell installs synth-ai, datasets, and cloudflared.

2. Configure API keys

The setup cell auto-generates a Synth AI demo API key. Optionally set OPENAI_API_KEY in Colab secrets to enable the baseline vs optimized preview (Step 4).

3. Define the classifier

The notebook loads the Banking77 dataset and defines a classification function using gpt-4o-mini with function calling.

4. Preview baseline vs optimized

Compares a simple baseline prompt (~78% accuracy) against an optimized prompt (~92% accuracy) on 50 test samples.

5. Start the Local API

Launches a FastAPI server with a Cloudflare tunnel so Synth AI can reach the notebook.

6. Run GEPA optimization

Submits a prompt learning job that evolves prompts over 3 generations, evaluating each on 50 samples.

7. Evaluate on held-out data

Runs a formal eval comparing baseline vs optimized prompts on 50 held-out samples (indices 100-149).

8. Cleanup

Terminates the Cloudflare tunnel.

Results

Baseline accuracy: ~78%
Optimized accuracy: ~92%
Improvement: +14%

Ready to get started?

Get Started

Schedule Demo

See Synth in action with a personalized walkthrough.

Walkthroughs

​Run the Notebook

​1. Install dependencies

​2. Configure API keys

​3. Define the classifier

​4. Preview baseline vs optimized

​5. Start the Local API

​6. Run GEPA optimization

​7. Evaluate on held-out data

​8. Cleanup

​Results

​Ready to get started?