Rubrics define how traces are scored by hosted verifiers. Each rubric contains criteria with weights that determine the final reward.

Structure

from synth_ai.data.rubrics import Rubric, Criterion

rubric = Rubric(
    version="1.0",
    goal_text="Classify banking intents accurately",
    criteria=[
        Criterion(id="accuracy", description="Correct intent label", weight=1.0, required=True),
        Criterion(id="confidence", description="High confidence score", weight=0.5),
    ],
    aggregation="weighted_sum",
)
Field        Type             Description
version      str              Rubric version identifier
goal_text    str              Human-readable goal description
criteria     list[Criterion]  Scoring criteria
aggregation  str              How to combine scores: sum, weighted_sum, custom, inherit
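
To make aggregation concrete, here is a worked weighted_sum example for the two criteria defined above. The normalization by total weight is an assumption; the verifier may instead return the raw weighted sum, so treat this as a sketch rather than the hosted verifier's actual behavior.

# Illustration only; not the hosted verifier's code.
# Assumes each criterion is scored in [0, 1] and that weighted_sum
# normalizes by the total weight (an assumption).
scores  = {"accuracy": 1.0, "confidence": 0.8}
weights = {"accuracy": 1.0, "confidence": 0.5}

weighted = sum(weights[k] * scores[k] for k in scores)  # 1.0*1.0 + 0.5*0.8 = 1.4
reward = weighted / sum(weights.values())               # 1.4 / 1.5 ≈ 0.933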

Criterion

Field        Type   Description
id           str    Unique criterion identifier
description  str    What the verifier evaluates
weight       float  Score multiplier (default: 1.0)
required     bool   If true, failure on this criterion fails the whole rubric
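
The required flag gates the aggregate score. A minimal sketch of one plausible gating rule (zeroing the reward on any required failure is an assumed semantic, not the hosted verifier's actual implementation):

from synth_ai.data.rubrics import Rubric

# Hypothetical gating logic for `required` criteria -- an assumption,
# not the hosted verifier's code. `passed` maps criterion id to pass/fail.
def apply_required_gate(reward: float, passed: dict[str, bool], rubric: Rubric) -> float:
    for criterion in rubric.criteria:
        if criterion.required and not passed.get(criterion.id, False):
            return 0.0  # a required failure fails the whole rubric
    return reward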

Using with Task Apps

Pass rubrics to your task app config:
from synth_ai.sdk import TaskAppConfig
from synth_ai.data.rubrics import Rubric, Criterion

outcome_rubric = Rubric(
    version="1.0",
    goal_text="Classify banking intents accurately",
    criteria=[
        Criterion(id="accuracy", description="Correct intent label", weight=1.0, required=True),
    ],
    aggregation="weighted_sum",
)

config = TaskAppConfig(
    name="banking77",
    # ... other config
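    # Attach outcome_rubric via the config so the /info endpoint can
    # expose it; the exact field name varies by SDK version, so check
    # TaskAppConfig's signature for where the rubric plugs in.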
)
The task app’s /info endpoint exposes these rubrics to the backend.

Rubric Sources

Hosted verifiers can fetch rubrics from:
  1. Task app /info endpoint — Rubrics bundled in TaskAppConfig
  2. Task app /task_info?seed=N — Seed-specific rubrics (see the fetch sketch after this list)
  3. Backend synth_verifier_id — Pre-registered rubrics or VerifierGraphs in Synth AI’s backend
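
For sources 1 and 2, you can check what a running task app advertises by querying the endpoints directly. A minimal sketch, assuming the app serves on http://localhost:8000 (a placeholder) and using seed 42 as an arbitrary example:

import json
import urllib.request

BASE = "http://localhost:8000"  # placeholder: wherever your task app serves

with urllib.request.urlopen(f"{BASE}/info") as resp:
    info = json.load(resp)            # bundled rubrics from TaskAppConfig

with urllib.request.urlopen(f"{BASE}/task_info?seed=42") as resp:
    task_info = json.load(resp)       # seed-specific rubrics

print(info)
print(task_info)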
For backend-registered rubrics (source 3), configure the verifier in your TOML:
[verifier]
enabled = true
reward_source = "verifier"
synth_verifier_id = "banking77-v1"  # Use backend rubric or graph

JSON Example

{
  "version": "1.0",
  "goal_text": "Classify customer banking queries into correct intent categories",
  "criteria": [
    {
      "id": "correct_intent",
      "description": "The predicted intent matches the ground truth label",
      "weight": 1.0,
      "required": true
    },
    {
      "id": "reasoning_quality",
      "description": "Clear reasoning before the final answer",
      "weight": 0.3,
      "required": false
    }
  ],
  "aggregation": "weighted_sum"
}
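
A rubric stored as JSON like this can be rehydrated into the Python objects from the Structure section. A minimal sketch using the constructors shown above; the file name banking77_rubric.json is a hypothetical example, and the SDK may also ship its own loader:

import json
from synth_ai.data.rubrics import Rubric, Criterion

# Hypothetical file containing the JSON document above.
with open("banking77_rubric.json") as f:
    data = json.load(f)

# Assumes the constructors accept the documented fields as keyword arguments.
rubric = Rubric(
    version=data["version"],
    goal_text=data["goal_text"],
    criteria=[Criterion(**c) for c in data["criteria"]],
    aggregation=data["aggregation"],
)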