Home / Competitions / Pareto Router Challenge
Pareto Router Challenge
Build an inference router that improves the cost / latency Pareto on a held-out tool-use workload. Submissions are auto-graded; the leaderboard updates every 15 minutes.
- Prize
- $25,000 · 60/25/15 across top-3
- Status
- Open
- Deadline
- 31 Aug 2026 · 23:59 UTC
- Entries
- 184 across 37 countries
The task
You have a workload of 20,000 tool-use queries, each with a known difficulty distribution. You have a fleet of six models at different price + latency points. Write a router that assigns each query to a model — within a budget — and maximises accuracy on a held-out grader.
What we score
- Pareto AUC across five (cost, latency) budgets — higher is better
- Held-out grader: 4,000 queries you never see, drawn from the same distribution
- Tie-break: lowest p95 wall-clock on the same hardware
Rules
- Submit code in a sandboxed Docker image — no internet at grading time
- One active submission per handle; switching counts as a new submission
- Subsequent submissions allowed every 30 minutes
- Winners must open-source their final submission under MIT or Apache-2 within 30 days of close
- Anyone in the world is eligible. Working scientists at frontier labs may compete but are ineligible for prize money.
Starter pack
The starter notebook ships the workload schema, a baseline random router, a baseline cost-only router, and the grading harness so you can validate locally before submitting.
$25,000 prize pool
Top-3 splits: $15,000 / $6,250 / $3,750. 184 entries so far. Closes 31 Aug 2026.
Updates every 15 min
Leaderboard · top 10
Δ 24h is the change in Pareto AUC over the last 24 hours. Subs = accepted submissions. Late entries can still move up — the board doesn't lock until 31 Aug.
| # | Handle | Score | Δ 24h | Subs |
|---|---|---|---|---|
| 1 | FR minou | 0.8712 | +0.0034 | 14 |
| 2 | LK dineth.k | 0.8688 | +0.0012 | 22 |
| 3 | GR sofia_k | 0.8654 | +0.0008 | 9 |
| 4 | PL tomek_w | 0.8631 | -0.0021 | 17 |
| 5 | SE henrik.l | 0.8612 | +0.0044 | 6 |
| 6 | CA priya.a | 0.8597 | +0.0017 | 12 |
| 7 | JP kenji | 0.8584 | — | 11 |
| 8 | CA lea.b | 0.8571 | +0.0006 | 8 |
| 9 | NG adaeze.o | 0.8559 | +0.0024 | 4 |
| 10 | HK wai.lin | 0.8541 | +0.0011 | 13 |