Pareto Router Challenge

Build an inference router that improves the cost/latency Pareto frontier on a held-out tool-use workload. Submissions are auto-graded; the leaderboard updates every 15 minutes.

Prize: $25,000 · 60/25/15 across top-3
Status: Open
Deadline: 31 Aug 2026 · 23:59 UTC
Entries: 184 across 37 countries

The task

You are given a workload of 20,000 tool-use queries, each with a known difficulty distribution, and a fleet of six models at different price and latency points. Write a router that assigns each query to a model within a budget and maximises accuracy on a held-out grader.
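
To make the task concrete, here is a minimal sketch of one possible budget-aware router. It is not the organisers' baseline or the expected solution. Everything in it is an assumption for illustration: the `Model` fields, the toy `expected_accuracy` formula, the single scalar difficulty per query, and a budget expressed as total cost only (latency is ignored). The idea is to start every query on the cheapest model, then spend the remaining budget on the upgrades that buy the most accuracy per unit of extra cost.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    cost: float   # hypothetical price per query
    skill: float  # hypothetical ability in [0, 1]


def expected_accuracy(model, difficulty):
    # Toy assumption: errors grow with difficulty and shrink with skill.
    return 1.0 - difficulty * (1.0 - model.skill)


def greedy_route(difficulties, models, budget):
    """Start every query on the cheapest model, then buy the best upgrades."""
    cheapest = min(models, key=lambda m: m.cost)
    assignment = [cheapest] * len(difficulties)
    spent = cheapest.cost * len(difficulties)
    if spent > budget:
        raise ValueError("budget does not cover the cheapest model")

    # Score every (query, model) upgrade by accuracy gained per extra cost.
    upgrades = []
    for i, d in enumerate(difficulties):
        base = expected_accuracy(cheapest, d)
        for m in models:
            extra = m.cost - cheapest.cost
            gain = expected_accuracy(m, d) - base
            if extra > 0 and gain > 0:
                upgrades.append((gain / extra, i, m))
    upgrades.sort(key=lambda u: u[0], reverse=True)

    # Apply upgrades in ratio order; each query is upgraded at most once.
    upgraded = set()
    for _, i, m in upgrades:
        extra = m.cost - cheapest.cost
        if i not in upgraded and spent + extra <= budget:
            assignment[i] = m
            upgraded.add(i)
            spent += extra
    return assignment, spent


# Hypothetical two-model fleet and two queries (easy, hard).
fleet = [Model("small", 1.0, 0.5), Model("large", 5.0, 0.9)]
assignment, spent = greedy_route([0.1, 0.9], fleet, budget=6.0)
print([m.name for m in assignment], spent)
```

With this toy fleet the budget covers only one upgrade, and the greedy rule spends it on the hard query, where the larger model helps most. A real entry would also need to account for latency and for the actual accuracy behaviour of the six models.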

What we score

  • Pareto AUC across five (cost, latency) budgets — higher is better
  • Held-out grader: 4,000 queries you never see, drawn from the same distribution
  • Tie-break: lowest p95 wall-clock on the same hardware
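
This page does not spell out how Pareto AUC is computed, so the sketch below shows only one plausible reading; it is not the grader's formula. It assumes the five budgets can be ordered from tightest to loosest along a single axis, takes the held-out accuracy at each, and integrates with the trapezoid rule, normalised by the axis width. The tie-break is sketched as a nearest-rank 95th percentile of wall-clock times. The function names and the budget positions are hypothetical.

```python
def trapezoid_auc(budget_positions, accuracies):
    """Area under accuracy vs. budget position, normalised by the x-range."""
    if len(budget_positions) != len(accuracies) or len(accuracies) < 2:
        raise ValueError("need at least two matching points")
    pts = sorted(zip(budget_positions, accuracies))
    width = pts[-1][0] - pts[0][0]
    if width <= 0:
        raise ValueError("budget positions must span a positive range")
    area = sum(
        (x1 - x0) * (y0 + y1) / 2.0
        for (x0, y0), (x1, y1) in zip(pts, pts[1:])
    )
    return area / width


def p95_nearest_rank(samples):
    """Nearest-rank 95th percentile: the value at rank ceil(0.95 * n)."""
    ordered = sorted(samples)
    if not ordered:
        raise ValueError("need at least one sample")
    rank = (95 * len(ordered) + 99) // 100  # integer ceil(0.95 * n)
    return ordered[rank - 1]


# Hypothetical accuracies at five budgets, tightest to loosest.
score = trapezoid_auc([0, 1, 2, 3, 4], [0.5, 0.6, 0.7, 0.8, 0.9])
print(round(score, 4))
```

Under this reading, a router that holds its accuracy at the tight budgets scores higher than one that only performs well when the budget is loose.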

Rules

  • Submit code in a sandboxed Docker image — no internet at grading time
  • One active submission per handle; switching the active submission counts as a new submission
  • After your first submission, you may submit again at most once every 30 minutes
  • Winners must open-source their final submission under MIT or Apache-2 within 30 days of close
  • Anyone in the world is eligible. Working scientists at frontier labs may compete but are ineligible for prize money.

Starter pack

The starter notebook ships the workload schema, a baseline random router, a baseline cost-only router, and the grading harness so you can validate locally before submitting.
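
For orientation, here is a rough sketch of what baselines of this kind might look like. It is not the code in the starter notebook. The fleet names and prices are invented, and "cost-only" is read here as "send every query to the priciest single model that fits the per-query budget", which is only one possible interpretation.

```python
import random

# Hypothetical fleet of six models: (name, cost per query). Invented values.
FLEET = [
    ("tiny", 0.2), ("small", 0.5), ("medium", 1.0),
    ("large", 2.0), ("xl", 4.0), ("xxl", 8.0),
]


def random_router(n_queries, fleet, seed=0):
    """Assign each query to a uniformly random model; ignores the budget."""
    rng = random.Random(seed)  # seeded so local runs are repeatable
    return [rng.choice(fleet)[0] for _ in range(n_queries)]


def cost_only_router(n_queries, fleet, budget):
    """Send every query to the priciest model affordable for all queries."""
    if n_queries <= 0:
        return []
    per_query = budget / n_queries
    affordable = [m for m in fleet if m[1] <= per_query]
    if not affordable:
        raise ValueError("budget does not cover the cheapest model")
    name, _ = max(affordable, key=lambda m: m[1])
    return [name] * n_queries


print(random_router(5, FLEET, seed=1))
print(cost_only_router(10, FLEET, budget=12.0))
```

Neither baseline looks at query difficulty, which is the gap a competitive router is expected to close.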

$25,000 prize pool

Top-3 splits: $15,000 / $6,250 / $3,750. 184 entries so far. Closes 31 Aug 2026.

Leaderboard · top 10

Δ 24h is the change in Pareto AUC over the last 24 hours. Subs = accepted submissions. Late entries can still move up — the board doesn't lock until 31 Aug.

#    Handle         Score    Δ 24h     Subs
1    FR minou       0.8712   +0.0034   14
2    LK dineth.k    0.8688   +0.0012   22
3    GR sofia_k     0.8654   +0.0008   9
4    PL tomek_w     0.8631   -0.0021   17
5    SE henrik.l    0.8612   +0.0044   6
6    CA priya.a     0.8597   +0.0017   12
7    JP kenji       0.8584             11
8    CA lea.b       0.8571   +0.0006   8
9    NG adaeze.o    0.8559   +0.0024   4
10   HK wai.lin     0.8541   +0.0011   13