Benchmark · math

MATH-500

saturated 3 results 2 models
0 5 10 15 20 2026-06-18 2026-06-20 2026-06-23 rubric-conditioned self-distillation · 0.9 · 2026-06-18 rubric-conditioned self-distillation · 0.9 · 2026-06-18 selfcompact · 18.1 · 2026-06-23
rubric-conditioned self-distillation selfcompact
Timeline
  1. 2026-06-23 selfcompact 18.1pts SelfCompact: Self-Driving Context Compaction for Language Models
  2. 2026-06-18 rubric-conditioned self-distillation 0.9pts Rubric-Conditioned Self-Distillation Framework
  3. 2026-06-18 rubric-conditioned self-distillation 0.9pts Rubric-Conditioned Self-Distillation Framework