Benchmark · math
GSM8K
- 2026-06-19 LLMs 98.0% GEMS: Geometric Constraints Enable Multi-Semantic Superposition in LLMs
- 2026-06-18 rubric-conditioned self-distillation 1.0pts Rubric-Conditioned Self-Distillation Framework
- 2026-06-18 rubric-conditioned self-distillation 1.0pts Rubric-Conditioned Self-Distillation Framework
- 2026-06-18 SC-GRPO 8.1% Self-Conditioned Credit Assignment for RL with Verifiable Rewards
- 2026-06-18 DAPO 5.9% Self-Conditioned Credit Assignment for RL with Verifiable Rewards