media r/LocalLLaMA · 5d ago · open_models

GLM 5.2 Achieves 98% Max Intelligence with Less Than Half Tokens

from English

GLM 5.2 demonstrates 98% of maximum intelligence in coding tasks using less than half of its total token budget, according to a technical report by z_ai. The model's reasoning efficiency has improved significantly, with token usage increasing from 16.7k to 36.7k between GLM 5.1 and GLM 5.2, though high-level settings may strain local hardware performance.

Importance 2/3 r/LocalLLaMA Zhipu AI Code generation Inference efficiency Reasoning models

Benchmarks

Benchmark	Model	Score
SWE-bench Verified	GLM 5.2	98%

Read original