media r/LocalLLaMA · 5d ago · open_models

GLM-5.2 can now run locally in llama.cpp and Unsloth Studio

from English

GLM-5.2, the strongest open model to date, can now run locally using llama.cpp and Unsloth Studio. The 2-bit quantized model retains ~82% accuracy after reducing size from 1.51TB to 238GB, a 84% reduction, and is compatible with 256GB RAM or VRAM setups.

Importance 2/3 r/LocalLLaMA Zhipu AI Inference efficiency Open weights

Read original