media r/LocalLLaMA · 7d ago · open_models

Does anyone have enough compute to make a distillation dataset from GLM5.2?

from English

A user asks if anyone with sufficient computing resources can create a large distillation dataset of 70-1 million examples from GLM5.2. The goal is to enable better training of smaller models like Qwen3.5, benefiting the broader community.

Importance 2/3 r/LocalLLaMA Zhipu AI Alibaba (Qwen) Open weights Training data Training methods

Read original