A user asks whether it is feasible to run GLM-5.2 on four ASUS Ascent GX10 units (DGX Spark-class machines). They ask whether a 4-bit quantization would fit in the 512 GB of combined unified memory (128 GB per unit) and request estimated prompt-processing and output-token speeds at a 100k-token context, noting that no performance data for this setup is available online.
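
Since no measurements exist, a rough back-of-envelope check is one way to frame the question. The sketch below is only illustrative: the parameter count, active-weight size, KV-cache size, and memory bandwidth are hypothetical placeholders (none come from the question or from published GLM-5.2 figures), and it ignores inter-node communication cost, which may dominate on a four-machine cluster.

```python
def quantized_weight_gb(n_params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight footprint in GB (1 GB = 1e9 bytes)."""
    return n_params_billion * bits_per_weight / 8.0


def fits_in_memory(weight_gb: float, kv_cache_gb: float,
                   total_memory_gb: float, headroom_fraction: float = 0.1) -> bool:
    """True if weights plus KV cache fit after reserving some headroom
    for the OS, activations, and runtime buffers."""
    usable_gb = total_memory_gb * (1.0 - headroom_fraction)
    return weight_gb + kv_cache_gb <= usable_gb


def decode_tokens_per_s_upper_bound(active_weight_gb: float,
                                    bandwidth_gb_s: float) -> float:
    """Memory-bound ceiling on decode speed: every generated token must
    read the active weights once. Real throughput will be lower."""
    return bandwidth_gb_s / active_weight_gb


if __name__ == "__main__":
    # All values below are ASSUMED placeholders, not published specs.
    params_b = 700.0        # hypothetical total parameter count (billions)
    kv_cache_gb = 50.0      # hypothetical KV cache size at 100k context
    total_memory_gb = 512.0 # from the question: four units combined
    active_gb = 20.0        # hypothetical bytes read per token (e.g. MoE active experts)
    bandwidth_gb_s = 273.0  # assumed per-unit memory bandwidth

    weights = quantized_weight_gb(params_b, bits_per_weight=4.0)
    print(f"weights: {weights:.0f} GB")
    print(f"fits: {fits_in_memory(weights, kv_cache_gb, total_memory_gb)}")
    print(f"decode ceiling: {decode_tokens_per_s_upper_bound(active_gb, bandwidth_gb_s):.1f} tok/s")
```

Substituting the model's real parameter count and the hardware's real bandwidth gives a ceiling, not a prediction; prompt processing at 100k tokens is compute-bound and would need a separate estimate.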