A Reddit user inquires whether it is feasible to run a reasonable quantization of the Qwen 3.6 35B A3B model, or potentially Ornith/3.5, on a laptop equipped with 32GB of LPDDR5 RAM and an RTX 5060 GPU.

  • The user notes the laptop has 32GB of LPDDR5 memory running at 7500 MT/s, offering higher bandwidth than standard DDR5.
  • The hardware includes an RTX 5060 laptop GPU, which is described as similar to its desktop counterpart.
  • The user acknowledges that while the 35B A3B architecture is capable for its size, they anticipate potential context or KV cache limitations during use.