A Reddit user demonstrates running the Qwen3.6-27B model quantized to Q3 with KV at Q8 on an AMD Mi50 32GB GPU, achieving approximately 180+ tokens per second for prompt processing and 9 tokens per second for text generation.

  • Hardware setup includes a T5610 with 64 GB DDR3 RAM and a 256 GB SATA SSD.
  • The user utilizes the model to create proof of concepts for a custom SaaS accounting application tailored to the construction industry.
  • A GitHub repository named exaMath is shared, allowing users to run the setup via Docker after configuring environment variables.

The author shares this configuration as an open-source resource to help other contractors and developers who lack access to expensive enterprise software or high-end hardware.