A Reddit user demonstrates running the Qwen3.6-27B model quantized to Q3 with KV at Q8 on an AMD Mi50 32GB GPU, achieving approximately 180+ tokens per second for prompt processing and 9 tokens per second for text generation.
- Hardware setup includes a T5610 with 64 GB DDR3 RAM and a 256 GB SATA SSD.
- The user utilizes the model to create proof of concepts for a custom SaaS accounting application tailored to the construction industry.
- A GitHub repository named exaMath is shared, allowing users to run the setup via Docker after configuring environment variables.
The author shares this configuration as an open-source resource to help other contractors and developers who lack access to expensive enterprise software or high-end hardware.