Qwen3.6-27B with 3-Critic Harness Matches Frontier Quality

A user tested Qwen3.6-27B (8-bit) alongside GLM5.2 using a coding harness that employs three critics—code review, test review, and Playwright e2e—to validate output quality.

The 3-critic pipeline effectively catches the additional mistakes inherent in the dense model, allowing it to match the final output quality of frontier models.
While the execution path is noisier than with larger models, the harness manages retry overhead without disrupting workflow.
The optimal strategy identified is using a frontier model like GLM5.2 for planning and Qwen3.6 for high-volume implementation where the error-catching mechanism compensates for lower raw accuracy.