The Gemma4-12B-QAT Uncensored Balanced model is now available, with a claimed 60% speed improvement from multi-token prediction (MTP) speculative decoding. The release includes a Q4_K_M quantization and vision support via mmproj, and it is described as generating stably, with no looping or context drift, which suits it to creative writing and emotional intelligence tasks.
Gemma4-12B-QAT Uncensored Balanced Released with 60% Speed Boost via MTP