The MiniMax M3 EAGLE3 decoder has been converted to GGUF format and is now compatible with llama.cpp. Testing on a 2x3090, 128GB system with UD-Q2_K_XL quant showed performance improved from 2.3 to 5 tokens per second using --fit and keeping the model in VRAM.