The llama.cpp project has released version b9846, which includes a Vulkan backend optimization for Asahi Linux. This update rolls back the block size loop in matrix multiplication to improve compatibility and performance on Apple Silicon hardware running Linux.
- Vulkan: Rolls back the BK loop in matmul for Asahi Linux via direct SPIR-V editing.
- Vulkan: Reverts previous BK-loop unroll changes and fixes inline comments.
- macOS/iOS: Provides binaries for Apple Silicon (arm64) and Intel (x64), with KleidiAI disabled.
- Linux: Offers builds for Ubuntu x64, arm64, and s390x across CPU, Vulkan, ROCm 7.2, OpenVINO, and SYCL backends.
- Windows: Includes CPU, OpenCL Adreno, CUDA 12/13, Vulkan, OpenVINO, SYCL, and HIP builds for x64 and arm64.
- Android: Releases an arm64 CPU binary.
This release ensures stable Vulkan support on Asahi Linux while providing comprehensive pre-built binaries for a wide range of hardware architectures and acceleration libraries.