LLaMA.cpp release b9678 includes optimization of mul_mat_f16_f32_l4 for decode and introduces new builds for macOS, Linux, Android, Windows, and openEuler. The release offers CPU, Vulkan, ROCm, OpenVINO, SYCL, and HIP support across multiple architectures, with a dedicated UI package available.
LLaMA.cpp Release b9678 Adds Optimizations and Cross-Platform Builds
from English