The llama.cpp project has released version b9838, providing pre-built binaries for a wide range of operating systems and hardware accelerators. This release includes support for CPU, GPU (CUDA, Vulkan, ROCm, OpenCL), and specialized AI accelerators across macOS, Linux, Windows, Android, and openEuler.
- macOS builds available for Apple Silicon (arm64) and Intel (x64), with KleidiAI support disabled.
- Linux binaries provided for Ubuntu x64, arm64, and s390x architectures, including CPU, Vulkan, ROCm 7.2, OpenVINO, and SYCL variants.
- Windows releases include CPU, CUDA 12/13, Vulkan, OpenCL Adreno, OpenVINO, SYCL, and HIP builds for x64 and arm64.
- Android support is limited to arm64 CPU builds.
- openEuler builds are available for x86 and aarch64 architectures using ACL Graph, with standard openEuler support disabled.
The release enables users to deploy llama.cpp on diverse hardware configurations without compiling from source.