The llama.cpp project has released version b9858, which includes a change to use the Hugging Face primary split as the model path. This update resolves issue #25181 regarding model loading paths.
- macOS Apple Silicon (arm64) and Intel (x64) binaries are available.
- iOS XCFramework is included for mobile deployment.
- Linux builds support Ubuntu x64, arm64, s390x, Vulkan, ROCm 7.2, OpenVINO, and SYCL FP32/FP16.
- Android arm64 (CPU) binaries are provided.
- Windows builds cover CPU, OpenCL Adreno, CUDA 12.4/13.3, Vulkan, OpenVINO, SYCL, and HIP.
- openEuler builds for x86 and aarch64 with ACL Graph support are listed, though some configurations are disabled.
This release provides updated binaries for a wide range of hardware architectures and operating systems, ensuring compatibility with the latest model loading standards.