llama.cpp b9858 release with HF model path fix

The llama.cpp project has released version b9858, which includes a change to use the Hugging Face primary split as the model path. This update resolves issue #25181 regarding model loading paths.

macOS Apple Silicon (arm64) and Intel (x64) binaries are available.
iOS XCFramework is included for mobile deployment.
Linux builds support Ubuntu x64, arm64, s390x, Vulkan, ROCm 7.2, OpenVINO, and SYCL FP32/FP16.
Android arm64 (CPU) binaries are provided.
Windows builds cover CPU, OpenCL Adreno, CUDA 12.4/13.3, Vulkan, OpenVINO, SYCL, and HIP.
openEuler builds for x86 and aarch64 with ACL Graph support are listed, though some configurations are disabled.

This release provides updated binaries for a wide range of hardware architectures and operating systems, ensuring compatibility with the latest model loading standards.