LLaMA.cpp release b9722 fixes an unbounded n_discard value in the server's context handling. The release includes precompiled binaries for macOS, Linux, Android, Windows, and openEuler, covering multiple architectures and acceleration backends such as Vulkan, CUDA, OpenVINO, and SYCL.
LLaMA.cpp Release b9722: Fixes and Cross-Platform Binaries