LLaMA.cpp version b9722 fixes a non-bound n_discard value issue in server context handling. The release includes precompiled binaries for macOS, Linux, Android, Windows, and openEuler, supporting various architectures and acceleration frameworks like Vulkan, CUDA, OpenVINO, and SYCL.