llama.cpp release b9674 fixes a use-after-free bug in the SYCL backend's asynchronous memcpy path, which was triggered during mixture-of-experts (MoE) prefill. The release ships prebuilt binaries for macOS, Linux, Android, Windows, and openEuler, covering CPU, Vulkan, ROCm, OpenVINO, SYCL, and CUDA builds across multiple architectures.
llama.cpp Release b9674: Fixes Async memcpy Bug and Adds New Binaries
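
For readers unfamiliar with this bug class: a use-after-free around an asynchronous copy typically arises when the source buffer is released while the copy is still in flight. The summary above does not describe the actual patch, so the following is only a generic sketch in standard C++, not SYCL code and not the llama.cpp fix. The names `async_copy` and `stage_and_copy` are hypothetical, and `std::async` merely stands in for a device queue, with the returned future playing the role of a completion event.

```cpp
#include <cstddef>
#include <cstring>
#include <future>
#include <memory>
#include <vector>

// Hypothetical stand-in for an asynchronous device copy. The copy runs on
// another thread; the returned future acts like a completion event.
// Capturing `src` as a shared_ptr keeps the staging buffer alive until the
// copy has finished, even if the caller drops its own reference first.
std::future<void> async_copy(std::shared_ptr<std::vector<float>> src,
                             float * dst, std::size_t n) {
    return std::async(std::launch::async, [src, dst, n] {
        std::memcpy(dst, src->data(), n * sizeof(float));
    });
}

// Fills a temporary staging buffer with `n` copies of `value`, starts the
// copy, and releases the caller's reference while the copy may still run.
// Assumes n > 0.
std::vector<float> stage_and_copy(float value, std::size_t n) {
    std::vector<float> dst(n, 0.0f);
    auto staging = std::make_shared<std::vector<float>>(n, value);
    std::future<void> done = async_copy(staging, dst.data(), n);
    staging.reset();  // safe: the in-flight task still owns the buffer
    done.wait();      // dst is read only after the copy has completed
    return dst;
}
```

Had the staging buffer been a raw pointer freed right after the copy was submitted, the worker could read freed memory. Tying the buffer's lifetime to the in-flight operation, or waiting on the completion event before freeing, are two common ways to avoid that.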