llama.cpp releases version b9752 with a server refactor focusing on batch construction, including improved handling of batch full cases and bug fixes. The release includes prebuilt binaries for macOS, Linux, Android, Windows, and openEuler, supporting various architectures and acceleration frameworks like CUDA, Vulkan, OpenVINO, and SYCL.