llama.cpp version b9716 introduces batching support for InternVL, enhancing model performance through efficient batch processing. The release includes binary builds for macOS, Linux, Android, Windows, and openEuler across multiple architectures and hardware acceleration options, including Vulkan, OpenVINO, SYCL, and ROCm.
llama.cpp Release b9716 Adds Batching Support for InternVL
Переведено с English → Русский