github llama.cpp · 6 days ago · inference

llama.cpp Release b9716 Adds Batching Support for InternVL

llama.cpp release b9716 adds batching support for InternVL, which is intended to improve performance by processing inputs in batches. The release ships prebuilt binaries for macOS, Linux, Android, Windows, and openEuler across multiple architectures and hardware acceleration backends, including Vulkan, OpenVINO, SYCL, and ROCm.

Importance 0/3 · Trust 2/3 · llama.cpp
