All Articles - korshunov.ai

Page 1 of 14

llama.cpp Release b9690 Adds rope_back Operator and Cross-Platform Binaries

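llama.cpp version b9690 introduces a rope_back operator, implemented by reusing the existing rope kernel with a function constant that switches between forward and backward rotation. The release includes prebuilt binaries for macOS, Linux, Android, Windows, and openEuler, supporting multiple architectures and hardware acceleration options including Vulkan, CUDA, ROCm, OpenVINO, and SYCL.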

github llama.cpp · 14 days ago

llama.cpp Release b9687 Adds New Binaries and Fixes

llama.cpp version b9687 introduces new binaries for macOS, Linux, Android, Windows, and openEuler across multiple architectures. The release includes support for Vulkan, ROCm, OpenVINO, SYCL, and HIP, with updates to improve device validation and performance on available hardware.

github llama.cpp · 14 days ago

llama.cpp Release b9688 Brings New APIs and Cross-Platform Binaries

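llama.cpp release b9688 adds APIs for model management and for real-time updates over SSE. The release includes precompiled binaries for macOS, Linux, Android, Windows, and openEuler, supporting various architectures and acceleration frameworks such as Vulkan, CUDA, OpenVINO, and SYCL.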

github llama.cpp · 14 days ago

LLaMA.cpp Release b9685 Adds SYCL Dev2Dev Memcpy and Multiple Platform Binaries

LLaMA.cpp version b9685 introduces SYCL-based dev2dev memcpy functionality, moving GGML_SYCL_DEV2DEV_MEMCPY to a runtime table and improving peer-to-peer communication detection. The release includes precompiled binaries for macOS, Linux, Android, Windows, and openEuler across multiple architectures and APIs, including Vulkan, ROCm, OpenVINO, and SYCL (FP32/FP16).

github llama.cpp · 14 days ago

llama.cpp Release b9686: Fixes Segfault Caused by Long Prompts with Eagle3

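llama.cpp version b9686 fixes a segmentation fault that occurred when processing long prompts with Eagle3 models. The release includes binaries for macOS, Linux, Android, Windows, and openEuler, supporting multiple architectures and hardware acceleration options including Vulkan, CUDA, OpenVINO, and SYCL.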

github llama.cpp · 14 days ago

LLaMA.cpp Release b9684 Adds Conv_3D and Multiple Platform Binaries

LLaMA.cpp release b9684 introduces a new 3D convolution operation (conv_3d) and includes optimized implementations. The release provides prebuilt binaries for macOS, Linux, Android, Windows, and openEuler across various architectures and hardware acceleration options, including SYCL, Vulkan, CUDA, and OpenVINO.

github llama.cpp · 15 days ago

llama.cpp b9682 Released with New Vulkan Support and Multi-Platform Binaries

LLaMA.cpp b9678 Released with New Optimizations and Cross-Platform Builds

llama.cpp Release b9677: Updates and Cross-Platform Binaries

LLaMA.cpp Release b9674: Fixes Async memcpy Bug and Adds New Binaries

llama.cpp b9675 Released with New FP16 Support and Multi-Platform Binaries

llama.cpp Release b9680: New Binaries and Vulkan Support

llama.cpp Release b9673 Supports USM System Allocation and Cross-Platform Binaries

v2.1.179 Release Notes

llama.cpp Release b9660 Includes Fixes and New Binaries

langgraph-cli 0.4.30 Released

Claude v2.1.178 Release Notes

llama.cpp Release b9672 Updates BoringSSL

Release Branch Created for v1.38.0

llama.cpp Release b96669 Adds Backend Sampling for Eagle3