media r/LocalLLaMA · 12d ago

Adding a Second GPU to X670E Motherboard for Local LLMs

A user wants to add a second 16GB VRAM GPU (5060 Ti or 5070 Ti) to an MSI X670E Tomahawk WiFi motherboard to run large local LLMs such as Qwen 3.6 27B. The current setup has no room for a second card: the primary 5070 Ti occupies the second PCIe slot and leaves the third slot only partially available. The user asks which options are feasible, such as using the fourth PCIe slot or a riser, given the constraints of cooling, stability, and physical fit, especially with a horizontal GPU mount like the Lian Li VG4v4.

github llama.cpp · 12d ago

llama.cpp Release b9724 with Bug Fixes and Cross-Platform Binaries

llama.cpp release b9724 includes several bug fixes and improvements, among them build fixes, overflow avoidance in the area() function, and a sanity check in get_u32(). The release provides pre-built binaries for macOS (arm64 and x64), Linux (x64, arm64, s390x, Vulkan, ROCm, OpenVINO, SYCL), Android (arm64), Windows (x64, arm64, CUDA 12/13, Vulkan, OpenVINO, SYCL, HIP), and openEuler (x86 and aarch64 with ACL Graph support), along with a UI package.