CrewAI 1.15.1a1 Release Notes
The CrewAI 1.15.1a1 update introduces new telemetry tracking, enforces explicit project definitions for CrewAI, and improves the CLI deployment workflow.
The CrewAI 1.15.1a1 update introduces new telemetry tracking, enforces explicit project definitions for CrewAI, and improves the CLI deployment workflow.
The v0.24.0 release includes a continuous integration update to raise the GSM8K startup timeout for MoE Refactor Qwen3 NVFP4 configurations.
OpenAI has initiated a limited preview of the GPT-5.6 series, introducing three new models: Sol as the flagship, Terra for balanced everyday work, and Luna for fast, affordable tasks. The company plans to make these models generally available in the coming weeks following this initial phase with trusted partners.
The llama.cpp project has released version b9821, which introduces command-line interface updates allowing users to invoke --version, --licenses, and --help flags. This release provides a comprehensive set of pre-built binaries for macOS, Linux, Android, Windows, and openEuler across various hardware accelerators.
A job posting seeks an experienced NLP or LLM engineer to develop the first Retrieval-Augmented Generation (RAG) localization engine for a low-resource language spoken in South America. The project utilizes a proprietary corpus of pedagogical content and dictionary data developed over four years.
Claude Code version 2.1.195 introduces several fixes and improvements, including new environment variables for mouse control in fullscreen mode and corrections to hook matcher logic.
The llama.cpp b9820 release introduces performance improvements by reintroducing less synchronizations during split compute, specifically targeting CUDA backends. This update also provides pre-built binaries for macOS, Linux, Windows, Android, and openEuler across CPU, GPU, and specialized hardware accelerators.
The llama.cpp project has released version b9816, which includes a synchronization with the ggml library. This update provides pre-built binaries for macOS, iOS, Linux, Windows, Android, and openEuler platforms.
The llama.cpp b9817 release updates the OpenVINO backend to version 2026.2.1 and makes its release packages self-contained. This update includes several operator improvements within the OpenVINO backend, such as removing hardcoded compute_op_type sets and enabling softmax with sink input.
The llama.cpp b9813 release introduces Vulkan support for Intel Xe-LPG Plus hardware by adding the INTEL_XE1 architecture enum and enabling coopmat1. This update addresses previous code comments, renames the architecture identifier, and includes a Windows driver check.
The llama.cpp project has released version b9814, which includes an optimization for the `mul_mat_vecq` operation in Vulkan specifically targeting the AMD mi50 GPU. This update is accompanied by a comprehensive set of pre-built binaries across multiple operating systems and hardware architectures.
This Hugging Face discussion thread addresses the methods and considerations for testing Large Language Models to ensure they are suitable for real-world applications.
A user on the Hugging Face forum reports that their arXiv paper, "Agent-as-a-Router: Agentic Model Routing for Coding Tasks," was successfully indexed and claimed but never appeared on the Daily Papers homepage. Despite receiving community upvotes and linking a corresponding dataset, the paper has not been featured after several days.
The Model Context Protocol (MCP) Python SDK has released its third alpha version, v2.0.0a3, introducing significant protocol and architectural changes while maintaining backward compatibility for stable 1.x users.
The llama.cpp project has released version b9811, which includes a fix for a compiler bug affecting the conv2d coopmat2 path in Vulkan. This workaround is also applied to the CONV_3D implementation based on suggestions from NVIDIA engineer Jeff Bolz.
The llama.cpp project has released version b9810, introducing a CUDA mapping for `cublasSgemmBatched` in HIP/MUSA vendor headers. This update is accompanied by a comprehensive set of pre-built binaries for macOS, Linux, Windows, Android, and openEuler platforms.
The Model Context Protocol Python SDK has released version 1.28.1, introducing updates to stream handling and transport security.
Pendo is hiring onsite Staff and Senior AI Engineers in New York City to work on Novus, a production-grade product agent designed to autonomously read live codebases and detect real user pain.
This article presents a tutorial on using eBPF with Go to achieve kernel-level observability, addressing the lack of visibility when debugging production issues in AI-generated services.
The llama.cpp b9804 release introduces a fix for the Mamba2 architecture by removing a hardcoded 2x expansion factor and an invalid parameter check, allowing support for any expand value. This change updates the `convert_hf_to_gguf.py` script to make the expand parameter optional with a default of 2.