All articles — korshunov.ai

Claude Code v2.1.198 Release Notes

The v2.1.198 update for Claude Code makes Claude in Chrome generally available and extends background agents with new notification hooks and automated pull request workflows.

lab Google — The Keyword (AI) · 2h ago

NYC educators and industry leaders gather at Google to shape AI in classrooms

Google, the New York Jobs CEO Council, and Urban Assembly hosted an AI summit for 150 education and industry leaders at Google's offices. Attendees discussed how artificial intelligence should be integrated into classrooms.

lab Google — The Keyword (AI) · 2h ago

Google's Latest AI News Announced in June 2026

A recap of the artificial intelligence updates and announcements Google released during June 2026.

github llama.cpp · 3h ago

llama.cpp b9859 release adds OpenCL precompiled kernel support

The llama.cpp b9859 release introduces the ability to load precompiled binary kernels from libraries for OpenCL, specifically targeting Adreno GPUs. This update also provides binaries for macOS, Linux, Windows, Android, and openEuler across CPU, GPU, and various accelerator backends.

lab xAI News · 4h ago

xAI Launches No-Code Voice Agent Builder for Grok Voice

xAI has announced the beta release of Voice Agent Builder, a no-code platform designed to configure production-grade voice agents on Grok Voice in under two minutes. This tool allows operators and developers to deploy high-volume voice agents without building the underlying telephony or AI stack from scratch.

github llama.cpp · 5h ago

llama.cpp b9858 release with HF model path fix

The llama.cpp project has released version b9858, which includes a change to use the Hugging Face primary split as the model path. This update resolves issue #25181 regarding model loading paths.

github llama.cpp · 7h ago

llama.cpp b9857 release: Flash Attention rework and new binaries

The llama.cpp b9857 release introduces a comprehensive rework of the Hexagon Flash Attention implementation, focusing on optimizations and accuracy improvements. This update includes significant changes to the hex-mm and hex-fa modules, such as folding quant tasks into main matmul threads, fusing with ADD operations, and optimizing mask processing.

github llama.cpp · 11h ago

llama.cpp b9855 release adds AVX2 nvfp4 optimization and new binaries

The llama.cpp project has released version b9855, which introduces an AVX2 optimization for the nvfp4 dot product using a UE4M3 Look-Up Table (LUT) within the ggml-cpu backend.

github llama.cpp · 11h ago

llama.cpp b9856 release with CUDA restrict + PDL for FA

The llama.cpp project has released version b9856, introducing consistent use of the `restrict` keyword and PDL for Flash Attention in CUDA. This update is accompanied by pre-built binaries for macOS, Linux, Android, Windows, and openEuler across various hardware backends.

github llama.cpp · 15h ago

Remove PWA navigate fallback to prevent caching API endpoint requests

The update removes the Progressive Web App (PWA) navigate fallback so that API endpoint requests are no longer cached unintentionally.

github llama.cpp · 15h ago

llama.cpp b9852 release adds OpenCL q1_0 support

The llama.cpp project has released version b9852, introducing initial OpenCL support for the q1_0 quantization format. This update includes general q1_0 capabilities and specific Adreno GEMM/GEMV implementations for OpenCL devices.

lab Anthropic News · 20h ago

Anthropic Redeploys Fable 5 Following US Export Controls

Anthropic is restoring global access to its Claude Fable 5 and Mythos 5 models after the US government lifted export controls that had suspended availability for all users. Fable 5 will be available globally starting July 1 on the Claude Platform, with usage limits applying through July 7 before switching to credit-based access.

github llama.cpp · 20h ago

llama.cpp b9851 release fixes CUDA integer truncation and provides binaries

The llama.cpp project has released version b9851, which includes a fix for CUDA to prevent integer truncation and overflow errors in the flash_attn_mask_to_KV_max kernel. This update addresses issues related to KQ mask strides within the specified kernel.

github llama.cpp · 20h ago

llama.cpp b9850 release: Qwen3 fixes and new binaries

The llama.cpp b9850 release introduces specific model support updates, including registering the t_layer_inp tensor for Qwen3Next, fixing input assignment in the layer processing loop, and addressing DFLASH issues for qwen-coder-next. It also adds a tensor for attention normalization in the Qwen3 model.

github MCP (GitHub org) · 22h ago

MCP Python SDK v2.0.0b1 Released with Full 2026 Spec Support

The Model Context Protocol (MCP) Python SDK has released its first beta version, v2.0.0b1, which introduces full support for the 2026-07-28 MCP specification. This pre-release is opt-in only, ensuring that standard installations continue to resolve to the stable 1.x line.

lab Microsoft Research Blog · 1d ago

SkillOpt: Agent skills as trainable parameters

Microsoft Research introduces SkillOpt, a method that treats agent skill files as trainable parameters outside a frozen target model, transforming manual skill editing into a controlled optimization process. This approach improves agent reliability and consistency without updating the underlying model weights.

lab Anthropic News · 1d ago

Claude Science, an AI workbench for scientists, is now available

Anthropic has launched Claude Science in beta, an AI workbench designed to integrate fragmented scientific tools into a single research environment. The platform aims to accelerate discovery by providing auditable artifacts, flexible compute scaling, and specialized agents for domains like genomics and structural biology.

lab Anthropic News · 1d ago

Introducing Claude Sonnet 5

Anthropic has released Claude Sonnet 5, a new agentic AI model designed to perform complex planning, tool use, and autonomous coding tasks at a lower cost than previous Opus-class models. The update narrows the performance gap with Opus 4.8 while offering significant improvements in reasoning, safety, and execution over its predecessor, Sonnet 4.6.

lab Claude Code Releases · 1d ago

Claude Code v2.1.197 introduces Claude Sonnet 5

Anthropic has released version 2.1.197 of Claude Code, which updates the default model to Claude Sonnet 5. This new model features a native 1M-token context window and is available with promotional pricing through August 31.

lab OpenAI News · 1d ago

Inside GeneBench-Pro: 10 Case Studies of Complex Genomic Reasoning

GeneBench-Pro is a benchmark designed to evaluate models on complex genomic reasoning tasks, featuring ten detailed case studies that showcase representative questions and supporting materials. Each case study provides the original prompt, datasets, and context necessary to assess model performance on specific biological challenges.