API & product launches
media Hugging Face Forums · 1h ago Live

User Reports Tool and MCP Server Unavailability for Step 3.7 Flash on HuggingChat

A user on the Hugging Face forums reported that the Step 3.7 Flash model lost the ability to use tools and connect to MCP servers starting that morning. The poster expressed strong satisfaction with the model's performance, noting its high quality relative to its low resource consumption and cost. They emphasized a desire to continue using this specific model rather than switching to alternatives due to its efficiency. The user explicitly asked whether this loss of functionality is permanent and if there are any steps they can take to restore access. The post highlights community concern regarding the sudden disruption of tooling capabilities for a popular, cost-effective model.

blog Simon Willison · 3h ago

Simon Willison converts MDN browser compatibility data into a SQLite database

Inspired by Mozilla's new MDN MCP service, Simon Willison has converted the comprehensive mdn/browser-compat-data repository into a SQLite database. The project utilizes a script generated by Claude Code for web (Opus 4.8) to perform this conversion using sqlite-utils. The resulting database is approximately 66MB in size and is hosted on GitHub with open CORS headers to facilitate direct access. To automate the process, a GitHub Actions workflow was built using Codex Desktop (GPT-5.5) to force-push the updated database to an orphan branch named db. Users can download the final browser-compat.db file directly from the repository or explore its contents via Datasette Lite.

media Hugging Face Forums · 4h ago

User Reports Step 3.7 Flash Model Tool Access Failure on HuggingChat

A user on the Hugging Face discussion forum reported that the Step 3.7 Flash model by StepFun AI has lost its ability to use tools, including MCP servers, as of the morning of the report. The individual expressed concern over whether this outage is temporary or permanent, noting their strong preference for this specific model due to its high performance and low resource costs compared to competitors. Despite praising the model's quality and affordability, the user highlighted the immediate disruption caused by the inability to execute tool-based functions. The post seeks clarification from the community regarding prior experiences with similar issues and potential resolutions. This incident underscores a critical dependency on tool availability for users relying on this specific AI configuration.

github LlamaIndex · 5h ago

Llama Index v0.14.23 Release Notes

Llama Index released version 0.14.23 on June 24, 2026, introducing significant multimodal capabilities and various bug fixes. The core update includes multimodal synthesis features and the introduction of multimodal query engines to support diverse data types. Key fixes address document and video block handling within FunctionTool outputs and ensure URL-backed memory blocks are preserved correctly. Performance improvements were implemented by using sets for within-batch deduplication in the ingestion pipeline and optimizing token text splitting logic. The release also resolves a ZeroDivisionError on empty input sequences and fixes recursion errors in splitters when units exceed chunk sizes. Additionally, explicit UTF-8 encoding was added to file I/O operations, and deep copying of initial states prevents mutation leaks across workflow runs.

lab Claude Code Releases · 5h ago

Claude Code v2.1.191 Release Notes

Claude Code version 2.1.191 introduces /rewind support, allowing users to resume conversations from before a /clear command was executed. The update fixes several critical issues, including background agents resurrecting after being stopped and scroll position jumping during streaming responses. It also corrects behavior where /voice displayed generic error messages and where /login URLs were truncated in Windows Terminal. Significant improvements enhance reliability for MCP servers by adding retry logic for transient network errors during capability discovery and OAuth flows. Headless environments now skip browser popups for OAuth, while sandbox network permissions are remembered for the session duration. Performance optimizations reduce CPU usage during streaming by approximately 37% through text update coalescing and mitigate long-session memory growth from the terminal output cache.

media r/LocalLLaMA · 9h ago

I reverse engineered Windows Copilot into a free OpenAI-compatible API

A user has created a local API that replicates OpenAI-compatible GPT-4 functionality using Microsoft's free Copilot service. The tool logs into a Microsoft account once, runs locally on a Windows device, and exposes a server at http://localhost:8000/v1 that supports streaming and multi-turn conversations without requiring an API key or billing. It is designed for personal and educational use, and available via GitHub at https://github.com/sums001/Windows-Copilot-API.