Hugging Face Forums — korshunov.ai — ML news

Source · Hugging Face Forums

↑ Back to top of feed

media Hugging Face Forums · 2d ago

Buddy System: Rust entropy monitor with NER-gated uncertainty for tiered LLM inference

The Buddy System uses a Rust entropy monitor to detect per-token uncertainty in local Gemma 3 4B inference, routing only uncertain tokens to Sonnet via NER-gated span extraction and semantic retrieval. Benchmarks show it achieves 71.4% accuracy at $0.21, outperforming the Anthropic Advisor pattern (62.9% at $0.44) across seven Hugging Face datasets, with a key improvement on SQuAD v2 by routing source passage chunks to the cloud model.

media Hugging Face Forums · 3d ago

I built a novel triple-hybrid LLM under 1B parameters for ~$50

Mateusz has developed a full pre-trained language model, Project Inkblot's Titan v1, combining Mamba SSM, Multi-Head Attention, and 32-expert MoE in a single decoder-only architecture under 1B parameters. The model, trained on a single NVIDIA L4 GPU for ~$50, achieves 27.5 validation perplexity and demonstrates efficient scaling via a single-line config update, with all components implemented from scratch in PyTorch. Titan v2's first training cycle is now complete, and dataset expansion is underway.

media Hugging Face Forums · 3d ago

LLMs as Epistemic Accelerators: The Risk Is Not Only Hallucination

LLMs do not merely hallucinate; they amplify human epistemic overconfidence by turning weak hypotheses into coherent, polished claims before evidence is verified. This creates a risk of premature certainty in research, policy, and other domains, not because models lie, but because they accelerate human tendencies to favor elegant explanations over uncertainty.

media Hugging Face Forums · 2d ago

My Hugging Face Account Was Locked

A user reports their Hugging Face account, AntixStudioDesign, was locked unexpectedly during experimentation with AI tools. They have contacted the Safety Team via email and seek advice on account recovery, response time, and data preservation options.

media Hugging Face Forums · 2d ago

AI Music Model Runs in Real Time on Most CPUs in Browser

NanoMaestro Realtime is a 50MB AI music model with 13M parameters that generates piano music in real time using a 2-layer LSTM. It runs locally in the browser via ONNX and Transformers.js with WASM, requiring no GPU or server backend, and works on older Raspberry Pi models.

media Hugging Face Forums · 3d ago

BenchHub Ships Major Update to Open Leaderboard Space

BenchHub has released a major update to its open leaderboard platform, now covering vision, audio, and NLP tasks with consistent metrics and reproducible scoring. The platform features 95 boards, 700+ model submissions, and allows free participation via sign-in with GitHub, Google, or Hugging Face, with full exploration and sample comparisons available at runbenchhub.com.

media Hugging Face Forums · 3d ago

Seeking Indic Document Datasets for AI/OCR Training in India

QuantVectors is seeking annotated document datasets in Indic languages from India, including Hindi, Marathi, Gujarati, Bengali, Punjabi, Tamil, Urdu, Telugu, Odia, Kannada, Malayalam, and Assamese. The datasets must include invoice, receipt, utility bill, payment advice, packing list, commercial invoice, and credit note types, with approximately 400 documents per language, human-verified annotations, and 99%+ accuracy. Datasets must be commercially licensable and can be open-source or commercial, with a request for HuggingFace datasets, research datasets, or vendors specializing in this space.

media Hugging Face Forums · 3d ago

ML Surrogate Models in CFD/FEA: Real-World Practices and Challenges

Engineering practitioners report that graph neural networks and MLPs on parameterized designs offer the best practical balance for predicting fields like temperature and stress. Data efficiency is achievable with 10–50 training samples, especially when transfer learning is applied across similar geometries. Physics-informed neural networks (PINNs) remain largely experimental for complex engineering geometries, with most users relying on data-driven surrogates. Generalization remains a key challenge, with models often failing on out-of-distribution boundary conditions, prompting a return to full solver runs.

media Hugging Face Forums · 3d ago

Capability Is Not in the Weights: Empirical Negative Result on MLP Weight Projection

An empirical study found that projecting MLP weights from one transformer model into another fails to transfer semantic capability. Every tested variant performed worse than the unmodified host model, indicating a structural limitation in weight projection. The results challenge public claims about model capabilities based on benchmarks, showing such claims do not reflect actual internal weight geometry.

media Hugging Face Forums · 3d ago

The Clockwork Dark: A Local-First AI Narrative-RPG Engine

The Clockwork Dark is a local-first, AI-driven narrative-RPG engine that uses a deterministic state machine to resolve all game mechanics. It features two autonomous LLMs that narrate the story, with one acting as a patient world voice and the other as an unreliable, godlike assistant. The game offers players a choice: fight the encroaching supernatural corruption or embrace a quiet life in a bakery, with both paths considered valid endings.

media Hugging Face Forums · 3d ago

NOVA-VAD beats Silero, Pyannote, and WebRTC on noisy audio with 93% accuracy

NOVA-VAD, a lightweight and explainable Voice Activity Detector, achieves 93% accuracy on noisy audio from the UrbanSound8K dataset, outperforming WebRTC (58%), Pyannote (62%), and Silero (87%). It uses only scikit-learn, requires no GPU, and provides feature importance and confidence scores in plain English.

media Hugging Face Forums · 3d ago

Small-scale debug comparison of OLMo-core with Engram graft

A 200-step training comparison between a base OLMo3 600M model and a version with a DeepSeek-style Engram graft shows lower training and evaluation loss, faster grad-norm stabilization, and improved early learning behavior. The Engram graft, injected into layers 1 and 5, increases trainable parameters to ~1.7B but maintains only a 40k increase in active parameters per token, indicating efficient memory usage.

media Hugging Face Forums · 3d ago

Hugging Face Spaces Proxy Strips CORS Credentials Header on OPTIONS Preflight

Users report that Hugging Face Spaces' edge proxy is now stripping the Access-Control-Allow-Credentials header from OPTIONS preflight requests, even when explicitly configured in Express middleware. The header is missing in responses despite code that sets it and includes a wildcard middleware for OPTIONS requests, suggesting the proxy intercepts and modifies the response before it reaches the backend container.

media Hugging Face Forums · 3d ago

Gradio web components failing to load

Gradio web components are failing to load on websites, showing an error about an invalid repo name due to URL-encoded slashes. The issue occurs when loading Hugging Face Spaces via Gradio web components, despite working in the HF space directly.

media Hugging Face Forums · 3d ago

Space flagged as abusive by Cloudflare rule

Hugging Face has paused the Space sainstore-QC/quality-hub, citing a Cloudflare abuse rule. The user reports the flag was issued on 2026-06-17, the Space remains paused, and restart attempts fail despite removing suspected trigger files and pushing a clean commit.

media Hugging Face Forums · 3d ago

The 'Starting' status bug is now fully fixed

Hugging Face has resolved a frontend UI synchronization glitch that caused models to show 'Starting' instead of 'Running'. The issue was purely a latency problem in the interface, requiring no backend changes. Users now see models transition smoothly to 'Running' with the green dot restored.

media Hugging Face Forums · 3d ago

What AI Trend Will Have the Biggest Impact in the Next Five Years?

A discussion on Hugging Face asks which AI development trend will most significantly affect businesses, jobs, or daily life within the next five years. Participants are invited to share their views on emerging AI technologies and their potential real-world impact.

media Hugging Face Forums · 3d ago

Infinitely stuck on 'starting' with Docker container running

A user reports their Docker container with R/Shiny on rocker/r2u successfully builds and shows 'Listening on http://0.0.0.0:7860' in logs, yet the space remains in 'starting' state and is inaccessible. The issue persists despite no code errors, and the user seeks broader attention, noting it may be a platform-side problem with Hugging Face.

media Hugging Face Forums · 3d ago

Tenstorrent AI Accelerator Cards Available

Tenstorrent has released Wormhole and Blackhole AI accelerator cards. The hardware section lists these cards, with discussions on which models are likely compatible.

media Hugging Face Forums · 3d ago

Space stuck 'Restarting' on old commit for 16+ hours

A Hugging Face Space has been stuck showing 'Restarting' on commit 8240352 for over 16 hours, despite multiple newer commits building successfully. The container starts healthily in logs, but traffic never switches to the new version, and recovery actions like factory rebuild or restart have no effect.