Open weights — korshunov.ai

Hypergraph-Based Semantic Reasoning Framework

A new framework called HISR uses hypergraphs to model complex multi-entity relationships, improving semantic interpretation accuracy by up to 36.6% over existing methods. It enables robust semantic inference under partial information loss by mapping entities and higher-order relations into dedicated semantic subspaces.

arxiv arXiv cs.AI · 6d ago

Meaning Intelligence Framework for Nigerian Public Discourse

The Meaning Intelligence Framework (MIF) introduces a nine-dimension schema to analyze Nigerian public discourse, addressing context failure in AI systems. A 30-item calibration dataset shows that schema-informed prompting improves register classification accuracy from 33.3% to 73.3% and boosts the composite Meaning Intelligence Score from 73.2 to 78.6.

arxiv arXiv cs.LG · 6d ago

Critical Percolation as a Synthetic Data Model for Interpretability

A new synthetic dataset based on critical mean-field percolation clusters provides a realistic, analytically tractable model with hierarchical structure. It features sparse, fractal clusters with power-law size distributions and latent variables that generate target values via a taxonomic hierarchy. Neural networks can linearly decode these ground-truth latent variables from activations, demonstrating strong interpretability.

arxiv arXiv cs.CL · 6d ago

FineREX: Fine-Tuned NER-RE for Human Smuggling Knowledge Graphs

FineREX is a domain-specific knowledge graph pipeline that uses a fine-tuned LLM for named entity and relationship extraction. It outperforms general-purpose models by 15.50% in entity F1-score and 31.46% in relationship F1-score, reducing legal noise by nearly half and node duplication from 17.78% to 11.17%. The system also cuts end-to-end processing time by 50.0% by eliminating redundant steps.

arxiv arXiv cs.CL · 6d ago

Semantic Clusters Pre-Train Tsetlin Machine for Interpretability

A new framework pre-trains the Tsetlin Machine using semantic clusters from language models, avoiding embeddings. The method groups text samples into coherent clusters via K-means or Top2Vec, then uses cluster-sample pairs to train a non-negated TM with Type I feedback. Results show superior performance across five datasets, matching BERT-level accuracy while maintaining full interpretability.

arxiv arXiv cs.CL · 6d ago

Credence: Semantic Metrics and Convergence Analysis for Claim Decomposition

Credence introduces Semantic-F1, a BGE-large cosine similarity metric that improves claim decomposition accuracy over Jaccard by 15-32 percentage points. It establishes convergence theorems for rule- and LLM-based repair, showing rule-based repair is finitely terminating and monotone, while LLM-based repair requires early-exit guards. Evaluations across social-media, encyclopaedic, and news domains show EPR from 0.94 to 1.00, with rule-repair reducing atomicity violations by 47-100% without fidelity loss.

arxiv arXiv cs.CL · 6d ago

LLMs Can Process Non-Readable Text with High Semantic Fidelity

Large language models can maintain 99.5% semantic fidelity when processing compact, non-human-readable text forms called BabelTele, even when the text is reduced to 27.9% of its original length. These model-centric representations show strong performance in cross-model transfer, agent memory, and multi-agent communication, suggesting that human readability is not essential for semantic recovery in LLMs.

arxiv arXiv cs.CL · 6d ago

AI-Driven Deliberation: Scaling Inclusivity and Empowering Marginalised Groups

Large Language Models can scale democratic deliberation by scaffolding argumentation and reducing linguistic biases. The chapter uses Systemic-Functional Linguistics to analyze how socio-demographic and communicative variations affect participation, highlighting AI's potential to challenge exclusionary norms while cautioning against over- or under-claiming its capabilities. It calls for ethical safeguards and further research to ensure equitable AI-assisted engagement.

arxiv arXiv cs.CL · 6d ago

Generative Engine Optimization: Measuring AI Search Visibility

A large-scale study of 100K+ AI prompt responses across 100+ brands reveals a three-tier brand visibility ladder: global brands appear in 73% of answers, mid-market brands in 44%, and niche brands in just 11%. AI engines primarily cite corporate websites, with YouTube leading non-corporate sources and best-of listicles accounting for 21% of citations. Sentiment toward a brand is unstable, flipping six times more often than whether the brand is mentioned at all.

arxiv arXiv cs.CL · 6d ago

STAGE: Source-Grounded Data Generation for Text-to-JSON

STAGE is a pipeline that generates text-to-JSON training data by using LLMs to synthesize reports and JSON schemas, validated against underlying spreadsheets. Evaluations on STAGE-Eval show it improves Qwen3-4B exact match from 31.37% to 74.27% and value accuracy from 45.46% to 90.69%.
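The gap between the two reported metrics is easier to read with a concrete scoring rule in mind: exact match is all-or-nothing per document, while value accuracy gives partial credit per field. The following is a minimal sketch under assumed definitions (exact match = the parsed prediction equals the parsed reference; value accuracy = share of reference leaf values reproduced at the same path). The helper names `flatten`, `exact_match`, and `value_accuracy` and the sample records are hypothetical and are not taken from STAGE-Eval's scoring code.

```python
import json


def flatten(obj, prefix=""):
    """Map every leaf value of a parsed JSON object to a path string."""
    if isinstance(obj, dict):
        items = ((f"{prefix}.{k}" if prefix else str(k), v) for k, v in obj.items())
    elif isinstance(obj, list):
        items = ((f"{prefix}[{i}]", v) for i, v in enumerate(obj))
    else:
        return {prefix: obj}
    leaves = {}
    for path, value in items:
        leaves.update(flatten(value, path))
    return leaves


def exact_match(pred_text, gold_text):
    """True only if the prediction parses and equals the reference exactly."""
    try:
        return json.loads(pred_text) == json.loads(gold_text)
    except json.JSONDecodeError:
        return False


def value_accuracy(pred_text, gold_text):
    """Fraction of reference leaf values found at the same path in the prediction."""
    gold = flatten(json.loads(gold_text))
    try:
        pred = flatten(json.loads(pred_text))
    except json.JSONDecodeError:
        return 0.0  # unparseable output earns no partial credit
    if not gold:
        return 1.0 if not pred else 0.0
    hits = sum(1 for path, value in gold.items() if path in pred and pred[path] == value)
    return hits / len(gold)


# Hypothetical records: one of four leaf values is wrong.
gold = '{"region": "EMEA", "revenue": 1200, "quarters": [1, 2]}'
pred = '{"region": "EMEA", "revenue": 1250, "quarters": [1, 2]}'
print(exact_match(pred, gold))     # False
print(value_accuracy(pred, gold))  # 0.75
```

Under this reading, a model can score well on value accuracy while still failing exact match on most documents, since a single wrong field is enough to fail the stricter metric.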

arxiv arXiv cs.CL · 6d ago

Essay Quality Representations in LLMs Found to Be Linearly Accessible

A study reveals that essay quality information in large language models is encoded in linearly accessible forms within their hidden representations. These representations emerge layer-by-layer, remain stable across prompts, and show partial transfer across different essay prompts, with longer essays relying more on deeper model layers. The research identifies specific "essay scoring neurons" whose activation strongly correlates with scores and can be influenced by targeted interventions.

media r/LocalLLaMA · 6d ago

Repurposing an Old Multi-GPU Node for Local Inference

The node has 8 NVIDIA Quadro RTX 6000 GPUs (192 GB of VRAM in total) and 512 GB of RAM, enough for large-scale local model inference. Models such as LLaMA-3 or Mistral in the 8-13 billion parameter range could run efficiently on it, offering private, low-latency inference that is faster than a single-GPU setup, which makes the node worthwhile for internal use.
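A rough back-of-the-envelope check of the sizing claim, counting model weights only. This sketch assumes 2 bytes per parameter (FP16) and ignores KV cache, activations, and runtime overhead, so real usage will be higher. The per-card figure is derived from the totals in the post (192 GB / 8); the function names and the 70B row are added purely for illustration.

```python
import math

GPU_COUNT = 8
VRAM_PER_GPU_GB = 192 / GPU_COUNT  # 24 GB per card, from the totals in the post


def weight_memory_gb(params_billion, bytes_per_param=2.0):
    """Approximate memory for model weights alone, in GB (10^9 bytes)."""
    return params_billion * bytes_per_param


def min_gpus(params_billion, bytes_per_param=2.0):
    """Smallest number of cards whose combined VRAM holds the weights."""
    return math.ceil(weight_memory_gb(params_billion, bytes_per_param) / VRAM_PER_GPU_GB)


for size in (8, 13, 70):
    print(f"{size}B at FP16: ~{weight_memory_gb(size):.0f} GB of weights, "
          f"at least {min_gpus(size)} GPU(s)")
```

Under these assumptions an 8B model fits on a single card, a 13B model at FP16 already needs two cards (or quantization), and the node's total VRAM leaves room for considerably larger models than the 8-13B range mentioned.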

media r/LocalLLaMA · 6d ago

Local Qwen isn't a worse Opus, it's a different tool

The post argues that a locally run Qwen model is not an inferior Opus but a tool for a different purpose. Each model is designed for specific use cases, and comparing them directly overlooks their distinct capabilities and intended applications.

media r/LocalLLaMA · 6d ago

North Mini Code: 4-bit quant, Ollama, and OpenRouter support

Cohere Labs has released a 4-bit quantized version of North Mini Code on Hugging Face, reducing its size to approximately 20GB for local execution on devices like Macs. The model is now supported in Ollama, local runtimes based on llama.cpp, and via the OpenRouter API, improving accessibility for developers.

github llama.cpp · 7d ago

llama.cpp Release b9703: Updates and Binary Downloads

llama.cpp version b9703 includes a rework of the server's preset handling, removing remote HF preset support and deprecated functions. The release provides binaries for macOS, Linux, Android, Windows, and openEuler across multiple architectures and hardware acceleration options, including Vulkan, CUDA, OpenVINO, and SYCL.

github llama.cpp · 7d ago

llama.cpp release b9704: fixes invalid grammar handling and adds new binaries

llama.cpp version b9704 now returns HTTP 400 for invalid grammar instead of silently dropping constraints. The release includes binaries for macOS, Linux, Android, Windows, and openEuler across multiple architectures and hardware accelerators, with support for Vulkan, ROCm, OpenVINO, SYCL, and CUDA.
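For client code, the practical consequence is that a bad grammar now shows up as an error to handle rather than as silently unconstrained output. A minimal sketch of that handling, assuming a llama.cpp server listening on `http://localhost:8080` and its `/completion` endpoint with a `grammar` field; the exception class and function name are hypothetical, and the shape of the error body is not assumed (it is passed through as text).

```python
import json
import urllib.error
import urllib.request


class GrammarRejected(Exception):
    """Raised on HTTP 400, which as of b9704 includes invalid grammars."""


def complete_with_grammar(prompt, grammar, base_url="http://localhost:8080",
                          opener=urllib.request.urlopen):
    """POST a grammar-constrained completion and surface HTTP 400 explicitly."""
    payload = json.dumps({"prompt": prompt, "grammar": grammar, "n_predict": 64})
    request = urllib.request.Request(
        f"{base_url}/completion",
        data=payload.encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    try:
        with opener(request) as response:
            return json.loads(response.read())
    except urllib.error.HTTPError as err:
        if err.code == 400:
            # Treat the body as opaque text; its exact format is not assumed here.
            detail = err.read().decode("utf-8", errors="replace")
            raise GrammarRejected(detail) from err
        raise  # other HTTP errors are left for the caller
```

A caller would wrap `complete_with_grammar(...)` in `try`/`except GrammarRejected` and fix or drop the grammar, instead of assuming that any returned text respected the constraint. The `opener` parameter exists only so the function can be exercised without a running server.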

media r/LocalLLaMA · 7d ago

mistral.rs v0.8.10 adds /v1/skills support for local models

mistral.rs v0.8.10 introduces OpenAI-compatible Agent Skills via a /v1/skills endpoint, enabling local models to execute domain-specific instructions and scripts without relying on frontier APIs. The update supports tools like file uploads and downloads via /v1/files and includes prebuilt binaries for Linux, macOS, and Windows.

media r/LocalLLaMA · 7d ago

GLM-5.2 Inference Free on Hugging Face for Next 6 Hours

Hugging Face is offering free inference access for GLM-5.2 for the next six hours. Users can access the model via the Hugging Face platform, with a recommended prompt provided in the post.

media r/LocalLLaMA · 7d ago

unsloth GLM-5.2-GGUF with 2-bit quantization at 238 GB

The unsloth GLM-5.2-GGUF model is available with 2-bit quantization, sized at 238 GB. It is hosted on Hugging Face and was shared via a Reddit post in the LocalLLaMA community.