Open weights — korshunov.ai

Open weights Page 1 / 11

GLM-5.2 Now Available on HuggingChat

The GLM-5.2 model is now accessible on HuggingChat. Users can access it via the HuggingFace link provided, enabling direct interaction with the model through the platform.

media r/LocalLLaMA · 9d ago

Mistral Announces New Family of Open-Weight Models in July

Mistral has released a new family of open-weight language models in July. The models are designed to be accessible and usable by developers and researchers worldwide, promoting transparency and innovation in AI.

media r/LocalLLaMA · 9d ago

zai-org Releases GLM-5.2

zai-org has released GLM-5.2, a new large language model. The model is available on Hugging Face and is part of the LocalLLaMA community discussions.

media r/LocalLLaMA · 9d ago

bartowski/command-a-plus-05-2026-GGUF on Hugging Face

A GGUF model named command-a-plus-05-2026 is available on Hugging Face. Users are encouraged to test it with the latest version of llama.cpp and share performance benchmarks and feedback.

media r/LocalLLaMA · 9d ago

The Case For Open-Weight Models And Why We Can't Trust Frontier Labs

The article argues for open-weight language models, emphasizing transparency and accessibility. It expresses skepticism toward Frontier Labs, suggesting concerns about their model development and openness.

media r/LocalLLaMA · 9d ago

Qwen3.6 27B Quantization Performance Test Results

A test comparing Q8 and IQ3 XXS turbo4 quantized versions of Qwen3.6 27B shows that Q8 excels in API safety and input sanitization, while IQ3 XXS turbo4 performs better in thread management and modular code design. The model recommends merging both approaches: using Q8 for initial launch protection and IQ3 XXS for atomic writes and thread lifecycle, forming a combined Phase 1 foundation.

media r/LocalLLaMA · 9d ago

Gemma 12b Reasoning Hardening Instructions

A system instruction has been developed to reduce cognitive bias in Gemma 12b's reasoning by requiring strict adherence to premises and explicit user intent. The instruction advises against defaulting to 'usual', 'standard', or 'typical' interpretations, and mandates re-examination of any such assumptions, improving performance on trick questions without overthinking normal ones.

media r/LocalLLaMA · 9d ago

Donate your coding sessions to an open CC-BY-4.0 dataset

A project called Trace Commons invites users to donate their coding session traces to an open dataset licensed under CC-BY-4.0. The initiative aims to provide training data for open-weight and open-source AI models, countering potential data monopolies by Anthropic and OpenAI.

media r/LocalLLaMA · 9d ago

Pooling GPUs to train a community model

A Reddit user asks whether anyone is successfully pooling GPUs to train a community model, highlighting challenges like latency and weight poisoning. The post questions if current distributed volunteer computing projects have achieved successful community model training.

media r/LocalLLaMA · 9d ago

AeroLLM: Fast, open-source AI app for Apple Silicon

AeroLLM is a fast, optimised, and open-source chat application designed for Apple Silicon devices using the MLX backend. It supports local AI tasks like text-to-speech, speech-to-text, and large language models, with models downloaded directly from Hugging Face based on available RAM. The app is notarised due to lack of Apple Developer membership, but users can follow provided steps to run it as a signed macOS app.

media r/LocalLLaMA · 9d ago

Nex-N2 Pro is the real deal

The user found that N2 Pro, when using Rio's chat template, performs reliably on their 128G Mac. It passed a private benchmark on llama.cpp source code 100% of the time without hallucinations, matching only GPT 5.x in consistency.

media r/LocalLLaMA · 9d ago

Are Quantized Image Generation Models Still WIP?

Users report inconsistent results when using quantized models in image generation, with SD 1.5 working well but SDXL failing. Despite successful conversion and quantization using tools like convert.py and llama-quantize, some users obtain poor outputs while others do not, raising questions about the current state and reliability of quantized image generation technology.

media r/LocalLLaMA · 9d ago

Nex2 mini Phase Twin 16GB footprint, 30B model released

The Nex2 mini Phase Twin, a 30B parameter model with 16GB footprint, is now available for Intel users, particularly the A770 lineup. It performs at 89 tokens per second on a single A770 card and is optimized to use the appropriate kernel based on hardware, with enhanced performance when paired with two cards.

arxiv arXiv cs.CL · 9d ago

Symbolic Informalization in Informath Project

The Informath project demonstrates symbolic informalization to convert formal mathematical proofs into fluent, precise natural language. It uses Dedukti as a hub connecting proof systems like Agda, Lean, and Rocq, with Grammatical Framework ensuring linguistic correctness across multiple languages.

arxiv arXiv cs.CL · 9d ago

LOGOS: A General-Purpose Generative Model for Natural Sciences

LOGOS is a unified generative language model that represents scientific objects and their interactions as token sequences in a shared grammar. It achieves consistent or superior performance across diverse natural science tasks, demonstrating the feasibility of a single model serving multiple domains. The model scales positively with parameter count, and its design suggests that AI for Science should align deeply with large language models through shared architectures and training.

arxiv arXiv cs.AI · 9d ago

Symbolic Informalization in Informath Project

The Informath project demonstrates symbolic informalization to convert formal mathematics into fluent, precise natural language. It uses Dedukti as a proof system hub and Grammatical Framework for linguistic correctness across multiple languages, enabling human-readable outputs from AI-generated proofs.

arxiv arXiv cs.AI · 9d ago

RAID: Semantic Graph Diffusion for True Cold-Start and Cross-Lingual Forecasting

RAID introduces a framework that uses metadata-driven semantic retrieval and graph-conditioned diffusion to address true cold-start scenarios. It outperforms foundation models and baselines in forecasting accuracy and interval coverage, reduces inference latency significantly, and enables zero-shot cross-lingual transfer via a shared semantic space.

arxiv arXiv cs.AI · 9d ago

CircuitLasso: Scalable Circuit Learning for LLM Interpretability

CircuitLasso proposes a scalable method for learning sparse circuits in large language models using sparse linear regression. It achieves structural accuracy comparable to state-of-the-art intervention-based methods at significantly lower computational cost, while enabling efficient discovery of semantic feature propagation and improving performance on domain-generalization tasks with reduced cost.

arxiv arXiv cs.AI · 9d ago

Causal Framework for Auditing Synthetic Data Disclosures

A new empirical auditing framework detects and classifies synthetic data disclosures as either true or phantom. It distinguishes direct reproductions of user data from incidental generation without model access or training, using only synthetic output and a held-out control set. The method provides tighter privacy leakage bounds than prior approaches and requires significantly fewer computational resources.

arxiv arXiv cs.AI · 9d ago

AI research documentation improves over decade

Analysis of 56,800 AI conference papers shows documentation practices improved from 2014 to 2024. Papers sharing both code and data increased from 11% to 64%, and estimated reproducibility rose from 28% to 64%. These improvements predate formal reproducibility checklists, indicating a broader shift toward open science.