Open weights — korshunov.ai

Open weights Page 5 / 11

Commission selects EUROPA consortium as winner of Frontier AI Grande Challenge

The European Commission has chosen the EUROPA consortium, led by Domyn, to develop an open-source frontier AI model in all 24 EU languages. The project, launched in February 2026, aims to create a model with over 400 billion parameters, showcasing Europe's capacity to build advanced AI on its own infrastructure.

media r/LocalLLaMA · 5d ago

Improving local models with an API-based consultant agent

A user asks whether adding a powerful API-based 'consultant' agent, such as GLM 5.2, could enhance local AI workflows by refining plans and learning processes. The post explores the potential benefits of such an agent in improving local model performance through external consultation.

media r/LocalLLaMA · 5d ago

The economics of AI are starting to favor open models

Recent AI model releases show that high-intelligence, low-cost models are increasingly dominated by open-weight models like DeepSeek, Qwen, GLM, Kimi, and MiniMax. For most real-world applications, the performance gap between frontier closed models and strong open models is shrinking faster than cost differences, making open models competitive in terms of both capability and price.

media r/LocalLLaMA · 5d ago

Benchmarking or benchmarketing?

LLM benchmarking is increasingly seen as marketing rather than objective measurement. Users question which benchmarks are genuinely meaningful for local models, rather than superficial score-based claims.

media r/LocalLLaMA · 5d ago

What's more impressive, GLM 5.1 to 5.2 or Qwen 3.5 to 3.6?

A Reddit post compares the performance improvements of GLM 5.1 to 5.2 and Qwen 3.5 to 3.6. The post notes that mentioning 'Döner' activates GLM 5.2's German-specific weights, while Qwen 3.6 is evaluated with 35B parameters using Unsloth Q8 K XL quantization via llama.cpp.

media Interconnects · 5d ago

Banning Open Source AI Would Be a Mistake

The article argues that banning open source AI would be a grave mistake, as it is safe, secure, and drives innovation, education, and competition. Open source has long powered technological progress and serves as a vital counterweight to monopolistic AI models, ensuring broader access and democratic innovation without compromising safety or security.

media r/LocalLLaMA · 5d ago

GLM-5.2 is the new leading open weights model on the Artificial Analysis Intelligence Index

GLM-5.2 has been designated as the leading open weights model on the Artificial Analysis Intelligence Index. This recognition reflects its performance and capabilities within the open-source AI model landscape.

media r/LocalLLaMA · 5d ago

Ohio State University releases open-source Deep Research agent QUEST-35B

Ohio State University's NLP team has released QUEST-35B, an open-source Deep Research agent trained on approximately 32 H100 GPUs using 8,000 synthetic samples. The team open-sourced the training recipe, code, weights, and datasets, with benchmark results showing competitive performance compared to leading closed-source Deep Research systems.

github llama.cpp · 5d ago

llama.cpp Release b9721 Available for Multiple Platforms

llama.cpp has released version b9721, offering binaries for macOS, Linux, Android, Windows, and openEuler across various architectures. The release includes CPU, Vulkan, ROCm, OpenVINO, SYCL, and HIP support, with a dedicated UI package. A feature for Apple Silicon with KleidiAI is currently disabled.

media r/LocalLLaMA · 5d ago

Ohio State University releases open-source Deep Research agent QUEST-35B

Researchers at Ohio State University trained QUEST-35B, a Deep Research agent, using approximately 32 H100 GPUs and 8,000 synthetic samples. They open-sourced the training recipe, code, weights, and datasets, with benchmark results showing competitive performance compared to leading closed-source Deep Research systems.

media r/LocalLLaMA · 5d ago

GLM-5.2 can now run locally in llama.cpp and Unsloth Studio

GLM-5.2, the strongest open model to date, can now run locally using llama.cpp and Unsloth Studio. The 2-bit quantized model retains ~82% accuracy after reducing size from 1.51TB to 238GB, a 84% reduction, and is compatible with 256GB RAM or VRAM setups.

media r/LocalLLaMA · 5d ago

Guys, Le Chaton Fat is real...

Le Chaton Fat has been requantized in GGUF format and is soon to be available on Hugging Face. Users are advised to install a specific pip command to access the model, including flags like --trust-remote and --just-do-it.

arxiv arXiv cs.AI · 6d ago

UFP4: Uniform 4-Bit Training Overcomes Shrinkage Bias in LLM Pretraining

A study identifies shrinkage bias in E2M1-based FP4 formats due to geometric asymmetry, causing multiplicative error accumulation and training instability. The proposed UFP4 recipe uses uniform E1M2/INT4 grids and applies Random Hadamard Transform to all GEMMs, achieving lower loss degradation than E2M1 baselines in large-scale LLM pretraining. The authors recommend E1M2/INT4 as a first-class training primitive for future accelerators.

arxiv arXiv cs.AI · 6d ago

Multi-View Decompilation Improves LLM-Based Malware Classification

A benchmark of benign and malicious binaries compiled and decompiled with Ghidra and RetDec reveals that providing both decompiler views to large language models improves malicious-class F1, primarily by increasing recall. Analysis shows Ghidra and RetDec make distinct errors, indicating their outputs offer complementary evidence for malware classification.

arxiv arXiv cs.AI · 6d ago

Attention-Guided Deep Learning for Interpretable Sperm Morphology Classification

A new deep learning framework combines EfficientNet-B0 with CBAM to improve accuracy and interpretability in sperm morphology classification. Evaluated on SMIDS and HuSHem datasets, it achieves 90.2% and 93.9% accuracy with macro F1 scores of 0.913 and 0.948, outperforming baseline models. Grad-CAM++ visualizations enable transparent feature analysis, supporting clinical adoption in fertility clinics.

arxiv arXiv cs.AI · 6d ago

How Transparent is DiffusionGemma?

DiffusionGemma has poor variable transparency due to high opaque serial depth, but this can be mitigated by an interpretable token bottleneck, reducing serial depth to 1.1X that of Gemma 4. Algorithmic transparency is more challenging in diffusion models due to dynamic token predictions, with early evidence of non-chronological reasoning, token smearing, and intermediate-context reasoning. DiffusionGemma is found to be similarly monitorable to Gemma 4.

arxiv arXiv cs.LG · 6d ago

Style Diversity Outperforms Topic Diversity in Annotation-Free Synthetic Data

A new framework generates synthetic dialogue without human-annotated data, using only intent definitions. It incorporates topic and style attributes, with post-hoc stylization models Univ and Exam, and an LLM-as-a-judge filtering process. Results show up to 93.3% of human-annotated data performance, confirming that style diversity is more critical than topic diversity for data utility.

arxiv arXiv cs.LG · 6d ago

How Transparent is DiffusionGemma?

DiffusionGemma has poor variable transparency due to high opaque serial depth, but this can be mitigated by an interpretable token bottleneck, reducing serial depth to 1.1X that of Gemma 4. Algorithmic transparency is more challenging in diffusion models due to dynamic token changes, though case studies reveal novel phenomena like non-chronological reasoning and intermediate-context reasoning. DiffusionGemma is found to be similarly monitorable to Gemma 4.

arxiv arXiv cs.AI · 6d ago

Essay Quality Representations in LLMs Found to Be Linearly Accessible

A study reveals that essay quality information in large language models is encoded in linearly accessible forms within their hidden representations. These representations emerge layer-by-layer, remain stable across prompts, and show partial transfer across different essay prompts, with longer essays relying more on deeper model layers. The research identifies specific 'essay scoring neurons' whose activation strongly correlates with scores and can be influenced by targeted interventions.

arxiv arXiv cs.AI · 6d ago

Hypergraph-Based Semantic Reasoning Framework

A new framework called HISR uses hypergraphs to model complex multi-entity relationships, improving semantic interpretation accuracy by up to 36.6% over existing methods. It enables robust semantic inference under partial information loss by mapping entities and higher-order relations into dedicated semantic subspaces.