All articles — korshunov.ai

All articles Page 1 / 118

Overrefusal from Small On-Premises LLMs in Criminal Legal Context

A study investigates the impact of overrefusal on small, on-device large language models when processing legal prompts, finding that authority-style prefixes systematically increase refusal rates by 2 to 20 times compared to a no-prefix baseline. While role-play jailbreak prefixes showed mixed effects across different models, the results indicate that these small LLMs are unstable under contextual framings typical of real institutional users.

arxiv arXiv cs.AI · 4h ago

ASALT: Adaptive State Alignment for Lateral Transfer in Multi-agent Reinforcement Learning

This paper introduces ASALT, a method for lateral transfer learning in multi-agent reinforcement learning that accommodates mismatched state-space dimensionalities between source and target domains. The approach uses observation-level and state-level adapters to map inputs into a shared embedding space, enabling effective knowledge transfer across heterogeneous environments.

media r/LocalLLaMA · 4h ago

Dual GPU Value: Parallelism Over Model Size for Local LLMs

The author argues that upgrading from a single to dual GPU offers greater benefits through parallel processing rather than enabling the use of larger, higher-quality model quantizations. For coding tasks, the quality difference between Q4 and Q6/Q8 quantizations is minimal, making increased context window and throughput more valuable.

media r/LocalLLaMA · 4h ago

Effect of GLM 5.2 !!

A Reddit user shared an image titled "Effect of GLM 5.2 !!" in the r/LocalLLaMA subreddit.

media r/LocalLLaMA · 4h ago

Proposing a unified open dataset instead of decentralized LLM training

The author argues that the open-source community should prioritize building a massive, high-quality pre-training dataset rather than attempting to coordinate decentralized LLM training across home GPUs. This shift is presented as a more practical and immediate response to recent government bans on commercial frontier models and a scarcity of small-to-medium open-weight releases.

media r/LocalLLaMA · 4h ago

Bolt Graphics GPU to feature 2 DDR5 laptop DIMM slots

Bolt Graphics is developing a GPU that includes two DDR5 SODIMM slots for overflow memory, aiming for full production by Christmas 2027. The company has working prototypes and targets creators as its initial audience.

arxiv arXiv cs.AI · 5h ago

Uncertainty-Aware Longitudinal Forecasting of Alzheimer's Disease Progression Using Deep Learning

This study proposes a probabilistic framework for longitudinal modeling of Alzheimer's disease progression that combines ordinal diagnosis prediction, multi-horizon trajectory generation, and decomposed uncertainty estimation. The approach utilizes a Temporal Fusion Transformer encoder and an autoregressive Mixture Density Network to generate five-year probabilistic trajectories while quantifying both aleatoric and epistemic uncertainty.

arxiv arXiv cs.AI · 5h ago

ScaleToT: Generalizing Structured LLM Reasoning for Billion-Scale Low-Activity User Modeling

The paper introduces ScaleToT, a method that learns structured reasoning from a small subset of users and extends it to billions of low-activity users with sparse profiles. It combines a bounded entropy-guided Tree-of-Thought refinement with supervised fine-tuning and reward policy optimization to transfer reasoning capabilities without full LLM inference.

arxiv arXiv cs.AI · 5h ago

Abstractions of Queries in Ontology-Based Data Access

This article addresses query abstraction in ontology-based data access (OBDA) by translating data queries to the ontology layer using existential rules and certain answer semantics.

arxiv arXiv cs.AI · 5h ago

When CQs Go Wrong: Challenges in CQ Verification with OE-Assist

This paper investigates the challenges of Competency Question (CQ) verification, a process where ontologies are evaluated against natural language questions to ensure proper modeling. The authors analyze why CQs become difficult and how an LLM assistant can support users during this evaluation.

arxiv arXiv cs.AI · 5h ago

Infinitesimal Causality

This paper introduces a categorical account of infinitesimal causality in Frobenius Markov categories equipped with tangent-bundle semantics. It defines causal sufficiency through the compatibility of two distinct Frobenius structures: one encoding classical variable operations and another representing geometric integrability.

arxiv arXiv cs.AI · 5h ago

Themis: An explainable AI-enabled framework for Reinforcement Learning with Human Feedback

The authors introduce Themis, an XAI-enabled testing and evaluation framework that combines transparency through explainability with alignment via human feedback for safe Reinforcement Learning systems.

arxiv arXiv cs.AI · 5h ago

Privacy-Preserving RAG via Multi-Agent Semantic Rewriting

The authors propose a multi-agent framework that sanitizes retrieved content in Retrieval-Augmented Generation (RAG) systems through semantic rewriting to prevent privacy leakage from malicious prompts. By employing three specialized agents for privacy extraction, semantic analysis, and reconstruction, the approach removes sensitive identifiers while preserving the core meaning of the text.

arxiv arXiv cs.AI · 5h ago

SAFARI: Scaling Long Horizon Agentic Fault Attribution via Active Investigation

The article introduces SAFARI, a framework designed to diagnose failures in autonomous agents by replacing linear context loading with a tool-augmented diagnostic loop. This approach decouples diagnostic accuracy from architectural context limits by using specialized tools and short-term memory to analyze trajectory segments.

arxiv arXiv cs.AI · 5h ago

Visualizing 'We the People': Bridging the Perception Gap through Pluralistic Data Storytelling

This article examines how intentional, pluralistic design choices in AI-enabled digital platforms can produce visualizations that emphasize nuance and intergroup commonalities, thereby reducing political polarization. It highlights a specific deliberative technology initiative that maps high-dimensional opinion spaces to reveal areas of both consensus and dissensus among diverse populations.

media r/LocalLLaMA · 5h ago

Mellum2 local deployments

JetBrains has open-sourced the Mellum2 models, a series of 12B-2.5A LLMs trained from scratch to target fast inference on H100/H200 hardware as well as local deployments.

arxiv arXiv cs.AI · 6h ago

CineCap: Structured Reasoning with Spatio-Temporal Anchors for Cinematographic Video Captioning

Researchers propose CineCap, a framework that combines structured reasoning with spatio-temporal anchors and reinforcement learning to improve cinematographic video captioning. The method grounds professional film-language descriptions in explicit visual evidence while balancing descriptive completeness and factual correctness.

media AI News (smol.ai) · 6h ago

Anthropic launches Claude Tag, a Slack-native async delegation tool

Anthropic has launched Claude Tag, a new workflow feature that allows teams to delegate work to Claude asynchronously within Slack. Positioned as a shift from one-user chat to teamwide collaboration, the tool enables Claude to join as a team member with access to selected channels, tools, and codebases.

lab NVIDIA Technical Blog · 6h ago

Maximize AI Factory Energy Efficiency Through Full-Stack Inference and Training Optimizations

Power consumption represents 40% of the operating expenses for running an AI factory, with performance per watt becoming a critical efficiency metric that directly impacts token costs.

media r/LocalLLaMA · 6h ago

Building a web access layer for local AI agents

A developer shares their experience of creating a centralized web access layer to manage interactions between local AI models and external services. This approach addresses the maintenance burden of building individual integrations for every new agent project.