Reasoning models — korshunov.ai

Reasoning models Page 1 / 35

Conservation Laws for Modern Neural Architectures

This paper introduces a unified framework to identify conservation laws in gradient flow for modern neural architectures. It covers feedforward networks with GELU, SiLU, and SwiGLU activations, multihead attention with sinusoidal and rotary positional encodings, and Mixture-of-Experts models under various gating schemes. Experiments validate the predicted invariants, supporting the theoretical findings.

arxiv arXiv cs.LG · 8d ago

Functional Equivalence in Attention with Positional Encodings

A comprehensive study reveals that sinusoidal positional encodings preserve functional equivalence in Transformers, while rotary positional encodings reduce symmetry, enhancing expressivity. The research shows that positional encodings critically influence linear mode connectivity, with empirical results demonstrating variability in connectivity depending on the encoding used.

arxiv arXiv cs.LG · 8d ago

LLM Belief Stabilization via Prompted Predictive Resampling

Large language models exhibit early belief drift in multiple-choice question answering, violating the martingale property. Prompted predictive resampling (PPR) reveals this drift, which self-stabilizes after sufficient resampling, leading to coherent predictive distributions. We propose a seed-answer prompting strategy and a self-consistency loss to accelerate stabilization and reduce drift, improving predictive coherence without affecting accuracy.

arxiv arXiv cs.LG · 8d ago

Qwen-RobotManip Achieves Generalization in Robotic Manipulation

Qwen-RobotManip, a Vision-Language-Action foundation model, enables large-scale training through unified alignment across representation, motion, and behavior. It uses open-source data to build a 38,100-hour pretraining corpus and demonstrates emergent generalization, outperforming prior state-of-the-art models in out-of-distribution settings and ranking first in RoboChallenge with a 20% relative improvement on real-robot platforms.

arxiv arXiv cs.LG · 8d ago

WallZero Beats Go Pros in WallGo

WallZero, an AlphaZero-based agent, defeats two professional Go players in WallGo, averaging 1.98x more territory per game. The study finds that the opening from the Netflix series creates a more balanced game, suggesting improved fairness in play.

arxiv arXiv cs.LG · 8d ago

Order-Independent Cell-Level Representations for Multi-Task Table Recognition

This paper introduces a structural refinement module using non-causal attention to generate order-independent cell features in autoregressive multi-task table recognition. The approach enables parallel cell content inference while maintaining global context, improving cell localization and end-to-end recognition with a threefold reduction in inference time.

arxiv arXiv cs.LG · 8d ago

MKAN: Monotonic Kolmogorov-Arnold Networks with Hard Monotonicity

MKAN introduces a Kolmogorov-Arnold Network with hard monotonicity guaranteed for all parameter values, achieved through exponential reparameterization, positive edge weights, and a monotone base activation. It enables standard gradient descent training and provides a representation-cost theorem showing that any feature extractor can be realized with monotone structure at a size no more than twice the original, offering a principled scaling rule for monotone encoders.

arxiv arXiv cs.LG · 8d ago

Dimensionality Controls When Modularity Helps in Continual Learning

Modular architecture enhances compositional continual learning only in low-dimensional regimes where representational subspaces partially align for similar tasks. In high-dimensional regimes, both modular and single networks perform similarly, indicating modularity's benefit depends on representational dimensionality induced by initialization scale.

arxiv arXiv cs.LG · 8d ago

Hybrid Ret-DNN with XGBoost for Customer Behavior Forecasting

A study proposes a hybrid Ret-DNN with XGBoost model to forecast customer behavior in e-commerce. Using 500,000 transaction records from a UK retailer, the model achieves a Mean Absolute Error of 0.2193, outperforming the existing Ret-DNN model.

arxiv arXiv cs.LG · 8d ago

SoftMoE: Soft Differentiable Routing for Mixture-of-Experts in LLMs

SoftMoE replaces discrete top-k routing with a differentiable soft top-k LapSum relaxation, enabling gradient-based optimization of expert selection. It learns to allocate expert activation non-uniformly across layers, with later layers activating more experts, while using significantly fewer experts than traditional sparse MoE.

arxiv arXiv cs.LG · 8d ago

CERS: CoT-Enhanced Reasoning for Medical Image Segmentation

CERS introduces Chain-of-Thought reasoning to improve semi-supervised medical image segmentation by integrating linguistic descriptions from large language models. It uses a semantic-aware reference selection and multi-scale coordinate attention to resolve boundary ambiguities and semantic inconsistencies, outperforming state-of-the-art methods in clinical scenarios with visual-semantic mismatch.

arxiv arXiv cs.LG · 8d ago

Half-Link Sufficiency in Knowledge Graph Foundation Models

A new study shows that KGFMs can predict whole links using only partial observations, such as half-links. It identifies four scenarios based on observed half-links and reveals that state-of-the-art models leverage seen half-links, while unseen ones present significant generalization challenges. This taxonomy offers a diagnostic framework for evaluating and improving KGFM robustness.

arxiv arXiv cs.AI · 8d ago

STAR: SpatioTemporal Adaptive Reward Allocation for Text-to-Image RL Post-Training

STAR introduces a spatio-temporal reward allocation method for text-to-image generation, using attention maps to dynamically assign advantages across denoising steps. It improves semantic alignment, text rendering, and preference optimization in Stable Diffusion 3.5 Medium, achieving 0.9759, 0.9757, and 23.60 on GenEval, OCR, and PickScore respectively.

arxiv arXiv cs.AI · 8d ago

Semantics-First Latent Modeling for 3D MRI Reconstruction

A new framework prioritizes anatomical semantics during 3D MRI latent compression, addressing long-range coherence and clinical detail loss. It introduces a Latent Harmonization Encoder and Semantic Recovery Block to preserve meaningful structures, and an Anatomy-aware Frequency Loss to maintain high-frequency diagnostic features. Experiments on public MRI datasets show improved reconstruction and cross-contrast synthesis quality.

arxiv arXiv cs.AI · 8d ago

T-API-Compliant ReAct Loop for Optical Networks

A T-API-compliant ReAct agentic loop is introduced for optical networks, enabling intent-driven, closed-loop management. Domain-specific composite tools achieve 90% oracle-validated correctness and reduce token usage by threefold compared to generic tools.

arxiv arXiv cs.AI · 8d ago

LLM Consumer Behavior Theory: A New Research Field

This paper introduces LLM Consumer Behavior Theory, a new field analyzing how large language models make consumption decisions on behalf of users. It unifies research on LLM decision-making, human behavior simulation, and preference elicitation under economic principles, identifying key gaps in assumptions like rationality and heterogeneity in agentic markets.

arxiv arXiv cs.AI · 8d ago

LegalHalluLens: Auditing Hallucinations in Legal AI

LegalHalluLens introduces a framework to audit AI hallucinations in legal contexts by analyzing typed hallucination profiles across four claim categories. It reveals a 38-40 point gap between obligation/numeric and temporal claims, and shows two systems with identical 52% hallucination rates can have opposite risk directions. The framework uses a Risk Direction Index and calibrated debate pipelines to reduce fabricated detections by 45% and improve accountability in legal AI deployment.

arxiv arXiv cs.AI · 8d ago

Catastrophic Forgetting is Low-Rank: A Function-Space Theory

A function-space theory reveals that catastrophic forgetting in continual adaptation concentrates in a small number of old-task NTK eigenmodes. In frozen-backbone linear-head PEFT-CL, the forgetting vector is exactly predictable up to numerical precision, with a Kronecker scaling rule for the vulnerable rank.

arxiv arXiv cs.AI · 8d ago

Source Language Effects in Cross-Lingual In-Context Learning

A study finds that fine-tuning-based assumptions about cross-lingual transfer do not apply in few-shot In-Context Learning. The research reveals that source language selection significantly impacts performance and identifies new heuristics for effective cross-lingual ICL.

arxiv arXiv cs.AI · 8d ago

ProvenanceGuard: Source-Aware Factuality Verification for MCP-Based LLM Agents

ProvenanceGuard introduces a source-aware verifier for MCP-based LLM agents that detects cross-source conflation by routing claims to specific evidence sources and comparing stated attribution with actual source ownership. It achieves block F1 of 0.802 and source accuracy of 0.858 on 260 source-eligible claims, outperforming source-blind baselines, and detects all injected attribution swaps in 50 clinical probes.