Reasoning models — korshunov.ai

Privacy-Preserving RAG via Multi-Agent Semantic Rewriting

A multi-agent framework sanitizes retrieved content by semantically rewriting it to remove sensitive identifiers, reducing privacy leakage under targeted attacks. It preserves contextual fidelity, with a BLEU-1 score of 0.122 versus SAGE's 0.117, and runs as an asynchronous preprocessing step, so it adds no latency to online inference.

arXiv cs.LG · 1d ago

Memory-Efficient Graph Filtering for Scalable Collaborative Filtering

Mem-GF introduces a memory-efficient graph filtering method that approximates polynomial graph filters using Krylov subspaces, eliminating the need to store the full item similarity graph. It achieves up to 5.74× lower memory usage and 4.38× faster runtime while maintaining superior recommendation accuracy compared to state-of-the-art methods, scaling effectively to datasets with tens of millions of interactions.

arXiv cs.LG · 1d ago
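
The idea of filtering without storing the item similarity graph can be illustrated with a generic matrix-free polynomial filter. This is a minimal sketch, not the Mem-GF algorithm: it assumes the similarity graph is S = Rᵀ R for an unnormalized interaction matrix R, and the function name `apply_polynomial_filter` and the coefficients are hypothetical. The result is a combination of x, Sx, S²x, ..., so it lies in the Krylov subspace generated by S and x, and only matrix-vector products with R are needed.

```python
import numpy as np

def apply_polynomial_filter(R, x, coeffs):
    """Return sum_k coeffs[k] * (R.T @ R)^k @ x without ever forming R.T @ R.

    R      : (num_users, num_items) interaction matrix (assumed layout)
    x      : (num_items,) signal to filter, e.g. one user's interaction row
    coeffs : polynomial coefficients c_0, c_1, ..., c_K (illustrative values)
    """
    v = np.asarray(x, dtype=float)
    y = coeffs[0] * v
    for c in coeffs[1:]:
        # One application of S = R.T @ R, done as two matrix-vector products.
        v = R.T @ (R @ v)
        y = y + c * v
    return y

# Toy usage with random data (not from the paper).
rng = np.random.default_rng(0)
R_demo = rng.random((6, 4))
scores = apply_polynomial_filter(R_demo, R_demo[0], [0.5, 0.3, 0.2])
print(scores.shape)
```

Memory stays proportional to the size of R and a few item-length vectors, rather than the items × items matrix a dense filter would need.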

Distilling Transformers into Recurrent Transformers for Efficient Memory

A new distillation method transfers the observation compression strategy of full-history transformers to recurrent models. By training a teacher model to compress observation histories into fixed-size bottlenecks, the approach aligns the student's memory with the teacher's compression. This enables recurrent transformers to achieve near-full-history performance with linear-time complexity, making them viable for long-horizon robotics applications.

arXiv cs.LG · 1d ago

LIG: Layer-wise Integrated Gradients for Transformer Flow Analysis

LIG extends Integrated Gradients to set-to-set maps in Transformers, enabling token-level attribution within layers. It analyzes module-wise and layer-wide attribution consistency and tracks information flow via separate attention and MLP contributions, using target token embedding and zero or zero-attention outputs as baselines. LIG operates at module boundaries without retraining or custom interpreters, offering a diagnostic XAI tool for Transformer internals.

arXiv cs.LG · 1d ago
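
For readers unfamiliar with the base method that LIG extends, standard Integrated Gradients can be sketched as follows. This is plain Integrated Gradients on a toy function, not the layer-wise LIG procedure: the helper name `integrated_gradients`, the midpoint Riemann sum, and the quadratic test function are all assumptions made for illustration. The attribution for feature i is (x_i − x'_i) times the average gradient along the straight path from the baseline x' to the input x.

```python
import numpy as np

def integrated_gradients(grad_fn, x, baseline, steps=50):
    """Approximate Integrated Gradients with a midpoint Riemann sum.

    grad_fn  : callable returning dF/dz at a point z (supplied by the caller)
    x        : input vector
    baseline : reference vector, e.g. all zeros
    steps    : number of interpolation points along the straight-line path
    """
    x = np.asarray(x, dtype=float)
    baseline = np.asarray(baseline, dtype=float)
    # Midpoints of `steps` equal sub-intervals of [0, 1].
    alphas = (np.arange(steps) + 0.5) / steps
    grads = np.stack([grad_fn(baseline + a * (x - baseline)) for a in alphas])
    return (x - baseline) * grads.mean(axis=0)

# Toy function F(z) = sum(z^2), whose gradient is 2z.
f = lambda z: np.sum(np.asarray(z, dtype=float) ** 2)
grad_f = lambda z: 2.0 * z

x_demo = np.array([1.0, -2.0, 3.0])
attributions = integrated_gradients(grad_f, x_demo, np.zeros(3))
# Completeness check: attributions should sum to F(x) - F(baseline).
print(np.isclose(attributions.sum(), f(x_demo) - f(np.zeros(3))))
```

The completeness check at the end is the usual sanity test for an Integrated Gradients implementation.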

Cost Geometry of Belief in Noisy Inference

A finite-machine inference model uses cost geometry to quantify belief transitions, combining optimal transport with Fisher information. The framework reveals a wall, honesty, and rigidity in belief spaces, with the Gaussian belief achieving maximal hyperbolic curvature. Thermodynamics sets the cost unit, and the geometric floor of precision diverges at certainty, with the value -1/4 representing a key scale.

Profile-Based Reference in LLM Grounding

Linguistic Distance Affects Consensus in Neural Cellular Automata

Coherence Illusions in Dutch LLMs Revealed

ARCO: Adaptive Rubric with Co-Evolution for Multi-Step LLM Agents

FastGAN and Transformer Models Improve Aphid Detection in Faba Beans

Topological Neural Dynamics: Neuron-wise Sequence Modeling

NASDAQ: Normalized Observation Space Dynamics-Augmented Q-Learning

Influence-Based Explanations for Dysarthria Severity Assessment

TASER: Task-Differentiated Skill Expansion for Heterogeneous Continual Learning

Social World Model for Lifelong Social Intelligence

Ramanujan Graph Rewiring Alleviates GNN Over-Squashing

Transformer Models Highly Sensitive to Noisy Data in Trajectory Prediction

LLMs Benchmarked for Web Vulnerability Detection

Benchmark Evaluation of Small Language Models for Arabic NLP

Explainable AI Model for Career-Related Depression in University Students