Reasoning models — korshunov.ai

Reasoning models Page 1 / 35

PACT: Small Language Model Deliberation for Reactive Reinforcement Learning

PACT combines a reactive RL policy with a 2B-parameter Small Language Model to generate and validate action plans. The SLM plan is executed directly if verified as safe, feasible, and complete, bypassing the RL policy. PACT outperforms baselines on three increasingly difficult FrozenLake environments.

arxiv arXiv cs.AI · 9d ago

ActiveSAM: Fast and Accurate Open-Vocabulary Segmentation

ActiveSAM is a training-free, zero-shot framework that enhances SAM 3 for open-vocabulary semantic segmentation by identifying an image-conditioned active class set. It improves speed-accuracy tradeoff, outperforming SegEarth-OV3 by +1.4 mIoU on average and running up to 5.5x faster on large-vocabulary datasets, with strong robustness under image corruption.

arxiv arXiv cs.AI · 9d ago

FusionRS: First Large-Scale RGB-Infrared Remote Sensing Dataset

FusionRS introduces the first large-scale RGB-infrared-text dataset for remote sensing vision-language modeling. It aligns RGB and infrared images with IR-aware captions, enabling dual-modal vision-language foundation models. Experiments show improved RGB-IR alignment, retrieval, and captioning, with ablation studies confirming the critical role of modality-specific textual supervision.

arxiv arXiv cs.AI · 9d ago

Phase in Neural Representations: An Internal Oppenheim-Lim Test

Image classifiers like PRISM2D, GFNet, and ViT-B/16 show that phase, not magnitude, drives predictions in hidden layers. ResNet-50 reveals a latent sign code in late blocks, indicating phase/sign identity exists across architectures, though expressed differently due to activation and readout mechanisms.

PACT: Small Language Model Deliberation for Reactive Reinforcement Learning

ActiveSAM: Fast and Accurate Open-Vocabulary Segmentation

FusionRS: First Large-Scale RGB-Infrared Remote Sensing Dataset

Phase in Neural Representations: An Internal Oppenheim-Lim Test

Factorized Neural Operators Decompose Dynamic and Persistent Responses

CEAP Reduces Variance in LLM Circuit Discovery

Adaptive Functional Gradient Descent with Convergence Guarantees

Unified Causal-Origin Taxonomy of Distributional Shifts in RL

Key Properties for Effective Code Interpreter Reasoning

CrossMaps: Confidence-Aware Semantic Mapping for Rover Navigation

CircuitLasso: Scalable Circuit Learning for LLM Interpretability

Hybrid Convolutional VAE for Crypto Volatility Surfaces

Task-Error Residual Learning for Real-Robot Five-Ball Juggling

Dynestyx: Probabilistic Programming for Dynamical Systems

Fingerprinting agent behavior through procedural trajectories

Analytic Torsion and Spectral Gap Capture Persistent-Laplacian Performance

PACT: Small Language Model Deliberation for Reactive Reinforcement Learning

ActiveSAM: Fast and Accurate Open-Vocabulary Segmentation

Neural EXposure Interaction Search for Interpretable HTE

ROVE: Reinforcement Learning with Human Interventions for Humanoid Manipulation