Reasoning models
arxiv arXiv cs.LG · 8d ago

Fairness in Graph Neural Networks via Laplacian Adaptation

A new framework modifies the Laplacian operator in graph diffusion to enhance fairness by incorporating subspace projections, spectral adjustments, and frequency-based filtering. The method leverages graph diffusion's smoothing properties to mitigate bias, with theoretical analysis and empirical validation on synthetic and real-world datasets showing improved fairness without significant computational overhead.

arxiv arXiv cs.LG · 8d ago

Vision-language models don't always need images for chest X-ray accuracy

A causal audit shows that many vision-language models achieve high chest radiograph accuracy without using images. Text-only models match multimodal models in performance and outperform them in grounding, with accuracy and confidence flags only appearing when image use occurs. These findings suggest that accuracy alone is insufficient to validate clinical deployment, and grounding must be assessed.

arxiv arXiv cs.LG · 8d ago

Blind Recovery of Latent Domains via Unsupervised Symmetry Discovery

The paper proposes an unsupervised framework to recover latent domains and signals from corrupted observations by discovering data symmetries. It models observations as linear measurements of signals from a latent random field and uses a shallow group-convolutional network with stationarity and locality constraints to learn latent symmetry actions and filters, enabling recovery from unstructured data.

arxiv arXiv cs.LG · 8d ago

LLM Belief Stabilization via Prompted Predictive Resampling

Large language models exhibit early belief drift in multiple-choice question answering, violating the martingale property. Prompted predictive resampling (PPR) reveals this drift, which self-stabilizes after sufficient resampling, leading to coherent predictive distributions. We propose a seed-answer prompting strategy and a self-consistency loss to accelerate stabilization and reduce drift, improving predictive coherence without affecting accuracy.

arxiv arXiv cs.LG · 8d ago

Qwen-RobotManip Achieves Generalization in Robotic Manipulation

Qwen-RobotManip, a Vision-Language-Action foundation model, enables large-scale training through unified alignment across representation, motion, and behavior. It uses open-source data to build a 38,100-hour pretraining corpus and demonstrates emergent generalization, outperforming prior state-of-the-art models in out-of-distribution settings and ranking first in RoboChallenge with a 20% relative improvement on real-robot platforms.

arxiv arXiv cs.LG · 8d ago

MKAN: Monotonic Kolmogorov-Arnold Networks with Hard Monotonicity

MKAN introduces a Kolmogorov-Arnold Network with hard monotonicity guaranteed for all parameter values, achieved through exponential reparameterization, positive edge weights, and a monotone base activation. It enables standard gradient descent training and provides a representation-cost theorem showing that any feature extractor can be realized with monotone structure at a size no more than twice the original, offering a principled scaling rule for monotone encoders.