Reasoning models
arxiv arXiv cs.LG · 8d ago

DIPHINE: Neural Estimator for $Φ$-ID in Continuous Systems

DIPHINE is the first neural estimator that uses score-based diffusion models to jointly estimate all mutual information terms required by Integrated Information Decomposition ($Φ$ID) from a single amortized network. It recovers the sixteen non-overlapping information atoms via Möbius inversion and provides a theoretical analysis showing synergy-to-synergy estimation is the most challenging, with accurate results on synthetic benchmarks and real-world physiological data.

arxiv arXiv cs.LG · 8d ago

Smoothness-Based Derandomization of PAC-Bayes Bounds

A new framework derandomizes PAC-Bayes bounds for smooth loss functions by analyzing the generalization gap of the Jensen gap class via Rademacher complexity. The resulting bounds for deterministic predictors involve flatness measures derived from Jacobians and Hessians of the score map, and are applied to linear models and smooth neural networks. A practical regularizer is proposed, computed using folded BatchNorm weights, and validated on CIFAR-10 with varying batch sizes.

arxiv arXiv cs.LG · 8d ago

INDEQS: Graph-Informed Neural Controlled Differential Equations

INDEQS introduces a graph-based neural controlled differential equation framework that incorporates prior directed graph knowledge at architectural levels. It separates inner and outer mixing, offering both graph-constrained and data-adaptive variants, with outer informedness reducing mean absolute error on larger graphs, while inner informedness provides parameter efficiency for known adjacency adherence. Continuous decoders outperform discrete ones in real-world traffic and hydrological forecasting tasks.

arxiv arXiv cs.LG · 8d ago

OrthoReg: Orthogonal Regularization for Hybrid Symbolic-Neural Dynamical Systems

OrthoReg introduces orthogonal regularization to prevent neural components from relearning symbolic structures in hybrid dynamical systems. By directly penalizing overlap between symbolic and neural parts, it enables a complementary decomposition where symbolic models capture expressible physics and neural models handle remaining dynamics. On benchmarks with partial library mismatch, OrthoReg improves symbolic recovery and out-of-distribution performance.

arxiv arXiv cs.CL · 8d ago

CDDTLDA: Transfer Learning for Chinese Dialect Discrimination

A novel framework named CDDTLDA uses transfer learning and data augmentation to address Chinese dialects discrimination with limited annotations. It trains a source ASR model on a large dialect corpus, applies speed, pitch, and noise augmentation to low-resource target dialects, and fine-tunes a target ASR model using self-attention to capture shared semantic features. Experimental results show CDDTLDA outperforms state-of-the-art methods on two benchmark Chinese dialect corpora.

arxiv arXiv cs.CL · 8d ago

PragReST: Self-Reinforcing Counterfactual Reasoning for Pragmatic Language Understanding

PragReST is a self-supervised framework that enhances large language models' pragmatic reasoning by generating counterfactual reasoning traces and training via supervised fine-tuning and reinforcement learning. It outperforms baseline models on four pragmatic benchmarks, improving Qwen3-8B and Qwen3-14B by 5.37% and 5-5.50% accuracy respectively, and maintains strong performance on general-knowledge and mathematical reasoning tasks.