Reasoning models
arxiv arXiv cs.LG · 7d ago

Alzheimer's Diagnosis via Multimodal 3D MRI and PET Fusion

A new study combines 3D MRI and PET data using advanced fusion strategies including GMU and gated self-attention, along with a sparsely gated MoE classifier. Results show GMU achieves 80.46% accuracy on NC vs. MCI and 95.47% on NC vs. AD, with gated self-attention reaching 82.08% on MCI vs. AD. Ablations confirm the MoE significantly improves performance, highlighting the importance of input-adaptive multimodal modeling for accurate Alzheimer's diagnosis.

arxiv arXiv cs.LG · 7d ago

EEG Foundation Models for Burst-Suppression Detection in ICU

A study evaluates EEG Foundation Models for event-based burst-suppression detection in ICU EEG without patient-specific calibration. REVE-base achieved the highest event-based F1-score of 0.868 and reduced burst-per-minute error by 52.1% compared to EEGNet. Ablation experiments show full fine-tuning outperforms other strategies, and pretrained REVE-base surpasses random initialization by 0.723 F1 points at 25% labeled data.

arxiv arXiv cs.LG · 7d ago

Information-Theoretic Analysis of Effective Supervision in Latent Chain-of-Thought

This paper identifies a dual collapse in latent reasoning: gradient attenuation and representational drift. It proposes Trajectory and Space Supervision, showing that generative reconstruction preserves information capacity better than geometric compression. The Unified Latent Probe measures mutual information between latent trajectories and reasoning steps, revealing an information-performance binding in reasoning accuracy.

arxiv arXiv cs.LG · 7d ago

MELT and SALT: Multimodal Contrastive Learning for Earth Embeddings

MELT and SALT are multimodal contrastive learning models that use unpaired geospatial data to improve location embeddings. Both achieve performance equal to the best two-modality baseline across four tasks, but adding more modalities does not consistently boost results, indicating the location encoder's design is the primary performance limit. MELT offers more stable training and is better suited for future model scaling.