Research paper
arxiv arXiv cs.CL · 7d ago

LOCUS: A Local Ordinance Corpus for the United States

LOCUS provides machine-readable access to nearly all publicly available U.S. municipal and county ordinance codes, covering 9,239 cities and counties. It includes a county-harmonized access layer for 2,309 of 3,144 U.S. counties, serving the majority of the population. The corpus, built with OCR and metadata for reproducibility, enables large-scale analysis of local law, including dimensions like opacity and paternalism, using ModernBERT-based models.

arxiv arXiv cs.LG · 7d ago

Domain-Shift Aware Neural Networks for Unbalance Mass Estimation

A domain-shift aware neural network is proposed for estimating unbalance masses in rotating shafts under varying operating conditions. The model uses maximum mean discrepancy to align feature representations across different operational domains, improving prediction accuracy when system behaviors differ from training conditions. Results show its effectiveness in structural health monitoring applications where domain discrepancies are unknown or unaccounted for.

arxiv arXiv cs.LG · 7d ago

TransitNet Achieves 95.2% Accuracy in Low-SNR Transit Searches

TransitNet, a compact attention-augmented deep learning framework, achieves 95.2% accuracy in low-SNR transit blind searches, outperforming TLS and BLS in ROC-AUC and PR-AP values. It recovers 93.0% of injected Earth- and sub-Earth-size transits, with 97.4% of injected transits fully covered by estimated transit windows, and successfully recovers all 34 confirmed Kepler planets with a mean midpoint error of 1.24 hours.

arxiv arXiv cs.LG · 8d ago

Quantum GAN Augmentation Shows No Benefit in Brain MRI

A controlled benchmark finds no significant performance gain from quantum generative models in brain MRI augmentation. Synthetic samples produced by quantum and classical GANs are statistically indistinguishable, with both showing mode collapse and off-distribution samples, especially at low data fractions. The study concludes that quantum augmentation does not outperform classical methods and acts more as regularization than data expansion.

arxiv arXiv cs.LG · 8d ago

DIPHINE: Neural Estimator for $Φ$-ID in Continuous Systems

DIPHINE is the first neural estimator that uses score-based diffusion models to jointly estimate all mutual information terms required by Integrated Information Decomposition ($Φ$ID) from a single amortized network. It recovers the sixteen non-overlapping information atoms via Möbius inversion and provides a theoretical analysis showing synergy-to-synergy estimation is the most challenging, with accurate results on synthetic benchmarks and real-world physiological data.

arxiv arXiv cs.LG · 8d ago

Geometric and Stochastic Analysis of Discontinuities in Sparse Mixture-of-Experts

This paper analyzes discontinuities in Sparse Mixture-of-Experts models, classifying them by order and showing that lower-order discontinuities dominate in volume. It proves that random input paths almost surely first hit an order-1 discontinuity with finite-time probability bounds and derives occupation-time bounds for each order. A simple smoothing mechanism is proposed that enhances model continuity and performance with minimal computational overhead.

arxiv arXiv cs.LG · 8d ago

Context-Aware Follow-Up Optimization for Type 2 Diabetes

A study uses a Contextual Markov Decision Process to optimize follow-up intervals for Type 2 Diabetes patients based on EHR data from 22,154 patients. The model identifies two clinical contexts—low and high risk—and recommends adaptive intervals: 1 month for unmeasured lab values, up to 3 months for elevated values or hospitalizations, and 6–12 months for stable control, with shorter intervals for high-risk patients. The CMDP policies reduced expected cumulative costs by 34.8% in high-comorbidity and 6.4% in low-comorbidity contexts compared to a fixed interval policy.

arxiv arXiv cs.LG · 8d ago

OrthoReg: Orthogonal Regularization for Hybrid Symbolic-Neural Dynamical Systems

OrthoReg introduces orthogonal regularization to prevent neural components from relearning symbolic structures in hybrid dynamical systems. By directly penalizing overlap between symbolic and neural parts, it enables a complementary decomposition where symbolic models capture expressible physics and neural models handle remaining dynamics. On benchmarks with partial library mismatch, OrthoReg improves symbolic recovery and out-of-distribution performance.