All articles
arxiv arXiv cs.LG · 3d ago

Learning Process Rewards via Success Visitation Matching for Efficient RL

The authors propose a method to transform inherently sparse outcome rewards in reinforcement learning into dense process rewards by training a discriminator to distinguish between successful and unsuccessful episodes. This approach incentivizes the policy to match the state-action visitations of successful episodes while avoiding those of unsuccessful ones, providing dense feedback on progress without altering the optimal policy.

arxiv arXiv cs.LG · 3d ago

PsyBridge: A Hybrid Intelligent Framework for Multi-Dimensional Mental Health Assessment

This study introduces PsyBridge, a hybrid intelligent framework designed to address the limitations of isolated mental health screening tools by integrating clinically validated assessments with cognitive and personality profiling. The system utilizes a modular architecture and weighted aggregation mechanism to generate interpretable risk classifications and decision support recommendations.

arxiv arXiv cs.AI · 6d ago

Adaptive Hard-Soft Physics-Informed Neural Networks for Robust Boundary-Constrained PDE Solving

This study proposes a unified hard--soft physics--informed neural network (HSPINN) with adaptive loss weighting to address the slow convergence and inaccurate boundary enforcement of conventional PINNs. The framework enforces Dirichlet and periodic boundary conditions exactly through analytical lifting or masking, while treating PDE residuals and initial conditions as soft constraints balanced by an inverse-share softmax strategy.

arxiv arXiv cs.AI · 6d ago

Measuring & Mitigating Over-Alignment for LLMs in Multilingual Criminal Law Courts

This article addresses the challenge of over-alignment in large language models used within Swiss Federal Supreme Court criminal law contexts, where model guardrails frequently trigger refusals when processing sensitive case details. The authors introduce TF-RefusalBench, a multilingual benchmark derived from public rulings, to measure this phenomenon across French, German, Italian, and English.