All articles
arxiv arXiv cs.LG · 4h ago

Learning Process Rewards via Success Visitation Matching for Efficient RL

The authors propose a method to transform inherently sparse outcome rewards in reinforcement learning into dense process rewards by training a discriminator to distinguish between successful and unsuccessful episodes. This approach incentivizes the policy to match the state-action visitations of successful episodes while avoiding those of unsuccessful ones, providing dense feedback on progress without altering the optimal policy.

arxiv arXiv cs.LG · 5h ago

PsyBridge: A Hybrid Intelligent Framework for Multi-Dimensional Mental Health Assessment

This study introduces PsyBridge, a hybrid intelligent framework designed to address the limitations of isolated mental health screening tools by integrating clinically validated assessments with cognitive and personality profiling. The system utilizes a modular architecture and weighted aggregation mechanism to generate interpretable risk classifications and decision support recommendations.