MORL-A2C: Multi-Objective Reinforcement Learning Reranker for Health
Researchers introduce MORL-A2C, a sequential decision-making extension to the MOPI-HFRS system that uses an Advantage Actor-Critic algorithm to optimize the trade-off between user preference and nutritional health in food recommendations.