A study evaluates model-free reinforcement learning controllers for nonlinear systems under cyberattacks. Among the reward types compared, the Lyapunov-based reward offers the best resilience while keeping tracking error low, and Proximal Policy Optimization (PPO) outperforms Deep Deterministic Policy Gradient (DDPG) in reducing the variance of key performance indicators (KPIs).
RL Reward Types Enhance Resilience in Cyber-Physical Systems