Abstract: Safety guarantee is an important topic when training real-world tasks with reinforcement learning (RL). During online environmental exploration, any constraint violation can lead to ...
Abstract: This article proposes a data-driven model-free inverse Q-learning algorithm for continuous-time linear quadratic regulators (LQRs). Using an agent’s trajectories of states and optimal ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results