Publikation

Tractable Bayesian Dynamics Priors from Differentiable Physics for Learning and Control

Joe Watson; Benedikt Hahner; Boris Belousov; Jan Peters

In: 40th Anniversary of the IEEE International Conference on Robotics and Automation (ICRA@40). IEEE International Conference on Robotics and Automation (ICRA-2024), September 23-26, Rotterdam, Netherlands, IEEE, 2024.

Zusammenfassung

Statistical model-based reinforcement learning methods should enable efficient data-driven policy optimization for robotic systems. However, the success of these approaches relies on how well the learned dynamics model generalizes outside of its training data distribution, which is difficult to ensure in practice with ‘black-box’ models unless inductive biases are incorporated. We demonstrate how a differentiable simulation model can be used to synthesize a tractable Gaussian process prior using the linearized Laplace approximation, a principled approximate inference technique for Gaussian pos- teriors. Using this statistical model as an inductive bias, we can perform exploration and decision-making in an informed way using posterior sampling, performing reinforcement learning for control, and active learning for system identification. In the case of simulation-to-reality mismatch, we empirically investi- gate how this physics-informed prior is best used, comparing architecture- and objective-based approaches.