Smart Forgetting for Safe Online Learning with Gaussian Processes

Jonas Umlauft, Thomas Beckers, Alexandre Capone, Armin Lederer, Sandra Hirche

Research output: Contribution to journalConference articlepeer-review

7 Scopus citations

Abstract

The identification of unknown dynamical systems using supervised learning enables model-based control of systems that cannot be modeled based on first principles. While most control literature focuses on the analysis of a static dataset, online learning control, where data points are added while the controller is running, has rarely been studied in depth. In this paper, we present a data-efficient approach for online learning control based on Gaussian process models. To enable real-time capability despite high computational loads with growing datasets, we propose a safe forgetting mechanism. Using an entropy criterion, data points are selected based on their utility for the future trajectory under consideration of the stability of the closed-loop system. The approach is evaluated in a simulation and in a robotic experiment to demonstrate its computational efficiency.

Original languageEnglish
Pages (from-to)160-169
Number of pages10
JournalProceedings of Machine Learning Research
Volume120
StatePublished - 2020
Event2nd Annual Conference on Learning for Dynamics and Control, L4DC 2020 - Berkeley, United States
Duration: 10 Jun 202011 Jun 2020

Keywords

  • Gaussian processes
  • data-driven control
  • data-efficient learning
  • online learning
  • safe exploration
  • safe forgetting

Fingerprint

Dive into the research topics of 'Smart Forgetting for Safe Online Learning with Gaussian Processes'. Together they form a unique fingerprint.

Cite this