Supervisory output prediction for bilinear systems by reinforcement learning

G. Chasparis, T. Natschläger. Supervisory output prediction for bilinear systems by reinforcement learning. IET Control Theory & Applications, volume 11, number 10, pages 1514-1521, DOI 10.1049/iet-cta.2016.1400, 6, 2017.

Autoren
  • Georgios Chasparis
  • Thomas Natschläger
TypArtikel
JournalIET Control Theory & Applications
Nummer10
Band11
DOI10.1049/iet-cta.2016.1400
ISSN1751-8644
Monat6
Jahr2017
Seiten1514-1521
Abstract

Online output prediction is an indispensable part of any model predictive control implementation. For several application scenarios, operating conditions may change quite often, while designing the data collection process may not be an option. To this end, this paper introduces a supervisory output prediction scheme, tailored specifically for input-output stable bilinear systems, that intends on automating the process of selecting the most appropriate prediction model during runtime. The selection process is based upon a reinforcement-learning scheme, where prediction models are selected according to their prior prediction performance. An additional selection process is concerned with appropriately partitioning the control-inputsaAZ domain in order to also allow for switched-system approximations of the original bilinear dynamics. We show analytically that the proposed scheme converges (in probability) to the best model and partition. We also demonstrate these properties through simulations of temperature prediction in residential buildings.