$$\\mathsf {SafePILCO}$$ : A Software Tool for Safe and Data-Efficient Policy Synthesis

Polymenakos K, Rontsis N, Abate A, Roberts S

1 January 2020

Conference paper

Journal:

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume:

12289 LNCS

pp.

18 - 26

$$\mathsf {Safe PILCO}$$ is a software tool for safe and data-efficient policy search with reinforcement learning. It extends the known$$\mathsf {PILCO}$$ algorithm, originally written in MATLAB, to support safe learning.$$\mathsf {Safe PILCO}$$ is a Python implementation and leverages existing libraries that allow the codebase to remain short and modular, towards wider use by the verification, reinforcement learning, and control communities.

DOI

10.1007/978-3-030-59854-9_3

$$\mathsf {SafePILCO}$$ : A Software Tool for Safe and Data-Efficient Policy Synthesis