$$\mathsf {SafePILCO}$$ : A Software Tool for Safe and Data-Efficient Policy Synthesis

Polymenakos K, Rontsis N, Abate A, Roberts S

$$\mathsf {Safe PILCO}$$ is a software tool for safe and data-efficient policy search with reinforcement learning. It extends the known$$\mathsf {PILCO}$$ algorithm, originally written in MATLAB, to support safe learning.$$\mathsf {Safe PILCO}$$ is a Python implementation and leverages existing libraries that allow the codebase to remain short and modular, towards wider use by the verification, reinforcement learning, and control communities.