Safe Reinforcement Learning Through Regret and State Restorations in Evaluation Stages
Timo P. Gros; Nicola Müller; Daniel Höller; Verena Wolf
In: Workshop on Reliable Data-Driven Planning and Scheduling. International Conference on Automated Planning and Scheduling (ICAPS-2024), Springer, 2024.