Video
Learn More
Different labelling instances of PL-MDP $\mathfrak{M}$ being solved in Pacman
Learn More
Related Articles
M. Hasanbeig, A. Abate, and D. Kroening,
“Logically-constrained Reinforcement Learning,”
arXiv preprint arXiv:1801.08099, 2018.
Y. Kantaros, and M. M. Zavlanos,
“$\text {STyLuS}^{*} $: A Temporal Logic Optimal Control Synthesis Algorithm for Large-Scale Multi-Robot Systems,”
arXiv preprint arXiv:1809.08345, 2018.
M. Hasanbeig, A. Abate, and D. Kroening,
“Logically-constrained Neural Fitted Q-iteration,”
(to be appeared in AAMAS’19) arXiv preprint arXiv:1809.07823, 2018.