Control What You Can: Intrinsically Motivated Task-Planning Agent | Sebastian Blaes · Marin Vlastelica Pogančić · Jiajie Zhu · Georg Martius |
Depth-First Proof-Number Search with Heuristic Edge Cost and Application to Chemical Synthesis Planning | Akihiro Kishimoto · Beat Buesser · Bei Chen · Adi Botea |
Maximum Entropy Monte-Carlo Planning | Chenjun Xiao · Ruitong Huang · Jincheng Mei · Dale Schuurmans · Martin Müller |
Non-Stationary Markov Decision Processes, a Worst-Case Approach using Model-Based Reinforcement Learning | Erwan Lecarpentier · Emmanuel Rachelson |
Planning in entropy-regularized Markov decision processes and games | Jean-Bastien Grill · Omar Darwiche Domingues · Pierre Menard · Remi Munos · Michal Valko |
Planning with Goal-Conditioned Policies | Soroush Nasiriany · Vitchyr Pong · Steven Lin · Sergey Levine |
Regression Planning Networks | Danfei Xu · Roberto Martín-Martín · De-An Huang · Yuke Zhu · Silvio Savarese · Li Fei-Fei |
Search on the Replay Buffer: Bridging Planning and Reinforcement Learning | Ben Eysenbach · Russ Salakhutdinov · Sergey Levine |