DAC: The Double Actor-Critic Architecture for Learning Options | Shangtong Zhang · Shimon Whiteson |
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards | Siyuan Li · Rui Wang · Minxue Tang · Chongjie Zhang |
Language as an Abstraction for Hierarchical Deep Reinforcement Learning | YiDing Jiang · Shixiang (Shane) Gu · Kevin Murphy · Chelsea Finn |
Learning Robust Options by Conditional Value at Risk Optimization | Takuya Hiraoka · Takahisa Imagawa · Tatsuya Mori · Takashi Onishi · Yoshimasa Tsuruoka |
The Option Keyboard: Combining Skills in Reinforcement Learning | Andre Barreto · Diana Borsa · Shaobo Hou · Gheorghe Comanici · Eser Aygün · Philippe Hamel · Daniel Toyama · Jonathan hunt · Shibl Mourad · David Silver · Doina Precup |