| DAC: The Double Actor-Critic Architecture for Learning Options | Shangtong Zhang · Shimon Whiteson |
| Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards | Siyuan Li · Rui Wang · Minxue Tang · Chongjie Zhang |
| Language as an Abstraction for Hierarchical Deep Reinforcement Learning | YiDing Jiang · Shixiang (Shane) Gu · Kevin Murphy · Chelsea Finn |
| Learning Robust Options by Conditional Value at Risk Optimization | Takuya Hiraoka · Takahisa Imagawa · Tatsuya Mori · Takashi Onishi · Yoshimasa Tsuruoka |
| The Option Keyboard: Combining Skills in Reinforcement Learning | Andre Barreto · Diana Borsa · Shaobo Hou · Gheorghe Comanici · Eser Aygün · Philippe Hamel · Daniel Toyama · Jonathan hunt · Shibl Mourad · David Silver · Doina Precup |