Reinforcement Learning and Planning · Decision and Control

Title | Authors
Generalized Off-Policy Actor-Critic | Shangtong Zhang · Wendelin Boehmer · Shimon Whiteson
Learner-aware Teaching: Inverse Reinforcement Learning with Preferences and Constraints | Sebastian Tschiatschek · Ahana Ghosh · Luis Haug · Rati Devidze · Adish Singla
Logarithmic Regret for Online Control | Naman Agarwal · Elad Hazan · Karan Singh
Adaptive Auxiliary Task Weighting for Reinforcement Learning | Xingyu Lin · Harjatin Baweja · George Kantor · David Held
Causal Confusion in Imitation Learning | Pim de Haan · Dinesh Jayaraman · Sergey Levine
Hierarchical Decision Making by Generating and Following Natural Language Instructions | Hengyuan Hu · Denis Yarats · Qucheng Gong · Yuandong Tian · Mike Lewis
Non-Cooperative Inverse Reinforcement Learning | Xiangyuan Zhang · Kaiqing Zhang · Erik Miehling · Tamer Basar
Robust exploration in linear quadratic reinforcement learning | Jack Umenberger · Mina Ferizbegovic · Thomas Schön · Håkan Hjalmarsson
Compositional Plan Vectors | Coline Devin · Daniel Geng · Pieter Abbeel · Trevor Darrell · Sergey Levine
Online Optimal Control with Linear Dynamics and Predictions: Algorithms and Regret Analysis | Yingying Li · Xin Chen · Na Li
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games | Kaiqing Zhang · Zhuoran Yang · Tamer Basar
Policy Continuation with Hindsight Inverse Dynamics | Hao Sun · Zhizhong Li · Xiaotong Liu · Bolei Zhou · Dahua Lin