Kernel Instrumental Variable Regression | Rahul Singh · Maneesh Sahani · Arthur Gretton |
Uniform convergence may be unable to explain generalization in deep learning | Vaishnavh Nagarajan · J. Zico Kolter |
Logarithmic Regret for Online Control | Naman Agarwal · Elad Hazan · Karan Singh |
Updates of Equilibrium Prop Match Gradients of Backprop Through Time in an RNN with Static Input | Maxence Ernoult · Benjamin Scellier · Yoshua Bengio · Damien Querlioz · Julie Grollier |
Causal Confusion in Imitation Learning | Pim de Haan · Dinesh Jayaraman · Sergey Levine |
Generative Modeling by Estimating Gradients of the Data Distribution | Yang Song · Stefano Ermon |
Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration | Clarice Poon · Jingwei Liang |
Scalable Bayesian inference of dendritic voltage via spatiotemporal recurrent state space models | Ruoxi Sun · Ian Kinsella · Scott Linderman · Liam Paninski |
Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning | Harm Van Seijen · Mehdi Fatemi · Arash Tavakoli |
HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models | Sharon Zhou · Mitchell Gordon · Ranjay Krishna · Austin Narcomey · Li Fei-Fei · Michael Bernstein |
Necessary and Sufficient Geometries for Gradient Methods | Daniel Levy · John Duchi |
Parameter elimination in particle Gibbs sampling | Anna Wigren · Riccardo Sven Risuleo · Lawrence Murray · Fredrik Lindsten |
Fast and Accurate Least-Mean-Squares Solvers | Ibrahim Jubran · Alaa Maalouf · Dan Feldman |
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations | Vincent Sitzmann · Michael Zollhoefer · Gordon Wetzstein |
A neurally plausible model learns successor representations in partially observable environments | Eszter Vértes · Maneesh Sahani |
Faster width-dependent algorithm for mixed packing and covering LPs | Digvijay Boob · Saurabh Sawlani · Di Wang |
Guided Similarity Separation for Image Retrieval | Chundi Liu · Guangwei Yu · Maksims Volkovs · Cheng Chang · Himanshu Rai · Junwei Ma · Satya Krishna Gorti |
Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup | Sebastian Goldt · Madhu Advani · Andrew Saxe · Florent Krzakala · Lenka Zdeborová |
Batched Multi-armed Bandits Problem | Zijun Gao · Yanjun Han · Zhimei Ren · Zhengqing Zhou |
Efficient and Thrifty Voting by Any Means Necessary | Debmalya Mandal · Ariel Procaccia · Nisarg Shah · David Woodruff |
Geometry-Aware Neural Rendering | Joshua Tobin · Wojciech Zaremba · Pieter Abbeel |
On Making Stochastic Classifiers Deterministic | Andrew Cotter · Maya Gupta · Harikrishna Narasimhan |
Strategizing against No-regret Learners | Yuan Deng · Jon Schneider · Balasubramanian Sivan |
Distribution-Independent PAC Learning of Halfspaces with Massart Noise | Ilias Diakonikolas · Themis Gouleakis · Christos Tzamos |
Exponentially convergent stochastic k-PCA without variance reduction | Cheng Tang |
Brain-Like Object Recognition with High-Performing Shallow Recurrent ANNs | Jonas Kubilius · Martin Schrimpf · Ha Hong · Najib Majaj · Rishi Rajalingham · Elias Issa · Kohitij Kar · Pouya Bashivan · Jonathan Prescott-Roy · Kailyn Schmidt · Aran Nayebi · Daniel Bear · Daniel Yamins · James J DiCarlo |
Variance Reduction for Matrix Games | Yair Carmon · Yujia Jin · Aaron Sidford · Kevin Tian |
Optimizing Generalized Rate Metrics with Three Players | Harikrishna Narasimhan · Andrew Cotter · Maya Gupta |
Blind Super-Resolution Kernel Estimation using an Internal-GAN | Sefi Bell-Kligler · Assaf Shocher · Michal Irani |
Average Individual Fairness: Algorithms, Generalization and Experiments | Saeed Sharifi-Malvajerdi · Michael Kearns · Aaron Roth |
Putting An End to End-to-End: Gradient-Isolated Learning of Representations | Sindy Löwe · Peter O'Connor · Bastiaan Veeling |
Understanding Sparse JL for Feature Hashing | Meena Jagadeesan |
Nonparametric Density Estimation & Convergence Rates for GANs under Besov IPM Losses | Ananya Uppal · Shashank Singh · Barnabas Poczos |
On Robustness of Principal Component Regression | Anish Agarwal · Devavrat Shah · Dennis Shen · Dogyoon Song |
XLNet: Generalized Autoregressive Pretraining for Language Understanding | Zhilin Yang · Zihang Dai · Yiming Yang · Jaime Carbonell · Russ Salakhutdinov · Quoc V Le |
R2D2: Reliable and Repeatable Detector and Descriptor | Jerome Revaud · Cesar De Souza · Martin Humenberger · Philippe Weinzaepfel |