Theory · Statistical Physics of Learning

TitleAuthors
A Solvable High-Dimensional Model of GANChuang Wang · Hong Hu · Yue Lu
Data-Dependence of Plateau Phenomenon in Learning with Neural Network --- Statistical Mechanical AnalysisYuki Yoshida · Masato Okada
Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setupSebastian Goldt · Madhu Advani · Andrew Saxe · Florent Krzakala · Lenka Zdeborová
First Exit Time Analysis of Stochastic Gradient Descent Under Heavy-Tailed Gradient NoiseThanh Huy Nguyen · Umut Simsekli · Mert Gurbuzbalaban · Gaël RICHARD
The Normalization Method for Alleviating Pathological Sharpness in Wide Neural NetworksRyo Karakida · Shotaro Akaho · Shun-ichi Amari
Untangling in Invariant Speech RecognitionCory Stephenson · Jenelle Feather · Suchismita Padhy · Oguz Elibol · Hanlin Tang · Josh McDermott · SueYeon Chung
Wide Feedforward or Recurrent Neural Networks of Any Architecture are Gaussian ProcessesGreg Yang