Algorithms · Large Scale Learning

TitleAuthors
A Linearly Convergent Proximal Gradient Algorithm for Decentralized OptimizationSulaiman Alghunaim · Kun Yuan · Ali H Sayed
Asymptotics for Sketching in Least Squares RegressionEdgar Dobriban · Sifan Liu
DETOX: A Redundancy-based Framework for Faster and More Robust Gradient AggregationShashank Rajput · Hongyi Wang · Zachary Charles · Dimitris Papailiopoulos
Large-scale optimal transport map estimation using projection pursuitCheng Meng · Yuan Ke · Jingyi Zhang · Mengrui Zhang · Wenxuan Zhong · Ping Ma
Locality-Sensitive Hashing for f-Divergences: Mutual Information Loss and BeyondLin Chen · Hossein Esfandiari · Gang Fu · Vahab Mirrokni
Massively scalable Sinkhorn distances via the Nyström methodJason Altschuler · Francis Bach · Alessandro Rudi · Jonathan Niles-Weed
On the Global Convergence of (Fast) Incremental Expectation Maximization MethodsBelhal Karimi · Hoi-To Wai · Eric Moulines · Marc Lavielle
Optimal Sparsity-Sensitive Bounds for Distributed Mean Estimationzengfeng Huang · Ziyue Huang · Yilei WANG · Ke Yi
Qsparse-local-SGD: Distributed SGD with Quantization, Sparsification and Local ComputationsDebraj Basu · Deepesh Data · Can Karakus · Suhas Diggavi
Random Projections with Asymmetric QuantizationXiaoyun Li · Ping Li
Re-randomized Densification for One Permutation Hashing and Bin-wise Consistent Weighted SamplingPing Li · Xiaoyun Li · Cun-Hui Zhang
Robust and Communication-Efficient Collaborative LearningAmirhossein Reisizadeh · Hossein Taheri · Aryan Mokhtari · Hamed Hassani · Ramtin Pedarsani
Sampled Softmax with Random Fourier FeaturesAnkit Singh Rawat · Jiecao Chen · Felix Xinnan Yu · Ananda Theertha Suresh · Sanjiv Kumar
Extreme Classification in Log Memory using Count-Min Sketch: A Case Study of Amazon Search with 50M ProductsTharun Kumar Reddy Medini · Qixuan Huang · Yiqiu Wang · Vijai Mohan · Anshumali Shrivastava
Sliced Gromov-WassersteinVayer Titouan · Rémi Flamary · Nicolas Courty · Romain Tavenard · Laetitia Chapel
SySCD: A System-Aware Parallel Coordinate Descent AlgorithmNikolas Ioannou · Celestine Mendler-Dünner · Thomas Parnell
GPipe: Efficient Training of Giant Neural Networks using Pipeline ParallelismYanping Huang · Youlong Cheng · Ankur Bapna · Orhan Firat · Dehao Chen · Mia Chen · HyoukJoong Lee · Jiquan Ngiam · Quoc V Le · Yonghui Wu · zhifeng Chen