Accelerated Coordinate Descent with Arbitrary Sampling and Best Rates for Minibatches

Filip Hanzely; Peter Richtarik

Back

Conference proceeding

Accelerated Coordinate Descent with Arbitrary Sampling and Best Rates for Minibatches

Filip Hanzely and Peter Richtarik

22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, Vol.89, pp.304-312

Proceedings of Machine Learning Research

01/01/2019

Abstract

Computer Science

Computer Science, Artificial Intelligence

Mathematics

Physical Sciences

Science & Technology

Statistics & Probability

Technology

Accelerated coordinate descent is a widely popular optimization algorithm due to its efficiency on large-dimensional problems. It achieves state-of-the-art complexity on an important class of empirical risk minimization problems. In this paper we design and analyze an accelerated coordinate descent (ACD) method which in each iteration updates a random subset of coordinates according to an arbitrary but fixed probability law, which is a parameter of the method. While minibatch variants of ACD are more popular and relevant in practice, there is no importance sampling for ACD that outperforms the standard uniform minibatch sampling. Through insights enabled by our general analysis, we design new importance sampling for minibatch ACD which significantly outperforms previous state-of-the-art minibatch ACD in practice. We prove a rate that is at most O(root tau) times worse than the rate of minibatch ACD with uniform sampling, but can be O(n/T) times better, where tau is the minibatch size. Since in modern supervised learning training systems it is standard practice to choose tau << n, and often tau = O(1), our method can lead to dramatic speedups. Lastly, we obtain similar results for minibatch nonaccelerated CD as well, achieving improvements on previous best rates.

Metrics

1 Record Views

Details

Title: Accelerated Coordinate Descent with Arbitrary Sampling and Best Rates for Minibatches
Creators - without role: Filip Hanzely - KAUST, Thuwal, Saudi Arabia
Peter Richtarik - KAUST, Thuwal, Saudi Arabia
Contributors - without role: K Chaudhuri
M Sugiyama
Publication Details: 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, Vol.89, pp.304-312
Series: Proceedings of Machine Learning Research
Publisher: Microtome Publishing
Number of pages: 9
Identifiers: 9941144708331
Academic Unit: King Abdullah University of Science & Technology
Language: English
Resource Type: Conference proceeding