Stochastic Gradient

Stephen J. Wright; Benjamin Recht

doi:10.1017/9781009004282.006

5 - Stochastic Gradient

Published online by Cambridge University Press: 31 March 2022

Stephen J. Wright and

Benjamin Recht

Show author details

Stephen J. Wright: Affiliation:
University of Wisconsin, Madison
Benjamin Recht: Affiliation:
University of California, Berkeley

Book contents

Get access

Summary

We describe the stochastic gradient method, the fundamental algorithm for several important problems in data science, including deep learning. We give several example problems for which this method is suitable, then described its operation for the simple problem of computing a mean of a collection of values. We related it to a classical method, the Kaczmarz method for solving a system of linear equalities and inequalities. Next, we describe the key assumptions to be used in convergence analysis, then describe the convergence rates attainable by several variants of stochastic gradient under several scenarios. Finally, we discuss several aspects of practical implementation of stochastic gradient, including minibatching and acceleration.

Keywords

Stochastic Gradient Methods Stochastic Gradient Descent Stochastic Approximation Kaczmarz Method

Type: Chapter
Information: Optimization for Data Analysis , pp. 75 - 99

DOI: https://doi.org/10.1017/9781009004282.006 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2022

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

5 - Stochastic Gradient

Summary

Keywords

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive