Mean Field Analysis of Deep Neural Networks

Justin Sirignano (Corresponding Author)
[email protected]
https://orcid.org/0000-0002-0971-1349
Mathematical Institute, University of Oxford, Oxford OX2 6GG, United Kingdom

Konstantinos Spiliopoulos
[email protected]
Department of Mathematics and Statistics, Boston University, Boston, Massachusetts 02215

Published Online: 21 Apr 2021
https://doi.org/10.1287/moor.2020.1118

Abstract

We analyze multilayer neural networks in the asymptotic regime of simultaneously (a) large network sizes and (b) large numbers of stochastic gradient descent training iterations. We rigorously establish the limiting behavior of the multilayer neural network output. The limit procedure is valid for any number of hidden layers, and it naturally also describes the limiting behavior of the training loss. The ideas that we explore are to (a) take the limits of each hidden layer sequentially and (b) characterize the evolution of parameters in terms of their initialization. The limit satisfies a system of deterministic integro-differential equations. The proof uses methods from weak convergence and stochastic analysis. We show that, under suitable assumptions on the activation functions and the behavior for large times, the limit neural network recovers a global minimum (with zero loss for the objective function).
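As a rough illustration of the large-network regime described above, the following minimal sketch builds a network with two hidden layers and checks numerically that its output at random initialization fluctuates less as the layers grow wider. The abstract does not state the normalization, so the 1/N averaging at each hidden layer is an assumption here, chosen because it is the customary mean field scaling. The function names `mean_field_output` and `output_spread`, the tanh activation, and the weight distributions are all hypothetical choices made for illustration. No training is performed, so this does not reproduce the limiting equations of the paper.

```python
import math
import random
import statistics

def mean_field_output(x, n1, n2, rng):
    """Output of a two-hidden-layer network at random initialization.

    Assumption (not taken from the abstract): each layer averages over the
    previous one with a 1/N factor, the usual mean field normalization.
    """
    # First hidden layer: n1 units, each tanh(w . x) with Gaussian weights.
    h1 = [math.tanh(sum(rng.gauss(0.0, 1.0) * xi for xi in x)) for _ in range(n1)]
    # Second hidden layer: each unit averages (1/n1) over the first layer.
    h2 = [math.tanh(sum(rng.gauss(0.0, 1.0) * h for h in h1) / n1) for _ in range(n2)]
    # Output: average (1/n2) over the second layer, output weights in [-1, 1].
    return sum(rng.uniform(-1.0, 1.0) * h for h in h2) / n2

def output_spread(x, width, trials=50, seed=0):
    """Standard deviation of the output over independent random initializations."""
    rng = random.Random(seed)
    samples = [mean_field_output(x, width, width, rng) for _ in range(trials)]
    return statistics.pstdev(samples)

if __name__ == "__main__":
    x = [0.5, -1.0, 2.0]  # arbitrary illustrative input
    for width in (8, 64):
        print(width, output_spread(x, width))
```

Under these assumptions the spread reported for width 64 should be noticeably smaller than for width 8, which is the kind of concentration that makes a deterministic limit plausible.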

Mathematics of Operations Research

Volume 47, Issue 1

February 2022

Pages 1-846, C2

Article Information

Received: June 10, 2019
Accepted: September 24, 2020
Published Online: April 21, 2021

Cite as

Justin Sirignano, Konstantinos Spiliopoulos (2021) Mean Field Analysis of Deep Neural Networks. Mathematics of Operations Research 47(1):120-152.

https://doi.org/10.1287/moor.2020.1118
