arXiv Analytics

Sign in

arXiv:2202.04777 [stat.ML]AbstractReferencesReviewsResources

Exact Solutions of a Deep Linear Network

Liu Ziyin, Botao Li, Xiangming Meng

Published 2022-02-10Version 1

This work finds the exact solutions to a deep linear network with weight decay and stochastic neurons, a fundamental model for understanding the landscape of neural networks. Our result implies that weight decay strongly interacts with the model architecture and can create bad minima in a network with more than $1$ hidden layer, qualitatively different for a network with only $1$ hidden layer. As an application, we also analyze stochastic nets and show that their prediction variance vanishes to zero as the stochasticity, the width, or the depth tends to infinity.

Related articles: Most relevant | Search more
arXiv:1301.5088 [stat.ML] (Published 2013-01-22)
Piecewise Linear Multilayer Perceptrons and Dropout
arXiv:1606.01487 [stat.ML] (Published 2016-06-05)
Bounds for Vector-Valued Function Estimation
arXiv:1609.01596 [stat.ML] (Published 2016-09-06)
Direct Feedback Alignment Provides Learning in Deep Neural Networks