arXiv:1606.05340 [stat.ML]

Exponential expressivity in deep neural networks through transient chaos

Ben Poole, Subhaneil Lahiri, Maithra Raghu, Jascha Sohl-Dickstein, Surya Ganguli

Published 2016-06-16 (Version 1)

We combine Riemannian geometry with the mean-field theory of high-dimensional chaos to study the nature of signal propagation in generic, deep neural networks with random weights. Our results reveal an order-to-chaos expressivity phase transition, with networks in the chaotic phase computing nonlinear functions whose global curvature grows exponentially with depth but not width. We prove that this generic class of deep random functions cannot be efficiently computed by any shallow network, going beyond prior work restricted to the analysis of single functions. Moreover, we formalize and quantitatively demonstrate the long-conjectured idea that deep networks can disentangle highly curved manifolds in input space into flat manifolds in hidden space. Our theoretical analysis of the expressive power of deep networks applies broadly to arbitrary nonlinearities, and provides a quantitative underpinning for previously abstract notions about the geometry of deep functions.
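To make the order-to-chaos transition concrete, the sketch below iterates the mean-field length map for a random tanh network to its fixed point q*, then estimates chi_1, the slope of the correlation map at that fixed point: chi_1 > 1 marks the chaotic phase, chi_1 < 1 the ordered phase. This is a minimal numpy illustration assuming tanh activations; the sigma_w and sigma_b values are chosen only to land in the chaotic regime, and the Gaussian expectations are estimated by Monte Carlo rather than quadrature.

```python
import numpy as np

rng = np.random.default_rng(0)
z = rng.standard_normal(200_000)  # Monte Carlo samples for Gaussian expectations

sigma_w, sigma_b = 3.0, 0.3  # illustrative weight/bias scales (chaotic regime)

def length_map(q):
    """One step of the mean-field variance recursion
    q_{l+1} = sigma_w^2 * E[tanh(sqrt(q_l) * z)^2] + sigma_b^2,  z ~ N(0, 1)."""
    return sigma_w**2 * np.mean(np.tanh(np.sqrt(q) * z) ** 2) + sigma_b**2

# Iterate the length map until it settles at its fixed point q*.
q = 1.0
for _ in range(100):
    q = length_map(q)

# chi_1 = sigma_w^2 * E[phi'(sqrt(q*) * z)^2], with phi = tanh, phi' = 1 - tanh^2.
chi_1 = sigma_w**2 * np.mean((1.0 - np.tanh(np.sqrt(q) * z) ** 2) ** 2)

print(f"fixed point q* = {q:.3f}, chi_1 = {chi_1:.3f}"
      f" -> {'chaotic' if chi_1 > 1 else 'ordered'} phase")
```

Sweeping sigma_w while holding sigma_b fixed moves chi_1 through 1 and traces out the phase boundary; in the chaotic phase, quantities such as the length of a propagated input curve grow geometrically with depth at a rate set by chi_1, which is the exponential-with-depth behavior the abstract describes.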

Related articles:
arXiv:1805.08266 [stat.ML] (Published 2018-05-21)
On the Selection of Initialization and Activation Function for Deep Neural Networks
arXiv:1402.1869 [stat.ML] (Published 2014-02-08, updated 2014-06-07)
On the Number of Linear Regions of Deep Neural Networks
arXiv:1712.09482 [stat.ML] (Published 2017-12-27)
Robust Loss Functions under Label Noise for Deep Neural Networks