arXiv:1709.02540 [cs.LG]

The Expressive Power of Neural Networks: A View from the Width

Zhou Lu, Hongming Pu, Feicheng Wang, Zhiqiang Hu, Liwei Wang

Published 2017-09-08 (Version 1)

The expressive power of neural networks is important for understanding deep learning. Most existing works consider this problem from the view of the depth of a network. In this paper, we study how width affects the expressiveness of neural networks. Classical results state that \emph{depth-bounded} (e.g. depth-$2$) networks with suitable activation functions are universal approximators. We show a universal approximation theorem for \emph{width-bounded} ReLU networks: width-$(n+4)$ ReLU networks, where $n$ is the input dimension, are universal approximators. Moreover, except for a set of measure zero, no function can be approximated by width-$n$ ReLU networks, which exhibits a phase transition. Several recent works demonstrate the benefits of depth by proving the depth-efficiency of neural networks. That is, there are classes of deep networks which cannot be realized by any shallow network whose size is no more than an \emph{exponential} bound. Here we pose the dual question on the width-efficiency of ReLU networks: are there wide networks that cannot be realized by narrow networks whose size is not substantially larger? We show that there exist classes of wide networks which cannot be realized by any narrow network whose depth is no more than a \emph{polynomial} bound. On the other hand, we demonstrate by extensive experiments that narrow networks whose size exceeds the polynomial bound by a constant factor can approximate wide and shallow networks with high accuracy. Our results provide more comprehensive evidence that depth is more effective than width for the expressiveness of ReLU networks.
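
The universal approximation theorem above concerns deep, narrow ReLU networks whose hidden width is $n+4$ for input dimension $n$. The sketch below is a minimal PyTorch illustration of that architecture class fitted to an arbitrary smooth target; the helper `narrow_relu_mlp`, the target function, and all hyperparameters are assumptions for illustration, not the paper's explicit construction or experimental setup.

```python
import torch
import torch.nn as nn

def narrow_relu_mlp(n, depth):
    """Deep, narrow ReLU MLP: input dimension n, hidden width n + 4.

    Illustrative only -- the paper's width-(n+4) universality result uses an
    explicit hand-built construction, not a trained network like this one.
    """
    width = n + 4
    layers = [nn.Linear(n, width), nn.ReLU()]
    for _ in range(depth - 1):
        layers += [nn.Linear(width, width), nn.ReLU()]
    layers.append(nn.Linear(width, 1))  # scalar output
    return nn.Sequential(*layers)

if __name__ == "__main__":
    torch.manual_seed(0)
    n = 2                                   # input dimension
    net = narrow_relu_mlp(n, depth=8)       # width n + 4 = 6 throughout
    # Hypothetical target: f(x) = sin(x1) * cos(x2) on [-1, 1]^2
    x = torch.rand(4096, n) * 2 - 1
    y = (torch.sin(x[:, 0]) * torch.cos(x[:, 1])).unsqueeze(1)
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)
    for step in range(2000):
        opt.zero_grad()
        loss = nn.functional.mse_loss(net(x), y)
        loss.backward()
        opt.step()
    print(f"final training MSE: {loss.item():.5f}")
```

The width bound is fixed at $n+4$ regardless of target; only the depth grows with the desired approximation accuracy, which is the sense in which the network is "width-bounded".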

Comments: accepted by NIPS 2017
Categories: cs.LG
Related articles:
arXiv:2401.15121 [cs.LG] (Published 2024-01-26)
Expressive Power of ReLU and Step Networks under Floating-Point Operations
arXiv:2310.17513 [cs.LG] (Published 2023-10-26)
The Expressive Power of Low-Rank Adaptation
arXiv:1602.05897 [cs.LG] (Published 2016-02-18)
Toward Deeper Understanding of Neural Networks: The Power of Initialization and a Dual View on Expressivity