arXiv Analytics

Sign in

arXiv:1509.08985 [stat.ML]AbstractReferencesReviewsResources

Generalizing Pooling Functions in Convolutional Neural Networks: Mixed, Gated, and Tree

Chen-Yu Lee, Patrick W. Gallagher, Zhuowen Tu

Published 2015-09-30Version 1

We seek to improve deep neural networks by generalizing the pooling operations that play a central role in current architectures. We pursue a careful exploration of approaches to allow pooling to learn and to adapt to complex and variable patterns. The two primary directions lie in (1) learning a pooling function via (two strategies of) combining of max and average pooling, and (2) learning a pooling function in the form of a tree-structured fusion of pooling filters that are themselves learned. In our experiments every generalized pooling operation we explore improves performance when used in place of average or max pooling. We experimentally demonstrate that the proposed pooling operations provide a boost in invariance properties relative to conventional pooling and set the state of the art on several widely adopted benchmark datasets; they are also easy to implement, and can be applied within various deep neural network architectures. These benefits come with only a light increase in computational overhead during training (ranging from $5\%$ to $15\%$ in timing experiments) and a very modest increase in the number of model parameters. For example, using 45 additional parameters, we improve AlexNet performance on ImageNet by $6\%$ relative (top-5, single-view).

Comments: Patent disclosure, UCSD Docket No. SD2015-184, "Forest Convolutional Neural Network", filed on March 4, 2015. UCSD Docket No. SD2016-053, "Generalizing Pooling Functions in Convolutional Neural Network", filed on Sept 23, 2015
Categories: stat.ML, cs.LG, cs.NE
Related articles: Most relevant | Search more
arXiv:2010.03733 [stat.ML] (Published 2020-10-08)
Neural Group Actions
arXiv:1801.03523 [stat.ML] (Published 2018-01-09)
Generative Models for Stochastic Processes Using Convolutional Neural Networks
arXiv:2301.11732 [stat.ML] (Published 2023-01-27)
Convolutional neural networks for valid and efficient causal inference