arXiv:1506.06472 [cs.LG]
The Ebb and Flow of Deep Learning: a Theory of Local Learning
Published 2015-06-22 (Version 1)
In a physical neural system, where storage and processing are intimately intertwined, the rules for adjusting the synaptic weights can only depend on variables that are available locally, such as the activity of the pre- and post-synaptic neurons, resulting in local learning rules. A systematic framework for studying the space of local learning rules must first define the nature of the local variables, and then the functional form that ties them together into each learning rule. We consider polynomial local learning rules and analyze their behavior and capabilities in both linear and non-linear networks. As a byproduct, this framework also enables the discovery of new learning rules, as well as of important relationships between learning rules and group symmetries. Stacking local learning rules in deep feedforward networks leads to deep local learning. While deep local learning can learn interesting representations, it cannot learn complex input-output functions, even when targets are available for the top layer. Learning complex input-output functions requires local deep learning, where target information is propagated to the deep layers through a backward channel. The nature of the propagated information about the targets, and the backward channel through which this information is propagated, partition the space of learning algorithms. For any learning algorithm, the capacity of the backward channel can be defined as the number of bits provided about the gradient per weight, divided by the number of required operations per weight. We estimate the capacity associated with several learning algorithms and show that backpropagation outperforms them and achieves the maximum possible capacity. The theory clarifies the concept of Hebbian learning and what is learnable by Hebbian learning, and explains the sparsity of the space of learning rules discovered so far.
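For illustration only, here is a minimal sketch (not taken from the paper) of what a polynomial local learning rule looks like in code: the update of each weight depends only on variables available locally at the synapse, here the pre-synaptic activity, the post-synaptic activity, and the current weight. The layer sizes, the learning rate, and the specific lowest-degree rule dW_ij = eta * y_i * x_j (simple Hebb) are illustrative assumptions, not the paper's formalism.

```python
import numpy as np

# Minimal sketch (illustrative assumptions): a polynomial local learning rule
# applied to a single linear layer. The update of w_ij uses only locally
# available variables: pre-synaptic activity x_j and post-synaptic activity y_i.

rng = np.random.default_rng(0)

n_in, n_out = 8, 4          # assumed layer sizes
eta = 0.01                  # assumed learning rate
W = rng.normal(scale=0.1, size=(n_out, n_in))

def local_update(W, x, eta):
    """One step of a simple Hebbian rule, dW_ij = eta * y_i * x_j,
    the lowest-degree polynomial rule in the pre/post activities."""
    y = W @ x                      # post-synaptic (linear) activity
    dW = eta * np.outer(y, x)      # product of local pre/post variables only
    return W + dW

# Drive the layer with random inputs; the update never uses non-local
# information such as targets or gradients propagated from downstream layers.
for _ in range(100):
    x = rng.normal(size=n_in)
    W = local_update(W, x, eta)
```

In the abstract's terminology, stacking such layers and training each one with a rule of this kind corresponds to deep local learning; propagating target information backward to the deep layers, as backpropagation does, is what the paper calls local deep learning.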