arXiv:1805.04933 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords stochastic optimization, better overall convergence rate, individual neural network layer, momentum gradient descent optimization, adaptive stepsize Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset