arXiv:2305.05448 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords weight normalization, robust implicit regularization, diagonal linear network models, gradient descent tend, neural network models Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset