arXiv:2105.14077 [cs.CV]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords inductive biases, isotropic networks, huge transformer models, quadratic self-attention activations, natural language Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset