arXiv:2505.07070 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords scaling laws, simple hierarchical languages, representation learning, convolutional architectures, transformer Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset