arXiv:2305.07102 [cs.CV]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords fine-grained classification, lower input image resolution, sm-vit achieves state-of-the-art performance, challenging computer vision problem, inherent self-attention mechanism Tags journal article Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset