arXiv:2410.22217 [cs.CV]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords generation, unifying understanding, autoregression perspective, large language models, summarize autoregressive vision foundation models Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset