arXiv:2404.11207 [cs.CV]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords multimodal large language models, visual prompting, downstream task, transferability, contain richer task-specific semantics Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset