arXiv:2402.12750 [cs.CV]
Keywords: multimodal large language models, model composition, understand inputs, address parameter interference, paired multimodal instruction data
Tags: conference paper, github project