arXiv:2402.05935 [cs.CV]
Keywords: multi-modal large language models, scaling data, extensive multimodality large language model, removing redundant visual encoders, one-stage all-in-one paradigm
Tags: github project