arXiv:2312.06968 [cs.CV]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords multimodal large language model, hallucination augmented contrastive learning, indicating unsatisfactory cross-modal representation alignment Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset