arXiv:2407.09413 [cs.CL]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords multimodal question answering, comprises 270k questions, paper image question answering, scientific research articles, first large-scale qa dataset Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset