arXiv:2308.08934 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords pre-training, data imbalance, input data, machine learning models, theoretical calculation Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset