arXiv:1901.09054 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords small datasets, popular datasets confirm, pre-training, cosine loss function, contemporary deep learning discourse Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset