arXiv:2307.07134 [cs.LG]
Keywords: machine learning algorithm, multi-dimensional ability diagnosis, multi-dimensional diagnostic metric ability, task-agnostic evaluation framework Camilla, outperforms state-of-the-art baselines