arXiv Analytics

Sign in

arXiv:1906.11882 [cs.LG]AbstractReferencesReviewsResources

From Data Quality to Model Quality: an Exploratory Study on Deep Learning

Tianxing He, Shengcheng Yu, Ziyuan Wang, Jieqiong Li, Zhenyu Chen

Published 2019-06-10Version 1

Nowadays, people strive to improve the accuracy of deep learning models. However, very little work has focused on the quality of data sets. In fact, data quality determines model quality. Therefore, it is important for us to make research on how data quality affects on model quality. In this paper, we mainly consider four aspects of data quality, including Dataset Equilibrium, Dataset Size, Quality of Label, Dataset Contamination. We deign experiment on MNIST and Cifar-10 and try to find out the influence the four aspects make on model quality. Experimental results show that four aspects all have decisive impact on the quality of models. It means that decrease in data quality in these aspects will reduce the accuracy of model.

Related articles: Most relevant | Search more
arXiv:2011.09789 [cs.LG] (Published 2020-11-19)
An Experimental Study of Semantic Continuity for Deep Learning Models
arXiv:2108.03579 [cs.LG] (Published 2021-08-08)
Expressive Power and Loss Surfaces of Deep Learning Models
arXiv:1510.04781 [cs.LG] (Published 2015-10-16)
A Survey: Time Travel in Deep Learning Space: An Introduction to Deep Learning Models and How Deep Learning Models Evolved from the Initial Ideas