arXiv:1707.03905 Abstract | arXiv Analytics

arXiv:1707.03905 [stat.ML]Abstract References Reviews Resources

Influence of Resampling on Accuracy of Imbalanced Classification

Evgeny Burnaev, Pavel Erofeev, Artem Papanov

Published 2017-07-12Version 1

In many real-world binary classification tasks (e.g. detection of certain objects from images), an available dataset is imbalanced, i.e., it has much less representatives of a one class (a minor class), than of another. Generally, accurate prediction of the minor class is crucial but it's hard to achieve since there is not much information about the minor class. One approach to deal with this problem is to preliminarily resample the dataset, i.e., add new elements to the dataset or remove existing ones. Resampling can be done in various ways which raises the problem of choosing the most appropriate one. In this paper we experimentally investigate impact of resampling on classification accuracy, compare resampling methods and highlight key points and difficulties of resampling.

Comments: 5 pages, 2 figures, Eighth International Conference on Machine Vision (December 8, 2015)

Journal: Proc. SPIE9875, 2015

DOI: 10.1117/12.2228523

Categories: stat.ML, cs.LG, stat.AP

Keywords: imbalanced classification, resampling, minor class, real-world binary classification tasks, accurate prediction

Tags: conference paper, journal article

Related articles: Most relevant | Search more

arXiv:2409.05598 [stat.ML] (Published 2024-09-09)

When resampling/reweighting improves feature learning in imbalanced classification?: A toy-model study

Tomoyuki Obuchi, Toshiyuki Tanaka

arXiv:1508.01235 [stat.ML] (Published 2015-08-05)

Empirical Similarity for Absent Data Generation in Imbalanced Classification