arXiv Analytics

Sign in

arXiv:2006.13319 [cs.LG]AbstractReferencesReviewsResources

Classification Performance Metric for Imbalance Data Based on Recall and Selectivity Normalized in Class Labels

Robert Burduk

Published 2020-06-23Version 1

In the classification of a class imbalance dataset, the performance measure used for the model selection and comparison to competing methods is a major issue. In order to overcome this problem several performance measures are defined and analyzed in several perspectives regarding in particular the imbalance ratio. There is still no clear indication which metric is universal and can be used for any skewed data problem. In this paper we introduced a new performance measure based on the harmonic mean of Recall and Selectivity normalized in class labels. This paper shows that the proposed performance measure has the right properties for the imbalanced dataset. In particular, in the space defined by the majority class examples and imbalance ratio it is less sensitive to changes in the majority class and more sensitive to changes in the minority class compared with other existing single-value performance measures. Additionally, the identity of the other performance measures has been proven analytically.

Related articles: Most relevant | Search more
arXiv:2201.11653 [cs.LG] (Published 2022-01-25)
Representation learnt by SGD and Adaptive learning rules -- Conditions that Vary Sparsity and Selectivity in Neural Network
arXiv:1301.0565 [cs.LG] (Published 2012-12-12)
An Information-Theoretic External Cluster-Validity Measure
arXiv:2006.07589 [cs.LG] (Published 2020-06-13)
Adversarial Self-Supervised Contrastive Learning