arXiv Analytics

Sign in

arXiv:1908.06487 [cs.LG]AbstractReferencesReviewsResources

Neural Network Based Undersampling Techniques

Md. Adnan Arefeen, Sumaiya Tabassum Nimi, M Sohel Rahman

Published 2019-08-18Version 1

Class imbalance problem is commonly faced while developing machine learning models for real-life issues. Due to this problem, the fitted model tends to be biased towards the majority class data, which leads to lower precision, recall, AUC, F1, G-mean score. Several researches have been done to tackle this problem, most of which employed resampling, i.e. oversampling and undersampling techniques to bring the required balance in the data. In this paper, we propose neural network based algorithms for undersampling. Then we resampled several class imbalanced data using our algorithms and also some other popular resampling techniques. Afterwards we classified these undersampled data using some common classifier. We found out that our resampling approaches outperform most other resampling techniques in terms of both AUC, F1 and G-mean score.

Comments: 8 pages in IEEE format
Categories: cs.LG, stat.ML
Related articles: Most relevant | Search more
arXiv:1811.09054 [cs.LG] (Published 2018-11-22)
Enhanced Expressive Power and Fast Training of Neural Networks by Random Projections
arXiv:1812.03699 [cs.LG] (Published 2018-12-10)
Taxi Demand-Supply Forecasting: Impact of Spatial Partitioning on the Performance of Neural Networks
arXiv:1804.07669 [cs.LG] (Published 2018-04-20)
Modelling customer online behaviours with neural networks: applications to conversion prediction and advertising retargeting