标题：Imbalanced Data Classification Algorithm Based on Integrated Sampling and Ensemble Learning
作者：Han, Yan ;He, Mingxiang ;Lu, Qixian
作者机构：[Han, Yan ;He, Mingxiang ] College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao; 266590, China;[Lu, Qix 更多
会议名称：12th International Conference on Genetic and Evolutionary Computing, ICGEC 2018
会议日期：December 14, 2018 - December 17, 2018
来源：Advances in Intelligent Systems and Computing
摘要：In order to alleviate the impact of imbalanced data on support vector machine (SVM), an integrated hybrid sampling imbalanced data classification method is proposed. First, the imbalance rate of imbalanced data is reduced by the ADASYN-NCL (Adaptive Synthetic Sampling Technique—Domain Cleanup Rule Downsampling Method) hybrid sampling method. Then, the AdaBoost algorithm framework is used to give different weight adjustments to the misclassification of minority and majority classes, and selectively integrate several classifiers to obtain better classification. Finally, use the 10 sets of imbalanced data in the KEEL database as test objects, and F-value and G-mean are used as evaluation indicators to verify the performance of the classification algorithm. The experimental results show that the classification algorithm has certain advantages for the classification effect of imbalanced data sets.
© 2019, Springer Nature Singapore Pte Ltd.