An Ensemble Learning Approach with Gradient Resampling for Class-Imbalance Problems
Published Online:31 Mar 2023https://doi.org/10.1287/ijoc.2023.1274
References
- (2015) To combat multi-class imbalanced problems by means of over-sampling techniques. IEEE Trans. Knowledge Data Engrg. 28(1):238–251.Crossref, Google Scholar
- (2020) A-stacking and a-bagging: Adaptive versions of ensemble learning algorithms for spoof fingerprint detection. Expert Systems Appl. 146:113160.Crossref, Google Scholar
- (2017) Hybrid methods for class imbalance learning employing bagging with sampling techniques. Proc. 2nd Internat. Conf. on Comput. Systems and Inform. Tech. for Sustainable Solution (CSITSS), (IEEE, New York), 1–5.Google Scholar
- (2007) UCI machine learning repository. University of California, Irvine, School of Information and Computer Sciences. http://www.ics.uci.edu/mlearn/MLRepository.html.Google Scholar
- (2021) Resampling imbalanced data for network intrusion detection data sets. J. Big Data 8(1):6.Crossref, Google Scholar
- (2021) Vfc-smote: Very fast continuous synthetic minority oversampling for evolving data streams. Data Mining Knowledge Discovery 35(11):2679–2713.Crossref, Google Scholar
- (2018) An empirical comparison on state-of-the-art multi-class imbalance learning algorithms and a new diversified ensemble learning scheme. Knowledge Base Systems 158:81–93.Crossref, Google Scholar
- (1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30(7):1145–1159.Crossref, Google Scholar
- (2016) A survey of predictive modeling on imbalanced domains. ACM Comput. Survey 49(2):1–50.Crossref, Google Scholar
- (1996) Bagging predictors. Machine Learn. 24(2):123–140.Crossref, Google Scholar
- (2018) Imcstacking: Cost-sensitive stacking learning with feature inverse mapping for imbalanced problems. Knowledge Base Systems 150:27–37.Crossref, Google Scholar
- (2020) Predicting customer satisfaction based on online reviews and hybrid ensemble genetic programming algorithms. Engrg. Appl. Artificial Intelligence 95:103902.Crossref, Google Scholar
- (2019) Dealing with difficult minority labels in imbalanced mutilabel data sets. Neurocomput. 326:39–53.Crossref, Google Scholar
- (2011) Smote: Synthetic minority over-sampling technique. J. Artificial Intelligence Res. 16(1):321–357.Google Scholar
- (2022) Generalized one-class learning using pairs of complementary classifiers. IEEE Trans. Pattern Anal. Machine Intelligence 44(10):6993–7009.Crossref, Google Scholar
- (2019) Fire-des++: Enhanced online pruning of base classifiers for dynamic ensemble selection. Pattern Recognition 85:149–160.Crossref, Google Scholar
- (2020) A survey on ensemble learning. Frontiers Comput. Sci. 14(2):241–258.Crossref, Google Scholar
- (2020) Finding the sweet spot: Batch selection for one-class active learning. Demeniconi C, Chawla N, eds. Proc. SIAM Internat. Conf. on Data Mining (SIAM, Philadelphia), 118–126.Google Scholar
- (2018) Cost-sensitive learning. Learning from Imbalanced Data Sets (Springer, Berlin), 63–78.Crossref, Google Scholar
- (2020) Handling imbalanced medical image data: A deep-learning-based one-class classification approach. Artificial Intelligence Medicine 108:101935.Crossref, Google Scholar
- (2012) On the effectiveness of preprocessing methods when dealing with different levels of class imbalance. Knowledge Base Systems 25(1):13–21.Crossref, Google Scholar
- (2017) Predicting and deterring default with social media information in peer-to-peer lending. J. Management Inform. Systems 34(2):401–424.Crossref, Google Scholar
- (2021) Dbig-us: A two-stage under-sampling algorithm to face the class imbalance problem. Expert Systems Appl. 168:114301.Crossref, Google Scholar
- (2017) Combining over-sampling and under-sampling techniques for imbalance data set. Proc. 9th Internat. Conf. on Machine Learn. and Comput. ICMLC 2017 (Association for Computing Machinery, New York), 243–247.Google Scholar
- (2021) Augmenting medical diagnosis decisions? An investigation into physicians! Decision-making process with artificial intelligence. Inform. Systems Res. 32(3):713–735.Link, Google Scholar
- (2018) Mixed bagging: A novel ensemble learning framework for supervised classification based on instance hardness. Proc. IEEE Internat. Conf. Data Mining (ICDM), (IEEE, New York), 1073–1078.Google Scholar
- (2020) Hboost: A heterogeneous ensemble classifier based on the boosting method and entropy measurement. Expert Systems Appl. 157:113482.Crossref, Google Scholar
- (2016) A noise-filtered under-sampling scheme for imbalanced classification. IEEE Trans. Cybernetics 47(12):4263–4274.Crossref, Google Scholar
- (2017) A distance-based weighted undersampling scheme for support vector machines and its application to imbalanced classification. IEEE Trans. Neural Networks Learn. Systems 29(9):4152–4165.Crossref, Google Scholar
- (2020) Toward explaining anomalies: A deep Taylor decomposition of one-class models. Pattern Recognition 101:107198.Crossref, Google Scholar
- (2017) Cost-sensitive learning of deep feature representations from imbalanced data. IEEE Trans. Neural Networks Learn. Systems 29(8):3573–3587.Google Scholar
- (2007) Improving software quality prediction by noise filtering techniques. J. Comput. Sci. Tech. 22(3):387–396.Crossref, Google Scholar
- (2019) Smote-variants: A python implementation of 85 minority oversampling techniques. Neurocomput. 366:352–354.Crossref, Google Scholar
- (2019) Radial-based oversampling for noisy imbalanced data classification. Neurocomput. 343:19–33.Crossref, Google Scholar
- (2018) A survey on addressing high-class imbalance in big data. J. Big Data 5(1):1–30.Crossref, Google Scholar
- (2019) Gradient harmonized single-stage detector. Proc. Conf. AAAI Artificial Intelligence (AAAI Press, Palo Alto, CA), 33:8577–8584.Crossref, Google Scholar
- (2020) Lr-smote—An improved unbalanced data set oversampling based on k-means and svm. Knowledge Base Systems 196:105845.Crossref, Google Scholar
- (2016) Evolutionary cluster-based synthetic oversampling ensemble (eco-ensemble) for imbalance learning. IEEE Trans. Cybernetics 47(9):2850–2861.Crossref, Google Scholar
- (2017a) Focal loss for dense object detection. IEEE Trans. Pattern Analysis Machine Intelligence 99:2999–3007.Google Scholar
- (2017b) Clustering-based undersampling in class-imbalanced data. Inform. Sci. 409:17–26.Crossref, Google Scholar
- (2020) Dealing with class imbalance in classifier chains via random undersampling. Knowledge Base Systems 192:105292.Crossref, Google Scholar
- (2009) Exploratory undersampling for class-imbalance learning. IEEE Trans. Systems Man Cybernetics B Cybernetics 39(2):539–550.Crossref, Google Scholar
- (2020a) Mesa: Boost ensemble imbalanced learning with meta-sampler. Adv. Neural Inform. Processing Systems 33:14463–14474.Google Scholar
- (2020b) Self-paced ensemble for highly imbalanced massive data classification. Proc. IEEE 36th Internat. Conf. on Data Engrg. (ICDE) (IEEE, New York), 841–852.Google Scholar
- (2020) Commercial vehicle activity prediction with imbalanced class distribution using a hybrid sampling and gradient boosting approach. IEEE Trans. Intelligent Transportation Systems 22(3):1401–1410.Crossref, Google Scholar
- (2014) Diversified sensitivity-based undersampling for imbalance classification problems. IEEE Trans. Cybernetics 45(11):2402–2412.Crossref, Google Scholar
- (2020) Imbalance problems in object detection: A review. IEEE Trans. Pattern Anal. Machine Intelligence 43(10):3388–3415.Crossref, Google Scholar
- (2020) Mltl: A multi-label approach for the tomek link undersampling algorithm. Neurocomput. 383:95–105.Crossref, Google Scholar
- (2021) Toward hierarchical classification of imbalanced data using random resampling algorithms. Inform. Sci. 578:344–363.Crossref, Google Scholar
- (2019) Learning deep features for one-class classification. IEEE Trans. Image Processing 28(11):5450–5463.Crossref, Google Scholar
- (2021) Imputation-based ensemble techniques for class imbalance learning. IEEE Trans. Knowledge Data Engrg. 33(5):1988–2001.Google Scholar
- (2020) Improvement of bagging performance for classification of imbalanced data sets using evolutionary multi-objective optimization. Engrg. Appl. Artificial Intelligence 87:103319.Crossref, Google Scholar
- (2018) Handling imbalanced data: A survey. Reddy MS, Viswanath K, KM SP, eds. Internat. Proc. Adv. Soft Comput., Intelligent Systems Appl. (Springer, Berlin), 431–443.Google Scholar
- (2019) Performance comparison of machine learning platforms. INFORMS J. Comput. 31(2):207–225.Link, Google Scholar
- (2015) Smote–ipf: Addressing the noisy and borderline examples problem in imbalanced classification by a re-sampling method with filtering. Inform. Sci. 291:184–203.Crossref, Google Scholar
- (1999) A brief introduction to boosting. Proc. 16th Internat. Joint Conf. Artificial Intelligence, IJCAI’99, vol. 2 (Morgan Kaufmann Publishers Inc., San Francisco), 1401–1406.Google Scholar
- (2009) Rusboost: A hybrid approach to alleviating class imbalance. IEEE Trans. Systems Man Cybernetics A Systems Human 40(1):185–197.Crossref, Google Scholar
- (2015) Binarization with boosting and oversampling for multiclass classification. IEEE Trans. Cybernetics 46(5):1078–1091.Crossref, Google Scholar
- (2021) A neighborhood undersampling stacked ensemble (nus-se) in imbalanced classification. Expert Systems Appl. 168:114246.Crossref, Google Scholar
- (2019) Learning to sample: An active learning framework. Proc. IEEE Internat. Conf. on Data Mining (IEEE, New York), 538–547.Google Scholar
- (2021) Improved parallel magnetic resonance imaging reconstruction with multiple variable density sampling. Sci. Rep. 11(1):1–15.Crossref, Google Scholar
- (2021) Hcbst: An efficient hybrid sampling technique for class imbalance problems. ACM Trans. Knowledge Discovery Data 16(3):1–37.Google Scholar
- (2018) Imbalanced enterprise credit evaluation with dte-sbd: Decision tree ensemble based on smote and bagging with differentiated sampling rates. Inform. Sci. 425:76–91.Crossref, Google Scholar
- (2020) Adaboost-cnn: An adaptive boosting algorithm for convolutional neural networks to classify multi-class imbalanced data sets using transfer learning. Neurocomput. 404:351–366.Crossref, Google Scholar
- (2019) Self-adaptive cost weights-based support vector machine cost-sensitive ensemble for imbalanced data classification. Inform. Sci. 487:31–56.Crossref, Google Scholar
- (2021) A review of methods for imbalanced multi-label classification. Pattern Recognition 118:107965.Crossref, Google Scholar
- (1976) Two modifications of cnn. IEEE Trans. Syst. Man Cybern. 6(11):769–772.Google Scholar
- (2018) An investigation of bankruptcy prediction in imbalanced data sets. Decision Support Systems 112:111–124.Crossref, Google Scholar
- (2022) Gaussian distribution based oversampling for imbalanced data classification. IEEE Trans. Knowledge Data Engrg. 34(2):667–679.Google Scholar
- (2021) A cluster-based oversampling algorithm combining smote and k-means for imbalanced medical data. Inform. Sci. 572:574–589.Crossref, Google Scholar
- (2020) A novel model for imbalanced data classification. Proc. Conf. AAAI Artificial Intelligence, vol. 34 (AAAI Press, Palo Alto, CA), 6680–6687.Crossref, Google Scholar
- (2018) A dbn-based resampling svm ensemble learning paradigm for credit classification with imbalanced data. Appl. Soft Comput. 69:192–202.Crossref, Google Scholar
- (2022) An ensemble learning approach with gradient resampling for class-imbalance problems v2021.0104 URL http://dx.doi.org/10.5281/zenodo.6360996, available for download at https://github.com/INFORMSJoC/2021.0104.Google Scholar
- (2021) Uffdfr: Undersampling framework with denoising, fuzzy c-means clustering, and representative sample selection for imbalanced data classification. Inform. Sci. 576:658–680.Crossref, Google Scholar
- (2021) Efficient utilization of missing data in cost-sensitive learning. IEEE Trans. Knowledge Data Engrg. 33(6):2425–2436.Google Scholar
- (2018) Geometric structural ensemble learning for imbalanced problems. IEEE Trans. Cybernetics 50(4):1617–1629.Crossref, Google Scholar

