Base Model Combination Algorithm for Resolving Tied Predictions for K-Nearest Neighbor OVA Ensemble Models

Published Online:https://doi.org/10.1287/ijoc.1120.0518

References

  • Ali KM, Pazzani J. Error reduction through learning multiple descriptions. Machine Learn. (1996) 24(3):173–202CrossrefGoogle Scholar
  • Bay SD, Kibler D, Pazzani MJ, Smyth P. The UCI KDD archive of large data sets for data mining research and experimentation. ACM SIGKDD (2000) 2(2):81–85CrossrefGoogle Scholar
  • Berry MJA, Linoff GS. Mastering Data Mining: The Art and Science of Customer Relationship Management (2000) (John Wiley & Sons, New York) Google Scholar
  • Bishop CM. Neural Network for Pattern Recognition (1995) (Clarendon Press, Oxford, UK) CrossrefGoogle Scholar
  • Breiman L. Bagging predictors. Machine Learn. (1996) 24(1):123–140CrossrefGoogle Scholar
  • Boser BE, Guyon IM, Vapnik VN, Haussler D. A training algorithm for optimal margin classifiers. Proc. 5th Annual ACM Workshop on Computational Learning Theory (1992) (ACM, New York) 144–152CrossrefGoogle Scholar
  • Cohen P. Empirical Methods for Artificial Intelligence (1995) (MIT Press, Cambridge, MA) Google Scholar
  • Cover TM, Hart PE. Nearest neighbor pattern classification. IEEE Trans. Inform. Theory (1967) IT-13(1):21–27CrossrefGoogle Scholar
  • Dietterich T, Bakiri G. Solving multiclass learning problems via error-correcting output codes. J. Artificial Intelligence Res. (1995) 2:263–286CrossrefGoogle Scholar
  • Dietterich T, Kong E. Machine learning bias, statistical bias, and statistical variance of decision tree algorithms. (1995) . Technical report, Department of Computer Science, Oregon State University, CorvallisGoogle Scholar
  • Freund Y, Schapire R. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. System Sci. (1997) 55(1):119–139CrossrefGoogle Scholar
  • Galar M, Fernández Z, Barenenchea E, Bustince H, Herrara F. An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study of one-vs-one and one-vs-all schemes. Pattern Recognition (2011) 44(8):1761–1776CrossrefGoogle Scholar
  • Giudici P. Applied Data Mining: Statistical Methods for Business and Industry (2003) (John Wiley & Sons, Chichester) Google Scholar
  • Hand DJ. Construction and Assessment of Classification Rules (1997) (John Wiley & Sons, Chichester) Google Scholar
  • Hand DJ, Manila H, Smyth P. Principles of Data Mining (2001) (MIT Press, Cambridge, MA) Google Scholar
  • Hansen LK, Salamon P. Neural network ensembles. IEEE Trans. Pattern Anal. Machine Intelligence (1990) 12(10):993–1001CrossrefGoogle Scholar
  • Hettich S, Bay SD. The UCI KDD archive. (1999) . Department of Information and Computer Science. University of California, Irvine, http://kdd.ics.uci.eduGoogle Scholar
  • Ho T. The random subspace method for constructing decision forests. IEEE Trans. Pattern Anal. Machine Intelligence (1998) 20(8):832–844CrossrefGoogle Scholar
  • Kittler J. Combining classifiers: A theoretical framework. Pattern Anal. Appl. (1998) 1(1):18–27CrossrefGoogle Scholar
  • Krogh A, Vedelsby J, Tesauro G, Touretzky DS, Leen TK. Neural network ensembles, cross validation and active learning. Advances in Neural Information Processing Systems (1995) (MIT Press, Cambridge, MA) 231–238Google Scholar
  • Kuncheva LI. Combining Pattern Classifiers: Methods and Algorithms (2004) (John Wiley & sons, Hoboken, NJ) CrossrefGoogle Scholar
  • Kwok SW, Carter C, Shachter RD, Levitt TS, Kanal LN, Lemmer JF. Multiple decision trees. Proc. Fourth Annual Conf. Uncertainty in Artificial Intelligence (1990) (North-Holland, Amsterdam) 327–338CrossrefGoogle Scholar
  • Laskov P, Düssel P, Schäfer C, Rieck K. Learning intrusion detection: Supervised or unsupervised? ICAP: International Conf. Image Anal. Processing, Cagliari, Italy (2005) CrossrefGoogle Scholar
  • Lutu PEN. Data set selection for aggregate model implementation in predictive data mining. (2010) . Ph.D. thesis, Department of Computer Science, University of Pretoria, South Africa, http://upetd.up.ac.za/thesis/available/etd-11152010–203041/Google Scholar
  • Lutu PEN. Using confusion matrices and confusion graphs to design ensemble classification models from large data sets. Thirteenth Internat. Conf. Data Warehousing and Knowledge Discovery, Dawak 2011 Toulouse, France (2011) Google Scholar
  • Lutu PEN, Engelbrecht AP. A decision rule-based method for feature selection in predictive data mining. Expert Systems Appl. (2010) 37(1):602–609CrossrefGoogle Scholar
  • Lutu PEN, Engelbrecht AP. Using OVA modeling to improve classification performance for large data sets. Expert Systems Appl. (2012) 39(4):4358–4376CrossrefGoogle Scholar
  • Mitchell TM. Machine Learning (1997) (WCB/McGraw-Hill, Burr Ridge, IL) Google Scholar
  • Olken F. Random sampling from databases. (1993) . Ph.D. thesis, Department of Computer Science, University of California at BerkeleyGoogle Scholar
  • Ooi CH, Chetty M, Teng SW. Differential prioritization in feature selection and classifier aggregation for multiclass microarray data sets. Data Mining Knowledge Discovery (2007) 14:329–366CrossrefGoogle Scholar
  • Osei-Bryson K-M, Kah MO, Kah JML. Selecting predictive models for inclusion in an ensemble. The 18th Triennial Conf. Internat. Federation Oper. Res. Societies, Sandton, Johannesburg (2008) Google Scholar
  • Rao PSRS. Sampling Methodologies with Applications (2000) (Chapman & Hall/CRC, Boca Raton, FL) CrossrefGoogle Scholar
  • Rifkin R, Klautau A. In defense of one-vs-all classification. J. Machine Learn. Res. (2004) 5:101–141Google Scholar
  • Schapire R, Denison DD, Hansen MH, Holmes CC, Mallick B, Yu B. The boosting approach to machine learning: An overview. Nonlinear Estimation and Classification. (2003) 171(Springer, New York) 149–172Lecture Notes in StatisticsCrossrefGoogle Scholar
  • Shin SW, Lee CH. Using attack-specific feature subsets for network intrusion detection. Proc. 19th Australian Conf. Artificial Intelligence (2006) (Hobart, Australia) CrossrefGoogle Scholar
  • Wu X, Kumar V, Quinlan JR, Ghosh J, Yang Q, Motoda H, Mclachlan GJ, et al. Top 10 algorithms in data mining. Knowledge Inform. Systems (2007) 14(1):1–37CrossrefGoogle Scholar
  • Zhang Y, Burer S, Street WN. Ensemble pruning via semi-definite programming. J. Machine Learn. Res. (2006) 7:1315–1338Google Scholar
  • Zhou Z-H, Wu J, Tang W. Ensembling neural networks many could be better than all. Artificial Intelligence (2002) 137:239–263CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.