Base Model Combination Algorithm for Resolving Tied Predictions for K-Nearest Neighbor OVA Ensemble Models
Published Online:14 Aug 2012https://doi.org/10.1287/ijoc.1120.0518
References
- . Error reduction through learning multiple descriptions. Machine Learn. (1996) 24(3):173–202Crossref, Google Scholar
- . The UCI KDD archive of large data sets for data mining research and experimentation. ACM SIGKDD (2000) 2(2):81–85Crossref, Google Scholar
- . Mastering Data Mining: The Art and Science of Customer Relationship Management (2000) (John Wiley & Sons, New York) Google Scholar
- . Neural Network for Pattern Recognition (1995) (Clarendon Press, Oxford, UK) Crossref, Google Scholar
- . Bagging predictors. Machine Learn. (1996) 24(1):123–140Crossref, Google Scholar
- , Haussler D. A training algorithm for optimal margin classifiers. Proc. 5th Annual ACM Workshop on Computational Learning Theory (1992) (ACM, New York) 144–152Crossref, Google Scholar
- . Empirical Methods for Artificial Intelligence (1995) (MIT Press, Cambridge, MA) Google Scholar
- . Nearest neighbor pattern classification. IEEE Trans. Inform. Theory (1967) IT-13(1):21–27Crossref, Google Scholar
- . Solving multiclass learning problems via error-correcting output codes. J. Artificial Intelligence Res. (1995) 2:263–286Crossref, Google Scholar
- . Machine learning bias, statistical bias, and statistical variance of decision tree algorithms. (1995) . Technical report, Department of Computer Science, Oregon State University, CorvallisGoogle Scholar
- . A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. System Sci. (1997) 55(1):119–139Crossref, Google Scholar
- . An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study of one-vs-one and one-vs-all schemes. Pattern Recognition (2011) 44(8):1761–1776Crossref, Google Scholar
- . Applied Data Mining: Statistical Methods for Business and Industry (2003) (John Wiley & Sons, Chichester) Google Scholar
- . Construction and Assessment of Classification Rules (1997) (John Wiley & Sons, Chichester) Google Scholar
- . Principles of Data Mining (2001) (MIT Press, Cambridge, MA) Google Scholar
- . Neural network ensembles. IEEE Trans. Pattern Anal. Machine Intelligence (1990) 12(10):993–1001Crossref, Google Scholar
- . The UCI KDD archive. (1999) . Department of Information and Computer Science. University of California, Irvine, http://kdd.ics.uci.eduGoogle Scholar
- . The random subspace method for constructing decision forests. IEEE Trans. Pattern Anal. Machine Intelligence (1998) 20(8):832–844Crossref, Google Scholar
- . Combining classifiers: A theoretical framework. Pattern Anal. Appl. (1998) 1(1):18–27Crossref, Google Scholar
- , Tesauro G, Touretzky DS, Leen TK. Neural network ensembles, cross validation and active learning. Advances in Neural Information Processing Systems (1995) (MIT Press, Cambridge, MA) 231–238Google Scholar
- . Combining Pattern Classifiers: Methods and Algorithms (2004) (John Wiley & sons, Hoboken, NJ) Crossref, Google Scholar
- , Shachter RD, Levitt TS, Kanal LN, Lemmer JF. Multiple decision trees. Proc. Fourth Annual Conf. Uncertainty in Artificial Intelligence (1990) (North-Holland, Amsterdam) 327–338Crossref, Google Scholar
- . Learning intrusion detection: Supervised or unsupervised? ICAP: International Conf. Image Anal. Processing, Cagliari, Italy (2005) Crossref, Google Scholar
- . Data set selection for aggregate model implementation in predictive data mining. (2010) . Ph.D. thesis, Department of Computer Science, University of Pretoria, South Africa, http://upetd.up.ac.za/thesis/available/etd-11152010–203041/Google Scholar
- . Using confusion matrices and confusion graphs to design ensemble classification models from large data sets. Thirteenth Internat. Conf. Data Warehousing and Knowledge Discovery, Dawak 2011 Toulouse, France (2011) Google Scholar
- . A decision rule-based method for feature selection in predictive data mining. Expert Systems Appl. (2010) 37(1):602–609Crossref, Google Scholar
- . Using OVA modeling to improve classification performance for large data sets. Expert Systems Appl. (2012) 39(4):4358–4376Crossref, Google Scholar
- . Machine Learning (1997) (WCB/McGraw-Hill, Burr Ridge, IL) Google Scholar
- . Random sampling from databases. (1993) . Ph.D. thesis, Department of Computer Science, University of California at BerkeleyGoogle Scholar
- . Differential prioritization in feature selection and classifier aggregation for multiclass microarray data sets. Data Mining Knowledge Discovery (2007) 14:329–366Crossref, Google Scholar
- . Selecting predictive models for inclusion in an ensemble. The 18th Triennial Conf. Internat. Federation Oper. Res. Societies, Sandton, Johannesburg (2008) Google Scholar
- . Sampling Methodologies with Applications (2000) (Chapman & Hall/CRC, Boca Raton, FL) Crossref, Google Scholar
- . In defense of one-vs-all classification. J. Machine Learn. Res. (2004) 5:101–141Google Scholar
- , Denison DD, Hansen MH, Holmes CC, Mallick B, Yu B. The boosting approach to machine learning: An overview. Nonlinear Estimation and Classification. (2003) 171(Springer, New York) 149–172Lecture Notes in StatisticsCrossref, Google Scholar
- . Using attack-specific feature subsets for network intrusion detection. Proc. 19th Australian Conf. Artificial Intelligence (2006) (Hobart, Australia) Crossref, Google Scholar
- Top 10 algorithms in data mining. Knowledge Inform. Systems (2007) 14(1):1–37Crossref, Google Scholar
- . Ensemble pruning via semi-definite programming. J. Machine Learn. Res. (2006) 7:1315–1338Google Scholar
- . Ensembling neural networks many could be better than all. Artificial Intelligence (2002) 137:239–263Crossref, Google Scholar

