Base Model Combination Algorithm for Resolving Tied Predictions for K-Nearest Neighbor OVA Ensemble Models

Patricia E. N. Lutu
Patricia E. N. Lutu
[email protected]
Department of Computer Science, University of Pretoria, Pretoria 0002, South Africa
Search for more papers by this author
,
Andries P. Engelbrecht
Andries P. Engelbrecht
[email protected]
Department of Computer Science, University of Pretoria, Pretoria 0002, South Africa
Search for more papers by this author

Patricia E. N. Lutu

[email protected]

Department of Computer Science, University of Pretoria, Pretoria 0002, South Africa

Search for more papers by this author

Andries P. Engelbrecht

[email protected]

Department of Computer Science, University of Pretoria, Pretoria 0002, South Africa

Search for more papers by this author

Published Online:14 Aug 2012https://doi.org/10.1287/ijoc.1120.0518

References

Ali KM, Pazzani J. Error reduction through learning multiple descriptions. Machine Learn. (1996) 24(3):173–202Crossref, Google Scholar
Bay SD, Kibler D, Pazzani MJ, Smyth P. The UCI KDD archive of large data sets for data mining research and experimentation. ACM SIGKDD (2000) 2(2):81–85Crossref, Google Scholar
Berry MJA, Linoff GS. Mastering Data Mining: The Art and Science of Customer Relationship Management (2000) (John Wiley & Sons, New York) Google Scholar
Bishop CM. Neural Network for Pattern Recognition (1995) (Clarendon Press, Oxford, UK) Crossref, Google Scholar
Breiman L. Bagging predictors. Machine Learn. (1996) 24(1):123–140Crossref, Google Scholar
Boser BE, Guyon IM, Vapnik VN, Haussler D. A training algorithm for optimal margin classifiers. Proc. 5th Annual ACM Workshop on Computational Learning Theory (1992) (ACM, New York) 144–152Crossref, Google Scholar
Cohen P. Empirical Methods for Artificial Intelligence (1995) (MIT Press, Cambridge, MA) Google Scholar
Cover TM, Hart PE. Nearest neighbor pattern classification. IEEE Trans. Inform. Theory (1967) IT-13(1):21–27Crossref, Google Scholar
Dietterich T, Bakiri G. Solving multiclass learning problems via error-correcting output codes. J. Artificial Intelligence Res. (1995) 2:263–286Crossref, Google Scholar
Dietterich T, Kong E. Machine learning bias, statistical bias, and statistical variance of decision tree algorithms. (1995) . Technical report, Department of Computer Science, Oregon State University, CorvallisGoogle Scholar
Freund Y, Schapire R. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. System Sci. (1997) 55(1):119–139Crossref, Google Scholar
Galar M, Fernández Z, Barenenchea E, Bustince H, Herrara F. An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study of one-vs-one and one-vs-all schemes. Pattern Recognition (2011) 44(8):1761–1776Crossref, Google Scholar
Giudici P. Applied Data Mining: Statistical Methods for Business and Industry (2003) (John Wiley & Sons, Chichester) Google Scholar
Hand DJ. Construction and Assessment of Classification Rules (1997) (John Wiley & Sons, Chichester) Google Scholar
Hand DJ, Manila H, Smyth P. Principles of Data Mining (2001) (MIT Press, Cambridge, MA) Google Scholar
Hansen LK, Salamon P. Neural network ensembles. IEEE Trans. Pattern Anal. Machine Intelligence (1990) 12(10):993–1001Crossref, Google Scholar
Hettich S, Bay SD. The UCI KDD archive. (1999) . Department of Information and Computer Science. University of California, Irvine, http://kdd.ics.uci.eduGoogle Scholar
Ho T. The random subspace method for constructing decision forests. IEEE Trans. Pattern Anal. Machine Intelligence (1998) 20(8):832–844Crossref, Google Scholar
Kittler J. Combining classifiers: A theoretical framework. Pattern Anal. Appl. (1998) 1(1):18–27Crossref, Google Scholar
Krogh A, Vedelsby J, Tesauro G, Touretzky DS, Leen TK. Neural network ensembles, cross validation and active learning. Advances in Neural Information Processing Systems (1995) (MIT Press, Cambridge, MA) 231–238Google Scholar
Kuncheva LI. Combining Pattern Classifiers: Methods and Algorithms (2004) (John Wiley & sons, Hoboken, NJ) Crossref, Google Scholar
Kwok SW, Carter C, Shachter RD, Levitt TS, Kanal LN, Lemmer JF. Multiple decision trees. Proc. Fourth Annual Conf. Uncertainty in Artificial Intelligence (1990) (North-Holland, Amsterdam) 327–338Crossref, Google Scholar
Laskov P, Düssel P, Schäfer C, Rieck K. Learning intrusion detection: Supervised or unsupervised? ICAP: International Conf. Image Anal. Processing, Cagliari, Italy (2005) Crossref, Google Scholar
Lutu PEN. Data set selection for aggregate model implementation in predictive data mining. (2010) . Ph.D. thesis, Department of Computer Science, University of Pretoria, South Africa, http://upetd.up.ac.za/thesis/available/etd-11152010–203041/Google Scholar
Lutu PEN. Using confusion matrices and confusion graphs to design ensemble classification models from large data sets. Thirteenth Internat. Conf. Data Warehousing and Knowledge Discovery, Dawak 2011 Toulouse, France (2011) Google Scholar
Lutu PEN, Engelbrecht AP. A decision rule-based method for feature selection in predictive data mining. Expert Systems Appl. (2010) 37(1):602–609Crossref, Google Scholar
Lutu PEN, Engelbrecht AP. Using OVA modeling to improve classification performance for large data sets. Expert Systems Appl. (2012) 39(4):4358–4376Crossref, Google Scholar
Mitchell TM. Machine Learning (1997) (WCB/McGraw-Hill, Burr Ridge, IL) Google Scholar
Olken F. Random sampling from databases. (1993) . Ph.D. thesis, Department of Computer Science, University of California at BerkeleyGoogle Scholar
Ooi CH, Chetty M, Teng SW. Differential prioritization in feature selection and classifier aggregation for multiclass microarray data sets. Data Mining Knowledge Discovery (2007) 14:329–366Crossref, Google Scholar
Osei-Bryson K-M, Kah MO, Kah JML. Selecting predictive models for inclusion in an ensemble. The 18th Triennial Conf. Internat. Federation Oper. Res. Societies, Sandton, Johannesburg (2008) Google Scholar
Rao PSRS. Sampling Methodologies with Applications (2000) (Chapman & Hall/CRC, Boca Raton, FL) Crossref, Google Scholar
Rifkin R, Klautau A. In defense of one-vs-all classification. J. Machine Learn. Res. (2004) 5:101–141Google Scholar
Schapire R, Denison DD, Hansen MH, Holmes CC, Mallick B, Yu B. The boosting approach to machine learning: An overview. Nonlinear Estimation and Classification. (2003) 171(Springer, New York) 149–172Lecture Notes in StatisticsCrossref, Google Scholar
Shin SW, Lee CH. Using attack-specific feature subsets for network intrusion detection. Proc. 19th Australian Conf. Artificial Intelligence (2006) (Hobart, Australia) Crossref, Google Scholar
Wu X, Kumar V, Quinlan JR, Ghosh J, Yang Q, Motoda H, Mclachlan GJ, et al. Top 10 algorithms in data mining. Knowledge Inform. Systems (2007) 14(1):1–37Crossref, Google Scholar
Zhang Y, Burer S, Street WN. Ensemble pruning via semi-definite programming. J. Machine Learn. Res. (2006) 7:1315–1338Google Scholar
Zhou Z-H, Wu J, Tang W. Ensembling neural networks many could be better than all. Artificial Intelligence (2002) 137:239–263Crossref, Google Scholar

cover image INFORMS Journal on Computing

Volume 25, Issue 3

Summer 2013

Pages 395-598

Article Information

Metrics

Information

Received:November 01, 2010
Accepted:April 01, 2012
Published Online:August 14, 2012

Cite as

Patricia E. N. Lutu, Andries P. Engelbrecht, (2012) Base Model Combination Algorithm for Resolving Tied Predictions for K-Nearest Neighbor OVA Ensemble Models. INFORMS Journal on Computing 25(3):517-526.

https://doi.org/10.1287/ijoc.1120.0518

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Base Model Combination Algorithm for Resolving Tied Predictions for K-Nearest Neighbor OVA Ensemble Models

References

Volume 25, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News