235
Views
11
CrossRef citations to date
0
Altmetric
Original Articles

Out-of-Bag Estimation of the Optimal Hyperparameter in SubBag Ensemble Method

, &
Pages 1877-1892 | Received 14 May 2010, Accepted 21 Aug 2010, Published online: 29 Oct 2010

References

  • Amalral , G. J. A. , Ferreira , M. R. P. ( 2009 ). New bootstrap applications in supervised learning . Communications in Statistics – Simulation and Computation 38 ( 2 ): 416 – 425 .
  • Asuncion , A. , Newman , D. ( 2007 ). UCI Machine Learning Repository. http:\\www.ics. uci.edu\ ~mlearn\MLRepository.html. Irvine , CA : University of California, School of Information and Computer Science .
  • Bernard , S. , Heutte , L. , Adam , S. (2009). Influence of hyperparameters on random forest accuracy. In: Benediktsson , J. A. , Kittler , J. , Roli , F. , eds., Multiple Classifier Systems . Berlin : Springer, pp. 171–180.
  • Breiman , L. ( 1996a ). Bagging predictors . Machine Learning 24 ( 2 ): 123 – 140 .
  • Breiman , L. ( 1996b ). Out-of-bag estimation. Tech. Report, Statistics Department, University of California, Berkely, California .
  • Breiman , L. ( 2001 ). Random forests . Machine Learning 45 ( 1 ): 5 – 32 .
  • Breiman , L. , Friedman , J. , Olshen , R. , Stone , C. ( 1984 ). Classification and Regression Trees . New York : Chapman & Hall, CRC Press .
  • Bylander , T. ( 2002 ). Estimating generalization error on two-class datasets using out-of-bag estimates . Machine Learning 48 ( 1 ): 287 – 297 .
  • Croux , C. , Joossens , K. , Lemmens , A. ( 2007 ). Trimmed bagging . Computational Statistics & Data Analysis 52 ( 1 ): 362 – 368 .
  • Dettling , M. ( 2004 ). Bagboosting for tumor classification with gene expression data . Bioinformatics 20 ( 18 ): 3583 – 3593 .
  • Efron , B. , Tibshirani , R. ( 1993 ). An Introduction to the Bootstrap . London : Chapman & Hall/CRC .
  • Hernández-Lobato , D. , Martínez-Muñoz , G. , Suárez , A. ( 2007 ). Out of bootstrap estimation of generalization error curves in bagging ensembles . In: Yin , H.-J. , Tino , P. , Corchado , E. , Byrne , W. , Yao , X. , eds., Intelligent Data Engineering and Automated Learning – IDEAL . Berlin : Springer , pp. 47 – 56 .
  • Ho , T. K. ( 1998 ). The random subspace method for constructing decision forests . IEEE Transactions on Pattern Analysis and Machine Intelligence 20 ( 8 ): 832 – 844 .
  • Hothorn , T. , Lausen , B. ( 2003 ). Double-bagging: combining classifiers by bootstrap aggregation . Pattern Recognition 36 ( 6 ): 1303 – 1309 .
  • Hothorn , T. , Lausen , B. ( 2005 ). Bundling classifiers by bagging trees . Computational Statistics & Data Analysis 49 ( 4 ): 1068 – 1078 .
  • Kuncheva , L. I. ( 2004 ). Combining Pattern Classifiers: Methods and Algorithms. Wiley-Interscience .
  • Lawrence , R. , Bunn , A. , Powell , S. , Zambon , M. ( 2004 ). Classification of remotely sensed imagery using stochastic gradient boosting as a refinement of classification tree analysis . Remote Sensing of Environment 90 ( 3 ): 331 – 336 .
  • Martínez-Muñoz , G. , Suárez , A. ( 2010 ). Out-of-bag estimation of the optimal sample size in bagging . Pattern Recognition 43 ( 1 ): 143 – 152 .
  • Panov , P. , Džeroski , S. ( 2007 ). Combining bagging and random subspaces to create better ensembles . In: Berthold , M. R. , Shawe-Taylor , J. , Lavrač , N. , eds., Advances in Intelligent Data Analysis VII . Berlin : Springer , pp. 118 – 129 .
  • Polikar , R. ( 2006 ). Ensemble based systems in decision making . IEEE Circuits and Systems Magazine 6 ( 3 ): 21 – 45 .
  • Polikar , R. , Topalis , A. , Parikh , D. , Green , D. , Frymiare , J. , Kounios , J. , Clark , C. M. ( 2008 ). An ensemble based data fusion approach for early diagnosis of alzheimer's disease . Information Fusion 9 ( 1 ): 83 – 95 .
  • Rao , S. , Tibshirani , R. ( 1996 ). The out-of-bootstrap method for model averaging and selection. Techinical Report, University of Toronto, Toronto, Canada .
  • Rokach , L. ( 2009 ). Taxonomy for characterizing ensemble methods in classification tasks: a review and annotated bibliography . Computational Statistics & Data Analysis 53 ( 12 ): 4046 – 4072 .
  • Tibshirani , R. ( 1996 ). Bias, variance and prediction error for classification rules. Technical Report, Department of Preventive Medicine and Biostatistics, Department of Statistics, University of Toronto, Canada .
  • Webb , G. I. ( 2000 ). Multiboosting: a technique for combining boosting and wagging . Machine Learning 40 ( 2 ): 159 – 196 .
  • Wolpert , D. H. , Macready , W. G. (1996). Combining stacking with bagging to improve a learning algorithm. Technical Report, Santa Fe Institute, Santa Fe, New Mexico.
  • Wolpert , D. H. , Macready , W. G. ( 1999 ). An efficient method to estimate bagging's generalization error . Machine Learning 35 ( 1 ): 41 – 55 .
  • Zhang , C. X. , Zhang , J. S. ( 2009 ). A novel method for constructing ensemble classifiers . Statistics and Computing 19 ( 3 ): 317 – 327 .
  • Zhang , C. X. , Zhang , J. S. , Zhang , G. Y. ( 2009 ). Using boosting to prune double-bagging ensembles . Computational Statistics & Data Analysis 53 ( 4 ): 1218 – 1231 .

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.