Home > Ensembles
Boosting
Boosting (Schapire, 1990) is a meta-algorithm which can be viewed as a model averaging method. It is the most widely used ensemble method and one of the most powerful learning ideas introduced in the last ten years. Originally designed for classification, it can also be profitably extended to regression. We first create a “weak” classifier, that is, it suffices that its accuracy on the training set is only slightly better than a random guessing. A succession of models are built iteratively, each one being trained on a data set in which points misclassified (or, with regression, those poorly predicted) by the previous model are given more weight. Finally, all of the successive models are weighted according to their success and then the outputs are combined using voting (for classification) or averaging (for regression), thus creating a final model. The original boosting algorithm combined three weak learners to generate a strong learner.
AdaBoost
AdaBoost (Freund and Schapire, 1996), short for “adaptive boosting”, is the most popular boosting algorithm. It uses the same training set over and over again (thus it need not be large) and can also combine an arbitrary number of base-learners.
Original Paper
SCHAPIRE, Robert E., 1990. The strength of weak learnability , Machine Learning , Volume 5, Number 2, June, Pages 197-227. [Cited by 674 ] (41.46/year)
Abstract: "This paper addresses the problem of improving the accuracy of an hypothesis output by a learning algorithm in the distribution-free (PAC ) learning model. A concept class is learnable (or strongly learnable ) if, given access to a source of examples of the unknown concept, the learner with high probability is able to output an hypothesis that is correct on all but an arbitrarily small fraction of the instances. The concept class is weakly learnable if the learner can produce an hypothesis that performs only slightly better than random guessing. In this paper, it is shown that these two notions of learnability are equivalent.
A method is described for converting a weak learning algorithm into one that achieves arbitrarily high accuracy. This construction may have practical applications as a tool for efficiently converting a mediocre learning algorithm into one that performs extremely well. In addition, the construction has some interesting theoretical consequences, including a set of general upper bounds on the complexity of any strong learning algorithm as a function of the allowed error ε."
AdaBoost Paper
FREUND, Yoav and Robert E. SCHAPIRE, 1996. Experiments with a new boosting algorithm , Machine Learning: Proceedings of the Thirteenth International Conference (ICML '96) , edited by Lorenza Saitta, pages 148-156, Morgan Kaufmann. [Cited by 1404 ] (136.91/year)
Abstract: "In an earlier paper, we introduced a new “boosting” algorithm called AdaBoost which, theoretically, can be used to significantly reduce the error of any learning algorithm that consistently generates classifiers whose performance is a little better than random guessing. We also introduced the related notion of a “pseudo-loss” which is a method for forcing a learning algorithm of multi-label concepts to concentrate on the labels that are hardest to discriminate. In this paper, we describe experiments we carried out to assess how well AdaBoost with and without pseudo-loss, performs on real learning problems.
We performed two sets of experiments. The first set compared boosting to Breiman's “bagging” method when used to aggregate various classifiers (including decision trees and single attribute-value tests). We compared the performance of the two methods on a collection of machine-learning benchmarks. In the second set of experiments, we studied in more detail the performance of boosting using a nearest-neighbor classifier on an OCR problem."
10 Important Papers
FRIEDMAN, J.H., T. HASTIE and R. TIBSHIRANI, 2000. Additive logistic regression: a statistical view of boosting (With discussion and a rejoinder by the … . Ann. Statist. [Cited by 639 ] (102.16/year)
FREUND, Y. and R.E. SCHAPIRE, 1996. Experiments with a new boosting algorithm . Machine Learning: Proceedings of the Thirteenth …. [Cited by 1404 ] (136.91/year)
FREUND, Y. and R.E. SCHAPIRE, 1997. A decision-theoretic generalization of on-line learning and an application to boosting . Journal of Computer and System Sciences. [Cited by 1576 ] (170.29/year)
FREUND, Y., 1995. Boosting a weak learning algorithm by majority . Information and Computation. [Cited by 397 ] (35.27/year)
SCHAPIRE, R.E., et al. , 1998. Boosting the Margin: A New Explanation for the Effectiveness of Voting Methods . The Annals of Statistics. [Cited by 779 ] (94.37/year)
SCHAPIRE, R.E. and Y. SINGER, 1999. Improved Boosting Algorithms Using Confidence-rated Predictions . Machine Learning. [Cited by 648 ] (89.32/year)
QUINLAN, J.R., 1996. Bagging, boosting, and C4. 5 . Proceedings of the Thirteenth National Conference on …. [Cited by 456 ] (44.47/year)
BAUER, E. and R. KOHAVI, 1999. An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants . Machine Learning. [Cited by 576 ] (79.40/year)
SCHNEIDER, J., et al. , … and complete protective efficacy of malaria DNA vaccination by boosting with modified vaccinia … . nature.com. [Cited by 314 ] (?/year)
SCHAPIRE, R.E., margin. A Brief Introduction to Boosting . [Cited by 253 ] (?/year)
Links
Bibliography
ABNEY, S., R.E. SCHAPIRE and Y. SINGER, 1999. Boosting applied to tagging and PP attachment . Proceedings of the Joint SIGDAT Conference on Empirical …. [Cited by 72 ] (9.92/year)
ALLEN, T.M., et al. , 2000. Induction of AIDS Virus-Specific CTL Activity in Fresh, Unstimulated Peripheral Blood Lymphocytes … . The Journal of Immunology. [Cited by 152 ] (24.30/year)
BARRIOS, C., et al. , 1996. … from adult responses: predominance of a Th2-biased pattern which persists after adult boosting. . Eur J Immunol. [Cited by 121 ] (11.80/year)
BAUER, E. and R. KOHAVI, 1999. An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants . Machine Learning. [Cited by 576 ] (79.40/year)
BERTHOLD, M.R. and J. DIAMOND…, 1995. Boosting the performance of RBF networks with dynamic decay adjustment . Advances in Neural Information Processing Systems. [Cited by 60 ] (5.33/year)
BETZ, M. and R. GOLDFLAM, 1983. Boosting the bag . Physical Review D. [Cited by 77 ] (3.31/year)
BOOSTING, P.I., HIV-Protease Inhibitors: Viral Resistance, Pharmacokinetic Boosting, and Medication Adherence . uspharmacist.com. [not cited] (?/year)
BROOKS, J.V., et al. , 2001. Boosting Vaccine for Tuberculosis . Infection and Immunity. [Cited by 55 ] (10.47/year)
BUHLMANN, P. and B. YU, 2003. Boosting With the L 2 Loss: Regression and Classification . Journal of the American Statistical Association. [Cited by 79 ] (24.27/year)
BURNS, D.L., et al. , 2001. … and Protective Efficacy of a Tuberculosis DNA Vaccine Encoding Ag85 by Protein Boosting . Infection and Immunity. [Cited by 80 ] (15.22/year)
CARRERAS, X. and L. MARQUEZ, 2001. Boosting Trees for Anti-Spam Email Filtering . Arxiv preprint cs.CL/0109015. [Cited by 86 ] (16.37/year)
COLLINS, M., 2001. Ranking algorithms for named-entity extraction: boosting and the voted perceptron . Proceedings of the 40th Annual Meeting on Association for …. [Cited by 67 ] (12.75/year)
COONEY, E.L., et al. , 1993. … of Priming with a Vaccinia Recombinant Expressing HIV Envelope and Boosting with gp160 Protein . Proceedings of the National Academy of Sciences. [Cited by 64 ] (4.83/year)
DAVIS, H.L., et al. , 1996. DNA-mediated immunization to hepatitis B surface antigen: longevity of primary response and effect … . Vaccine. [Cited by 111 ] (10.82/year)
DE, G. and M. LENZERINI, 1994. Boosting the correspondence between description logics and propositional dynamic logics . Proc. of the 12th Nat. Conf. on Artificial Intelligence ( …. [Cited by 97 ] (7.92/year)
DEGANO, P., et al. , 1999. Gene gun intradermal DNA immunization followed by boosting with modified vaccinia virus Ankara: … . Vaccine. [Cited by 48 ] (6.62/year)
DETTLING, M. and P. B?HLMANN, 2003. Boosting for tumor classification with gene expression data . Bioinformatics. [Cited by 54 ] (16.59/year)
DHODAPKAR, M.V., et al. , 2000. Mature dendritic cells boost functionally superior CD8+ T-cell in humans without foreign helper … . [Cited by 106 ] (16.95/year)
DIETTERICH, T.G., 2000. … Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and … . Machine Learning. [Cited by 404 ] (64.59/year)
DRUCKER, H., 1997. Improving regressors using boosting techniques . Proceedings of the Fourteenth International Conference on …. [Cited by 68 ] (7.35/year)
DRUCKER, H., et al. , 1994. Boosting and other ensemble methods . Neural computation. [Cited by 105 ] (8.57/year)
DRUCKER, H., R. SCHAPIRE and P. SIMARD, 1993. Boosting performance in neural networks . International journal of pattern recognition and artificial …. [Cited by 85 ] (6.41/year)
DRUCKER, H., R.E. SCHAPIRE and P. SIMARD, 1992. Improving Performance in Neural Networks Using a Boosting Algorithm . Advances in Neural Information Processing Systems 5,[NIPS …. [Cited by 99 ] (6.95/year)
EARL, P.L., et al. , 2002. Comparison of Vaccine Strategies Using Recombinant env-gag-pol MVA with or without an Oligomeric … . Virology. [Cited by 40 ] (9.40/year)
ELKAN, C., 1997. Boosting and Naive Bayesian Learning . proceeding of KDD-97, New Port beach, CA. [Cited by 69 ] (7.46/year)
EO, S.K., et al. , 2001. Prime-Boost Immunization with DNA Vaccine: Mucosal Route of Administration Changes the Rules1 . The Journal of Immunology. [Cited by 46 ] (8.75/year)
ESCUDERO, G., L. MARQUEZ and G. RIGAU, 2000. Boosting Applied to Word Sense Disambiguation . Arxiv preprint cs.CL/0007010. [Cited by 68 ] (10.87/year)
FAN, W., et al. , 1999. AdaCost: misclassification cost-sensitive boosting . Proc. 16th International Conf. on Machine Learning. [Cited by 65 ] (8.96/year)
FREUND, Y. and R.E. SCHAPIRE, 1996. Experiments with a new boosting algorithm . Machine Learning: Proceedings of the Thirteenth …. [Cited by 1404 ] (136.91/year)
FREUND, Y. and R.E. SCHAPIRE, 1996. Game theory, on-line prediction and boosting . Proceedings of the ninth annual conference on Computational …. [Cited by 111 ] (10.82/year)
FREUND, Y. and R.E. SCHAPIRE, 1997. A decision-theoretic generalization of on-line learning and an application to boosting . Journal of Computer and System Sciences. [Cited by 1576 ] (170.29/year)
FREUND, Y. and R.E. SCHAPIRE, 1999. A short introduction to boosting . Journal of Japanese Society for Artificial Intelligence. [Cited by 209 ] (28.81/year)
FREUND, Y., 1995. Boosting a weak learning algorithm by majority . Information and Computation. [Cited by 397 ] (35.27/year)
FREUND, Y., 2001. An Adaptive Version of the Boost by Majority Algorithm . Machine Learning. [Cited by 78 ] (14.84/year)
FREUND, Y., et al. , 2003. An efficient boosting algorithm for combining preferences . Journal of Machine Learning Research. [Cited by 165 ] (50.70/year)
FRIEDMAN, J.H., 2001. Greedy Function Approximation: A Gradient Boosting Machine . The Annals of Statistics. [Cited by 254 ] (48.34/year)
FRIEDMAN, J.H., 2002. Stochastic gradient boosting . Computational Statistics and Data Analysis. [Cited by 62 ] (14.57/year)
FRIEDMAN, J.H., T. HASTIE and R. TIBSHIRANI, 2000. Additive logistic regression: a statistical view of boosting (With discussion and a rejoinder by the … . Ann. Statist. [Cited by 639 ] (102.16/year)
GASPERINI, M., J. MAHARANA and G. VENEZIANO, Arxiv preprint hep-th/9209052. Boosting Away Singularities from Conformal String Backgrounds . [Cited by 56 ] (?/year)
GILBERT, S.C., et al. , 2002. Enhanced CD8 T cell immunogenicity and protective efficacy in a mouse malaria model using a … . Vaccine. [Cited by 51 ] (11.99/year)
GOMES, C.P., B. SELMAN and H. KAUTZ, 1998. Boosting combinatorial search through randomization . Proceedings of the Fifteenth National Conference on …. [Cited by 237 ] (28.71/year)
GONZ?LEZ, J. and A. GONZ?LEZ, 1998. The potential of data value speculation to boost ILP . Proceedings of the 12th international conference on …. [Cited by 75 ] (9.09/year)
GOONETILLEKE, N.P., et al. , 2003. … of Bacille Calmette-Guerin Vaccine Using Mucosal Administration and Boosting with a Recombinant … . The Journal of Immunology. [Cited by 48 ] (14.75/year)
GROVE, A.J. and D. SCHUURMANS, 1998. Boosting in the limit: Maximizing the margin of learned ensembles . Proceedings of the Fifteenth National Conference on …. [Cited by 113 ] (13.69/year)
HANKE, T. and A. MCMICHAEL, 1999. Pre-clinical development of a multi-CTL epitope-based DNA prime MVA boost vaccine for AIDS. . Immunol Lett. [Cited by 61 ] (8.41/year)
HANKE, T., et al. , 1998. Enhancement of MHC class I-restricted peptide-specific T cell induction by a DNA prime/MVA boost … . Vaccine. [Cited by 140 ] (16.96/year)
HANKE, T., et al. , 1999. Effective Induction of Simian Immunodeficiency Virus-Specific Cytotoxic T Lymphocytes in Macaques by … . Journal of Virology. [Cited by 168 ] (23.16/year)
HARABAGIU, S., et al. , 2001. FALCON: Boosting Knowledge for Answer Engines . Proceedings of the 9 thText Retrieval Conference. [Cited by 152 ] (28.93/year)
HU, Y. and Q. YANG, 1996. DCD?disk caching disk: a new approach for boosting I/O performance . Proceedings of the 23rd annual international symposium on …. [Cited by 93 ] (9.07/year)
IRVINE, K.R., et al. , JNCI Cancer Spectrum. Enhancing efficacy of recombinant anticancer vaccines with prime/boost regimens that use two … . [Cited by 72 ] (?/year)
JONES, T.R., et al. , 2001. Protection of Aotus monkeys by Plasmodium falciparum EBA-175 region II DNA prime-protein boost … . J Infect Dis. [Cited by 60 ] (11.42/year)
KEARNS, M. and Y. MANSOUR, 1996. On the boosting ability of top-down decision tree learning algorithms . Proceedings of the twenty-eighth annual ACM symposium on …. [Cited by 89 ] (8.68/year)
KENT, S.J., et al. , 1998. … 1 Vaccine Regimen Consisting of Consecutive Priming with DNA and Boosting with Recombinant Fowlpox … . Journal of Virology. [Cited by 186 ] (22.53/year)
KIVINEN, J. and M.K. WARMUTH, 1999. Boosting as entropy projection . Proceedings of the twelfth annual conference on …. [Cited by 53 ] (7.31/year)
KUROSU, H., et al. , 2005. Suppression of Aging in Mice by the Hormone Klotho . Science. [Cited by 25 ] (19.92/year)
LEBANON, G. and J.D. LAFFERTY, 2001. Boosting and Maximum Likelihood for Exponential Models . cs.cmu.edu. [Cited by 58 ] (11.04/year)
LUGOSI, G. and N. VAYATIS, 2003. On the Bayes-risk consistency of regularized boosting methods . Annals of Statistics. [Cited by 63 ] (19.36/year)
MACLIN, R. and D. OPITZ, 1997. An empirical evaluation of bagging and boosting . Proceedings of the Fourteenth National Conference on …. [Cited by 100 ] (10.81/year)
MARGINEANTU, D.D. and T.G. DIETTERICH, 1997. Pruning adaptive boosting . Proc. 14th International Conference on Machine Learning. [Cited by 83 ] (8.97/year)
MARSHALL, J.L., et al. , 2000. Phase I Study in Advanced Cancer Patients of a Diversified Prime-and-Boost Vaccination Protocol … . Journal of Clinical Oncology. [Cited by 111 ] (17.75/year)
MASON, L., et al. , 2000. Boosting algorithms as gradient descent in function space . Neural Information Processing Systems. [Cited by 37 ] (5.92/year)
MEIR, R. and G. RATSCH, 2003. An introduction to boosting and leveraging . Advanced Lectures on Machine Learning, LNCS. [Cited by 80 ] (24.58/year)
MENG, W.S., et al. , 2001. a-Fetoprotein-specific Tumor Immunity Induced by Plasmid Prime-Adenovirus Boost Genetic Vaccination … . [Cited by 44 ] (8.37/year)
MENZIES, D., 1999. Interpretation of Repeated Tuberculin Tests Boosting, Conversion, and Reversion . [Cited by 96 ] (13.23/year)
OPELT, A., et al. , 2004. Weak hypotheses and boosting for generic object detection and recognition . Proc. ECCV. [Cited by 57 ] (25.28/year)
ORTALDO, J.R., et al. , 1984. A Species of Human Interferon That Lacks the Ability to Boost Human Natural Killer Activity . Proceedings of the National Academy of Sciences. [Cited by 32 ] (1.44/year)
PALMOWSKI, M.J., et al. , 2002. Competition Between CTL Narrows the Immune Response Induced by Prime-Boost Vaccination Protocols1 . The Journal of Immunology. [Cited by 58 ] (13.63/year)
PANCHOLI…, P., 2001. DNA prime/canarypox boost?based immunotherapy of chronic hepatitis B virus infection in a … . Hepatology. [Cited by 54 ] (10.28/year)
PETRI, W.A., et al. , 2001. Enhanced Immunogenicity of CD4+ T-Cell Responses and Protective Efficacy of a DNA-Modified Vaccinia … . Infection and Immunity. [Cited by 103 ] (19.60/year)
PLEBANSKI, M., et al. , 1998. Protection from Plasmodium berghei infection by priming and boosting T cells to a single class I- … . European Journal of Immunology. [Cited by 44 ] (5.33/year)
QUINLAN, J.R., 1996. Bagging, boosting, and C4. 5 . Proceedings of the Thirteenth National Conference on …. [Cited by 456 ] (44.47/year)
QUINLAN, J.R., 1996. Boosting first-order learning . Lecture Notes in Computer Science. [Cited by 45 ] (4.39/year)
RAPHAEL, S. and M.A. STOLL, 2000. Can Boosting Minority Car-ownership Rates Narrow Inter-racial Employment Gaps? . muse.jhu.edu. [Cited by 46 ] (7.35/year)
RATSCH, G., et al. , 2002. Constructing Boosting Algorithms from SVMs: An Application to One-Class Classification . IEEE Transactions on Pattern Analysis and Machine …. [Cited by 39 ] (9.17/year)
REUTERS, L.T.O. and J. BOOSTING, Design and Implementation. of a Multi-label Chinese Text Categorization System . ieeexplore.ieee.org. [not cited] (?/year)
RICHMOND, J.F.L., et al. , 1998. … Anti-Human Immunodeficiency Virus Type 1 Env Antibody Elicited by DNA Priming and Protein Boosting . Journal of Virology. [Cited by 59 ] (7.15/year)
SCHAPIRE, R.E. and Y. SINGER, 1999. Improved Boosting Algorithms Using Confidence-rated Predictions . Machine Learning. [Cited by 648 ] (89.32/year)
SCHAPIRE, R.E. and Y. SINGER, 2000. BoosTexter: A Boosting-based System for Text Categorization . Machine Learning. [Cited by 349 ] (55.80/year)
SCHAPIRE, R.E., 1997. Using output codes to boost multiclass learning problems . Machine Learning: Proceedings of the Fourteenth …. [Cited by 100 ] (10.81/year)
SCHAPIRE, R.E., 1999. Theoretical views of boosting . Computational Learning Theory: Fourth European Conference, …. [Cited by 64 ] (8.82/year)
SCHAPIRE, R.E., 1999. Theoretical views of boosting and applications . Tenth International Conference on Algorithmic Learning …. [Cited by 49 ] (6.75/year)
SCHAPIRE, R.E., 2002. The boosting approach to machine learning: An overview . MSRI Workshop on Nonlinear Estimation and Classification. [Cited by 210 ] (49.36/year)
SCHAPIRE, R.E., et al. , 1998. Boosting the Margin: A New Explanation for the Effectiveness of Voting Methods . The Annals of Statistics. [Cited by 779 ] (94.37/year)
SCHAPIRE, R.E., margin. A Brief Introduction to Boosting . [Cited by 253 ] (?/year)
SCHAPIRE, R.E., Y. SINGER and A. SINGHAL, 1998. Boosting and Rocchio applied to text filtering . Proceedings of the 21st annual international ACM SIGIR …. [Cited by 162 ] (19.63/year)
SCHEERLINCK, J.P., et al. , 2001. The immune response to a DNA vaccine can be modulated by co-delivery of cytokine genes using a DNA … . Vaccine. [Cited by 34 ] (6.47/year)
SCHNEIDER, J., et al. , … and complete protective efficacy of malaria DNA vaccination by boosting with modified vaccinia … . nature.com. [Cited by 314 ] (?/year)
SCHNEIDER, J., et al. , 1999. Induction of CD8+ T cells using heterologous prime-boost immunisation strategies. . Immunol Rev. [Cited by 92 ] (12.68/year)
SEDEGAH, M., et al. , 1998. Boosting with recombinant vaccinia increases immunogenicity and protective efficacy of malaria DNA … . [Cited by 130 ] (15.75/year)
SEDEGAH, M., et al. , 2000. … : Priming with Antigen and GM-CSF-Encoding Plasmid DNA and Boosting with Antigen-Expressing … . The Journal of Immunology. [Cited by 76 ] (12.15/year)
SHEDLOCK, D.J. and H. SHEN, Science's STKE. Requirement for CD4 T Cell Help in Generating Functional CD8 T Cell Memory . [Cited by 271 ] (?/year)
SHIPLEY, W.U., et al. , 1995. … results of a randomized comparative trial of high dose irradiation boosting with conformal protons … . Int J Radiat Oncol Biol Phys. [Cited by 92 ] (8.17/year)
SIEGRIST, C.A., et al. , 1998. Influence of maternal antibodies on vaccine responses: inhibition of antibody but not T cell … . European Journal of Immunology. [Cited by 68 ] (8.24/year)
SMITH, M.D., M. HOROWITZ and M.S. LAM, 1992. Efficient superscalar performance through boosting . Proceedings of the fifth international conference on …. [Cited by 83 ] (5.82/year)
SMITH, M.D., M.S. LAM and M.A. HOROWITZ, 1990. Boosting beyond static scheduling in a superscalar processor . Computer Architecture, 1990. Proceedings. 17th Annual …. [Cited by 104 ] (6.40/year)
SVMS, O., O.K. MACHINES and O. BOOSTING, 2003. Advances in Large Margin Classifiers?AJ Smola, PL Bartlett . IEEE TRANSACTIONS ON NEURAL NETWORKS. [not cited] (0/year)
TIEU, K. and P. VIOLA, 2004. Boosting Image Retrieval . International Journal of Computer Vision. [Cited by 180 ] (79.83/year)
TORRALBA, A., K.P. MURPHY and W.T. FREEMAN, Sharing features: efficient boosting procedures for multiclass object detection . ieeexplore.ieee.org. [Cited by 68 ] (?/year)
WEBB, G.I., 2000. MultiBoosting: A Technique for Combining Boosting and Wagging . Machine Learning. [Cited by 77 ] (12.31/year)
YANG, Z., et al. , 2003. Overcoming Immunity to a Viral Vaccine by DNA Priming before Vector Boosting . [Cited by 52 ] (15.98/year)