Boosting for Correlated Binary Classification

Adeniyi J. Adewale Adeniyi J. Adewale is Biometrician, Merck Research Laboratories, 351 N. Sumneytown Pike, North Wales, PA 19454. Irina Dinu is Assistant Professor, Yutaka Yasui is Professor , Department of Public Health Sciences, School of Public Health, University of Alberta, 13-103 Clinical Sciences Building, Edmonton, Alberta T6G 2G3, Canada.

Irina Dinu Adeniyi J. Adewale is Biometrician, Merck Research Laboratories, 351 N. Sumneytown Pike, North Wales, PA 19454. Irina Dinu is Assistant Professor, Yutaka Yasui is Professor , Department of Public Health Sciences, School of Public Health, University of Alberta, 13-103 Clinical Sciences Building, Edmonton, Alberta T6G 2G3, Canada.

Yutaka Yasui Adeniyi J. Adewale is Biometrician, Merck Research Laboratories, 351 N. Sumneytown Pike, North Wales, PA 19454. Irina Dinu is Assistant Professor, Yutaka Yasui is Professor , Department of Public Health Sciences, School of Public Health, University of Alberta, 13-103 Clinical Sciences Building, Edmonton, Alberta T6G 2G3, Canada.

Abstract

Boosting is a successful method for dealing with problems of high-dimensional classification of independent data. However, existing variants do not address the correlations in the context of longitudinal or cluster study-designs with measurements collected across two or more time points or in clusters. This article presents two new variants of boosting with a focus on high-dimensional classification problems with matched-pair binary responses or, more generally, any correlated binary responses. The first method is based on the generic functional gradient descent algorithm and the second method is based on a direct likelihood optimization approach. The performance and the computational requirements of the algorithms were evaluated using simulations. Whereas the performance of the two methods is similar, the computational efficiency of the generic-functional-gradient-descent-based algorithm far exceeds that of the direct-likelihood-optimization-based algorithm. The former method is illustrated using data on gene expression changes in de novo and relapsed childhood acute lymphoblastic leukemia. Computer code implementing the algorithms and the relevant dataset are available online as supplemental materials.

Keywords: :

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Boosting for Correlated Binary Classification

Related Research Data

Information for

Open access

Opportunities

Help and information

Boosting for Correlated Binary Classification

Abstract

Reprints and Corporate Permissions

Academic Permissions

Related Research Data

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature