Views

CrossRef citations to date

Altmetric

Research Article

On statistical classification with incomplete covariates via filtering

Majid Mojirsheibania Department of Mathematics, California State University Northridge, Los Angeles, CA, USAView further author information

My-Nhi Nguyenb Department of Preventive Medicine, University of Southern California, Los Angeles, CA, USACorrespondence[email protected]

https://orcid.org/0000-0001-8535-9896 View further author information

Abstract

This article deals with the problem of classification when some of the covariates may have missing parts. Here, it is allowed for both the training sample as well as the new unclassified observation to have missing parts in the covariates. In fact, it is shown in Remark 3.3 that in classification the reconstruction/imputation of the missing part of a new unclassified observation (which is to be classified) can be counter-productive in terms of the error rates. Furthermore, unlike many of the results in the literature, where covariate fragments are usually assumed to be missing completely at random, we do not impose such assumptions here. Given the observed parts of the covariates, we construct a kernel-type classifier which is straightforward to implement. The proposed classifier is constructed based on d-dim covariate vectors that are obtained from the original covariates (by moving from the space $L^{2}$ to $ℓ_{2}$ ), where $d (< \infty)$ itself is a parameter that has to be estimated. To estimate various parameters, we employ an easy-to-implement data-splitting approach.

Keywords:

2010 Mathematics Subject Classifications:

Acknowledgments

This work was supported by the NSF under Grant DMS-1916161 of Majid Mojirsheibani.

Data availability statement

The Share Price Increase data set used in Section 4.2, and a description of it, is available at http://www.timeseriesclassification.com/dataset.php

Additionally, a copy of the ‘R’ codes used to carry out the analysis in Section 4.2 is posted on the GitHub repository at https://github.com/mynhinguyen/Statistical-classification-with-incomplete-covariates-via-filtering

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the National Science Foundation (NSF) under Grant DMS-1916161 of Majid Mojirsheibani.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

On statistical classification with incomplete covariates via filtering

Information for

Open access

Opportunities

Help and information

On statistical classification with incomplete covariates via filtering

Abstract

Acknowledgments

Data availability statement

Disclosure statement

Additional information

Funding

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature