2,352
Views
23
CrossRef citations to date
0
Altmetric
Theory and Methods

Category-Adaptive Variable Screening for Ultra-High Dimensional Heterogeneous Categorical Data

, , &
Pages 747-760 | Received 26 Nov 2017, Accepted 03 Jan 2019, Published online: 22 Apr 2019

References

  • Bhattacharjee, A., Richards, W., Staunton, J., Li, C., Monti, S., Vasa, P., Ladd, C., Beheshti, J., Bueno R., Gillette, M., Loda, M., Weber, G., Mark, E., Lander, E., Wong, W., Johnson, B., Golub, T., Sugarbaker, D., and Meyerson, M. (2001), “Classifcation of Human Lung Carcinomas by mRNA Expression Profling Reveals Distinct Adenocarcinoma Subclasses,” Proceedings of the National Academy of Sciences of the United States of America, 98, 13790–13795. DOI: 10.1073/pnas.191502998.
  • Chang, J., Tang, C. Y., and Wu, Y. (2013), “Marginal Empirical Likelihood and Sure Independence Feature Screening,” The Annals of Statistics, 41, 2123–2148. DOI: 10.1214/13-AOS1139.
  • Chen, K. (2001a), “Parametric Models for Response-Biased Sampling,” Journal of the Royal Statistical Society, Series B, 63, 775–789. DOI: 10.1111/1467-9868.00312.
  • Chen, K. (2001b), “Generalized Case-Cohort Sampling,” Journal of the Royal Statistical Society, Series B, 63, 791–809. DOI: 10.1111/1467-9868.00313.
  • Chen, K., Lin, Y., Yao, Y., and Zhou, C. (2017), “Regression Analysis With Response-Biased Sampling,” Statistica Sinica, 27, 1699–1714.
  • Clemmensen, L., Hastie, T., Witten, D., and Ersboll, B. (2011), “Sparse Discriminant Analysis,” Technometrics, 53, 406–415. DOI: 10.1198/TECH.2011.08118.
  • Cui, H., Li, R., and Zhong, W. (2015), “Model-Free Feature Screening for Ultrahigh Dimensional Discriminant Analysis,” Journal of the American Statistical Association, 110, 630–641. DOI: 10.1080/01621459.2014.920256.
  • Fan, J., Feng, Y., and Song, R. (2011), “Nonparametric Independence Screening in Sparse Ultrahigh-Dimensional Additive Models,” Journal of the American Statistical Association, 106, 544–557. DOI: 10.1198/jasa.2011.tm09779.
  • Fan, J., and Lv, J. (2008), “Sure Independence Screening for Ultrahigh Dimensional Feature Space,” Journal of the Royal Statistical Society, Series B, 70, 849–911. DOI: 10.1111/j.1467-9868.2008.00674.x.
  • Fan, J., Samworth, R., and Wu, Y. (2009), “Ultrahigh Dimensional Feature Selection: Beyond the Linear Model,” The Journal of Machine Learning Research, 10, 2013–2038.
  • Fan, J., and Song, R. (2010), “Sure Independence Screening in Generalized Linear Models with NP-Dimensionality,” The Annals of Statistics, 38, 3567–3604. DOI: 10.1214/10-AOS798.
  • He, X., Wang, L., and Hong, H. G. (2013), “Quantile-Adaptive Model-Free Variable Screening for High-Dimensional Heterogeneous Data,” The Annals of Statistics, 41, 342–369. DOI: 10.1214/13-AOS1087.
  • Huang, D., Li, R., and Wang, H. (2014), “Feature Screening for Ultrahigh Dimensional Categorical Data with Applications,” Journal of Business & Economic Statistics, 32, 237–244. DOI: 10.1080/07350015.2013.863158.
  • Huang, C. Y., and Qin, J. (2011), “Nonparametric Estimation for Length-Biased and Right-Censored Data,” Biometrika, 98, 177–186. DOI: 10.1093/biomet/asq069.
  • Kim, J. P., Lu, W., Sit, T., and Ying, Z. (2013), “A Unified Approach to Semiparametric Transformation Models Under General Biased Sampling Schemes,” Journal of the American Statistical Association, 108, 217–227. DOI: 10.1080/01621459.2012.746073.
  • Kim, J. P., Sit, T., and Ying, Z. (2016), “Accelerated Failure Time Model under General Biased Sampling Scheme,” Biostatistics, 17, 576–588. DOI: 10.1093/biostatistics/kxw008.
  • Lawless, J. F., Wild, C. J., and Kalbfleisch, J. D. (1999), “Semiparametric Methods for Response-Selective and Missing Data Problems in Regression,” Journal of the Royal Statistical Society, Series B, 61, 413–438. DOI: 10.1111/1467-9868.00185.
  • Le Cun, Y., Boser, B., Denker, J., Henderson, D., Howard, R., Hubbard, W., and Jackel, L. (1990), “Handwritten Digit Recognition With A Back-Propogation Betwork in D. Touretzky, ed,” Advances in Neural Information Processing Systems, 2, 386–404.
  • Li, G., Peng, H., Zhang, J., and Zhu, L. (2012), “Robust Rank Correlation Based Screening,” The Annals of Statistics, 40, 1846–1877. DOI: 10.1214/12-AOS1024.
  • Li, R., Zhong, W., and Zhu, L. (2012), “Feature Screening via Distance Correlation Learning,” Journal of the American Statistical Association, 107, 1129–1139. DOI: 10.1080/01621459.2012.695654.
  • Lin, D. Y. (2000), “On Fitting Coxs Proportional Hazards Models to Survey Data,” Biometrika, 87, 37–47. DOI: 10.1093/biomet/87.1.37.
  • Luo, X., and Tsai, W. Y. (2009), “Nonparametric Estimation for Right-Censored Length-Biased Data: A Pseudo-Partial Likelihood Approach,” Biometrika, 96, 873–886. DOI: 10.1093/biomet/asp064.
  • Luo, X., Tsai, W. Y., and Xu, Q. (2009), “Pseudo-Partial Likelihood Estimators for the Cox Regression Model with Missing Covariates,” Biometrika, 96, 617–633. DOI: 10.1093/biomet/asp027.
  • Mai, Q., and Zou, H. (2013), “The Kolmogorov Filter for Variable Screening in High-Dimensional Binary Classification,” Biometrika, 100, 229–234. DOI: 10.1093/biomet/ass062.
  • Mai, Q., and Zou, H. (2015), “The Fused Kolmogorov Filter: A Nonparametric Model-Free Screening Method,” The Annals of Statistics, 43, 1471–1497. DOI: 10.1214/14-AOS1303.
  • Ning, J., Qin, J., and Shen, Y. (2010), “Nonparametric Tests for Right-Censored Data with Biased Sampling,” Journal of the Royal Statistical Society, Series B, 5, 609–630. DOI: 10.1111/j.1467-9868.2010.00742.x.
  • Pan, R., Wang, H., and Li, R. (2016), “Ultrahigh Dimensional Multi-Class Linear Discriminant Analysis by Pairwise Sure Independence Screening,” Journal of the American Statistical Association, 111, 169–179. DOI: 10.1080/01621459.2014.998760.
  • Pollard, D. (1984), Convergence of Stochastic Processes, New York: Springer-Verlag.
  • Prentice, R. L., and Pyke, R. (1979), “Logistic Disease Incidence Models With Case-Control Studies,” Biometrika, 66, 403–411. DOI: 10.1093/biomet/66.3.403.
  • Qin, J. (2017), Biased Sampling, Over-Identified Parameter Problems and Beyond, New York: Springer.
  • Scott, A. J., and Wild, C. J. (1986), “Fitting Logistic Models Under Case-Control or Choice Based Sampling,” Journal of the Royal Statistical Society, Series B, 48, 170–182. DOI: 10.1111/j.2517-6161.1986.tb01400.x.
  • Scott, A. J., and Wild, C. J. (1997), “Fitting Regression Models to Case-Control Data by Maximum Likelihood,” Biometrika, 84, 57–71. DOI: 10.1093/biomet/84.1.57.
  • Sun, Y., Chan, K. C. G., and Qin, J. (2018), “Simple and Fast Overidentified Rank Estimation for Right-Censored Length-Biased Data and Backward Recurrence Time,” Biometrics, 74, 77–85. DOI: 10.1111/biom.12727.
  • Tsai, W. Y. (2009), “Pseudo-Partial Likelihood for Proportional Hazards Models with Biased-Sampling Data,” Biometrika, 96, 601–615. DOI: 10.1093/biomet/asp026.
  • Xu, G., Sit, T., Wang, L., and Huang, C. Y. (2017), “Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling,” Journal of the American Statistical Association, 112, 1571–1586. DOI: 10.1080/01621459.2016.1222286.
  • Zhu, L. P., Li, L., Li, R., and Zhu, L. X. (2011), “Model-Free Feature Screening for Ultrahigh-Dimensional Data,” Journal of the American Statistical Association, 106, 1464–1475. DOI: 10.1198/jasa.2011.tm10563.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.