References
- Akaike, H. 1973. Information theory and an extension of the maximum likelihood principle. In 2nd International Symposium on Information Theory, eds. B.N. Petrov and F. Csaki, 267–81. Budapest: Akademia Kiado. IEEE Trans. on Automatic Control. 19, 716–23.
- Bai, D., C. R. Rao, and Y. Wu. 1999. Model selection with data-oriented penalty. Journal of Statistical Planning and Inference 77 (1):103–17. doi:10.1016/S0378-3758(98)00168-2.
- Berkson, J. 1955. Maximum likelihood and minimum χ2 estimates of the logistic function. Journal of the American Statistical Association 50:130–62.
- Cox, D. R., and E. J. Snell. 1968. A general definition of residuals. Journal of the Royal Statistical Society: Series B 30:248–75.
- Fan, J., and R. Li. 2001. Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 96 (456):1348–60. doi:10.1198/016214501753382273.
- First, D. 1993. Bias reduction of maximum likelihood estimates. Biometrika 80:27–38.
- Horn, R. A. H., and C. R. Johnson. 1985. Matrix Analysis, 2nd ed. Cambridge: Cambridge University Press.
- Lv, J., and J. S. Liu. 2014. Model selection principles in misspecified models. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 76 (1):141–67. doi:10.1111/rssb.12023.
- Maritines, J. M. 2002. Minimization of discontinuous cost functions by smoothing. Acta Applicandae Mathematicae 71:245–60.
- McCullagh, P., and J. A. Nelder. 1989. Generalized linear models. 2nd ed. London: Chapman and Hall. doi:10.18637/jss.v083.i02.
- R Core Team. 2019. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/.
- Schwarz, G. 1978. Estimating the dimension of a model. Annals of Statistics 6:461–4.
- Su, X., J. Fan, R. A. Levine, M. E. Nunn, and C. L. Tsai. 2018. Sparse estimation of generalized linear models (GLM) via approximated information criteria. Statistica Sinica 28:1561–81.
- Takeuchi, K. 1976. Distribution of an information statistic and the criterion for the optimal model. Mathematical Science 153:12–18. (In Japanese).
- Tibshirani, R. 1996. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B 58:267–88.
- Qian, G., and Y. Wu. 2006. Strong limit theorems on model selection in generalized linear regression with binomial responses. Statistica Sinica 16:1335–65.
- White, H. 1982. Maximum likelihood estimation misspecified models. Econometrica 50 (1):1–25. doi:10.2307/1912526.
- Vuong, Q. H. 1989. Likelihood ratio tests for model selection and non-nested hypothesis. Econometrica 57 (2):307–33. doi:10.2307/1912557.
- Zhang, C.-H. 2010. Nearly unbiased variable selection under minimax concave penalty. Annals of Statistics 38:894–942.
- Zou, H., and R. Li. 2008. One-step sparse estimates in nonconcave penalized likelihood models. Annals of Statistics 36 (4):1509–33. doi:10.1214/009053607000000802.