References
- Audibert, J.-Y., and Tsybakov, A. B. (2007), “Fast Learning Rates for Plug-in Classifiers,” The Annals of Statistics, 35, 608–633.
- Bartlett, P. L., and Wegkamp, M. H. (2008), “Classification with a Reject Option Using a Hinge Loss,” The Journal of Machine Learning Research, 9, 1823–1840.
- Chow, C. K. (1970), “On Optimum Recognition Error and Reject Tradeoff,” IEEE Transactions on Information Theory, 16, 41–46.
- del Coz, J. J., Diez, J., and Bahamonde, A. (2009), “Learning Nondeterministic Classifiers,” The Journal of Machine Learning Research, 10, 2273–2293.
- Denis, C., and Hebiri, M. (2015), “Consistency of Plug-in Confidence Sets for Classification in Semi-supervised Learning,” arXiv:1507.07235.
- Devroye, L. (1978), “The Uniform Convergence of Nearest Neighbor Regression Function Estimators and Their Application in Optimization,” IEEE Transactions on Information Theory, 24, 142–151.
- Grycko, E. (1993), “Classification With Set-Valued Decision Functions,” in Information and Classification, Studies in Classification, Data Analysis and Knowledge Organization, eds. O. Opitz, B. Lausen, and R. Klar, Berlin, Heidelberg: Springer, pp. 218–224.
- Herbei, R., and Wegkamp, M. H. (2006), “Classification With Reject Option,” Canadian Journal of Statistics, 34, 709–721.
- Le Cun, Y., Boser, B., Denker, J. S., Henderson, D., Howard, R. E., Hubbard, W., and Jackel, L. D. (1990), “Handwritten Digit Recognition with a Back-pRopagation Network,” in Advances in Neural Information Processing Systems 2. Proceedings of the 1989 Conference, ed. D. S. Touretzky, San Francisco, CA: Morgan Kaufmann, pp. 396–404.
- Lei, J. (2014), “Classification With Confidence,” Biometrika, 101, 755–769.
- Lei, J., Rinaldo, A., and Wasserman, L. (2014), “A Conformal Prediction Approach to Explore Functional Data,” Annals of Mathematics and Artificial Intelligence, 74, 29–43.
- Lei, J., Robins, J., and Wasserman, L. (2013), “Distribution Free Prediction Set,” Journal of the American Statistical Association, 108, 278–287.
- Lei, J., and Wasserman, L. (2014), “Distribution Free Prediction Bands for Nonparametric Regression,” Journal of the Royal Statistical Society, Series B, 76, 71–96.
- Lichman, M. (2013), “UCI Machine Learning Repository,” available at http://archive.ics.uci.edu/ml
- Papadopoulos, H., Proedrou, K., Vovk, V., and Gammerman, A. (2002), “Inductive Confidence Machines for Regression,” in Machine Learning: ECML 2002, Springer, pp. 345–356.
- Ramaswamy, H. G., Tewari, A., and Agarwal, S. (2018), “Consistent Algorithms for Multiclass Classification with a Reject Option,” Electronic Journal of Statistics, 12, 530–554.
- Shafer, G., and Vovk, V. (2008), “A Tutorial on Conformal Prediction,” Journal of Machine Learning Research, 9, 371–421.
- Silverman, B. W. (1986), Density Estimation for Statistics and Data Analysis (Vol. 26), Boca Raton, FL: CRC press.
- Stone, C. (1982), “Optimal Global Rates of Convergence for Nonparametric Regression,” The Annals of Statistics, 10, 1040–1053.
- Tsoumakas, G., and Katakis, I. (2007), “Multi-label Classification: An Overview,” International Journal of Data Warehousing & Mining, 3, 1–13.
- Tsybakov, A. B. (2009), Introduction to Nonparametric Estimation, New York: Springer.
- van de Geer, S. A. (2008), “High-Dimensional Generalized Linear Models and the Lasso,” The Annals of Statistics, 36, 614–645.
- Vovk, V. (2013), “Conditional Validity of Inductive Conformal Predictors,” Machine Learning, 92, 349–376.
- Vovk, V., Fedorova, V., Nouretdinov, I., and Gammerman, A. (2016), “Criteria of Efficiency for Conformal Prediction,” in Proceedings of COPA 2016 (Fifth Symposium on Conformal and Probabilistic Prediction with Applications), pp. 23–39.
- Vovk, V., Gammerman, A., and Shafer, G. (2005), Algorithmic Learning in a Random World, New York: Springer.
- Vovk, V., Petej, I., and Fedorova, V. (2014), “From Conformal to Probabilistic Prediction,” in COPA 2014 Proceedings, Artificial Intelligence Applications and Innovations, pp. 221–230.
- Yuan, M., and Wegkamp, M. (2010), “Classification Methods with Reject Option Based on Convex Risk Minimization,” Journal of Machine Learning Research, 11, 111–130.
- Zhang, M. L., and Zhou, Z. H. (2007), “Ml-knn: A Lazy Learning Approach to Multi-label Learning,” Pattern Recognition, 40, 2038–2048.