Journal of the American Statistical Association Volume 119, 2024 - Issue 546

Theory and Methods

A Two-Sample Conditional Distribution Test Using Conformal Prediction and Weighted Rank Sum

Xiaoyu Hu (School of Mathematical Sciences, Center for Statistical Science, Peking University, Beijing, China)

Jing Lei (Department of Statistics and Data Science, Carnegie Mellon University, Pittsburgh, PA; corresponding author)

Pages 1136-1154 | Received 17 Nov 2021, Accepted 16 Jan 2023, Published online: 08 Mar 2023

https://doi.org/10.1080/01621459.2023.2177165

References

Andrews, D. W. (1997), “A Conditional Kolmogorov Test,” Econometrica: Journal of the Econometric Society, 65, 1097–1128. DOI: 10.2307/2171880.
Bai, J. (2003), “Testing Parametric Conditional Distributions of Dynamic Models,” Review of Economics and Statistics, 85, 531–549. DOI: 10.1162/003465303322369704.
Barber, R. F., Candès, E. J., Ramdas, A., and Tibshirani, R. J. (2019), “Predictive Inference with the Jackknife+,” arXiv preprint arXiv:1905.02928.
Bates, S., Candès, E., Lei, L., Romano, Y., and Sesia, M. (2021), “Testing for Outliers with Conformal p-values,” arXiv preprint arXiv:2104.08279.
Bickel, S., Brückner, M., and Scheffer, T. (2007), “Discriminative Learning for Differing Training and Test Distributions,” in Proceedings of the 24th International Conference on Machine Learning, pp. 81–88. DOI: 10.1145/1273496.1273507.
Bickel, S., Brückner, M., and Scheffer, T. (2009), “Discriminative Learning Under Covariate Shift,” Journal of Machine Learning Research, 10, 2137–2155.
Bühlmann, P. (2020), “Invariance, Causality and Robustness,” Statistical Science, 35, 404–426. DOI: 10.1214/19-STS721.
Cheng, K. F., and Chu, C.-K. (2004), “Semiparametric Density Estimation Under a Two-Sample Density Ratio Model,” Bernoulli, 10, 583–604. DOI: 10.3150/bj/1093265631.
Chernozhukov, V., Wüthrich, K., and Zhu, Y. (2021), “Distributional Conformal Prediction,” Proceedings of the National Academy of Sciences, 118, e2107794118. DOI: 10.1073/pnas.2107794118.
Corradi, V., and Swanson, N. R. (2006), “Bootstrap Conditional Distribution Tests in the Presence of Dynamic Misspecification,” Journal of Econometrics, 133, 779–806. DOI: 10.1016/j.jeconom.2005.06.013.
Csurka, G. (2017), Domain Adaptation in Computer Vision Applications (Vol. 2), Cham: Springer.
DiCiccio, C. J., DiCiccio, T. J., and Romano, J. P. (2020), “Exact Tests via Multiple Data Splitting,” Statistics & Probability Letters, 166, 108865. DOI: 10.1016/j.spl.2020.108865.
Dua, D., and Graff, C. (2019), “UCI Machine Learning Repository.”
Fan, Y., Li, Q., and Min, I. (2006), “A Nonparametric Bootstrap Test of Conditional Distributions,” Econometric Theory, 22, 587–613. DOI: 10.1017/S0266466606060294.
Fedorova, V., Gammerman, A., Nouretdinov, I., and Vovk, V. (2012), “Plug-in Martingales for Testing Exchangeability On-line,” arXiv preprint arXiv:1204.3251.
Gretton, A., Smola, A., Huang, J., Schmittfull, M., Borgwardt, K., and Schölkopf, B. (2009), “Covariate Shift by Kernel Mean Matching,” in Dataset Shift in Machine Learning, eds. J. Quiñonero-Candela, M. Sugiyama, A. Schwaighofer, and N. D. Lawrence, pp. 131–160, Cambridge, MA: MIT Press.
Guan, L., and Tibshirani, R. (2019), “Prediction and Outlier Detection in Classification Problems,” arXiv preprint arXiv:1905.04396.
Hall, P., and Hart, J. D. (1990), “Bootstrap Test for Difference between Means in Nonparametric Regression,” Journal of the American Statistical Association, 85, 1039–1049. DOI: 10.1080/01621459.1990.10474974.
Härdle, W., and Marron, J. S. (1990), “Semiparametric Comparison of Regression Curves,” The Annals of Statistics, 18, 63–89. DOI: 10.1214/aos/1176347493.
Kanamori, T., Hido, S., and Sugiyama, M. (2009), “A Least-Squares Approach to Direct Importance Estimation,” Journal of Machine Learning Research, 10, 1391–1445.
Kim, B., Xu, C., and Barber, R. F. (2020), “Predictive Inference is Free with the Jackknife+-after-Bootstrap,” arXiv preprint arXiv:2002.09025.
Kivaranovic, D., Johnson, K. D., and Leeb, H. (2020), “Adaptive, Distribution-Free Prediction Intervals for Deep Networks,” in International Conference on Artificial Intelligence and Statistics, PMLR, pp. 4346–4356.
Kouw, W. M., and Loog, M. (2018), “An Introduction to Domain Adaptation and Transfer Learning,” arXiv preprint arXiv:1812.11806.
Kuchibhotla, A. K., and Ramdas, A. K. (2019), “Nested Conformal Prediction and the Generalized Jackknife+,” arXiv preprint arXiv:1910.10562.
Kulasekera, K. (1995), “Comparison of Regression Curves Using Quasi-Residuals,” Journal of the American Statistical Association, 90, 1085–1093. DOI: 10.1080/01621459.1995.10476611.
Kulasekera, K., and Wang, J. (1997), “Smoothing Parameter Selection for Power Optimality in Testing of Regression Curves,” Journal of the American Statistical Association, 92, 500–511. DOI: 10.1080/01621459.1997.10474003.
Lei, J., G’Sell, M., Rinaldo, A., Tibshirani, R. J., and Wasserman, L. (2018), “Distribution-Free Predictive Inference for Regression,” Journal of the American Statistical Association, 113, 1094–1111. DOI: 10.1080/01621459.2017.1307116.
Lei, J., Rinaldo, A., and Wasserman, L. (2015), “A Conformal Prediction Approach to Explore Functional Data,” Annals of Mathematics and Artificial Intelligence, 74, 29–43. DOI: 10.1007/s10472-013-9366-6.
Lei, J., Robins, J., and Wasserman, L. (2013), “Distribution-Free Prediction Sets,” Journal of the American Statistical Association, 108, 278–287. DOI: 10.1080/01621459.2012.751873.
Lei, J., and Wasserman, L. (2014), “Distribution-Free Prediction Bands for Non-parametric Regression,” Journal of the Royal Statistical Society, Series B, 76, 71–96. DOI: 10.1111/rssb.12021.
Li, S., Sesia, M., Romano, Y., Candès, E., and Sabatti, C. (2021), “Searching for Robust Associations with a Multi-Environment Knockoff Filter,” Biometrika, 109, 611–629. DOI: 10.1093/biomet/asab055.
Meinshausen, N., Meier, L., and Bühlmann, P. (2009), “P-values for High-Dimensional Regression,” Journal of the American Statistical Association, 104, 1671–1681. DOI: 10.1198/jasa.2009.tm08647.
Neumeyer, N., and Dette, H. (2003), “Nonparametric Comparison of Regression Curves: An Empirical Process Approach,” The Annals of Statistics, 31, 880–920. DOI: 10.1214/aos/1056562466.
Pan, S. J., and Yang, Q. (2009), “A Survey on Transfer Learning,” IEEE Transactions on Knowledge and Data Engineering, 22, 1345–1359. DOI: 10.1109/TKDE.2009.191.
Pardo-Fernández, J. C., Jiménez-Gamero, M. D., and El Ghouch, A. (2015), “Tests for the Equality of Conditional Variance Functions in Nonparametric Regression,” Electronic Journal of Statistics, 9, 1826–1851. DOI: 10.1214/15-EJS1058.
Peters, J., Bühlmann, P., and Meinshausen, N. (2016), “Causal Inference by Using Invariant Prediction: Identification and Confidence Intervals,” Journal of the Royal Statistical Society, Series B, 78, 947–1012. DOI: 10.1111/rssb.12167.
Qin, J. (1998), “Inferences for Case-Control and Semiparametric Two-Sample Density Ratio Models,” Biometrika, 85, 619–630. DOI: 10.1093/biomet/85.3.619.
Rinaldo, A., Wasserman, L., and G’Sell, M. (2019), “Bootstrapping and Sample Splitting for High-Dimensional, Assumption-Lean Inference,” The Annals of Statistics, 47, 3438–3469. DOI: 10.1214/18-AOS1784.
Romano, Y., Patterson, E., and Candès, E. (2019), “Conformalized Quantile Regression,” in Advances in Neural Information Processing Systems (Vol. 32).
Sesia, M., and Romano, Y. (2021), “Conformal Prediction using Conditional Histograms,” in Advances in Neural Information Processing Systems (Vol. 34).
Shimodaira, H. (2000), “Improving Predictive Inference Under Covariate Shift by Weighting the Log-Likelihood Function,” Journal of Statistical Planning and Inference, 90, 227–244. DOI: 10.1016/S0378-3758(00)00115-4.
Sollich, P. (2000), “Probabilistic Methods for Support Vector Machines,” in Advances in Neural Information Processing Systems, pp. 349–355.
Sugiyama, M., and Kawanabe, M. (2012), Machine Learning in Non-stationary Environments: Introduction to Covariate Shift Adaptation, Cambridge, MA: MIT Press.
Sugiyama, M., Krauledat, M., and Müller, K.-R. (2007), “Covariate Shift Adaptation by Importance Weighted Cross Validation,” Journal of Machine Learning Research, 8, 985–1005.
Sugiyama, M., Nakajima, S., Kashima, H., Buenau, P. V., and Kawanabe, M. (2008), “Direct Importance Estimation with Model Selection and its Application to Covariate Shift Adaptation,” in Advances in Neural Information Processing Systems, pp. 1433–1440.
Tibshirani, R. J., Barber, R. F., Candès, E., and Ramdas, A. (2019), “Conformal Prediction Under Covariate Shift,” in Advances in Neural Information Processing Systems, pp. 2526–2536.
Tsuboi, Y., Kashima, H., Hido, S., Bickel, S., and Sugiyama, M. (2009), “Direct Density Ratio Estimation for Large-Scale Covariate Shift Adaptation,” Journal of Information Processing, 17, 138–155. DOI: 10.2197/ipsjjip.17.138.
Vovk, V. (2019), “Testing Randomness,” arXiv preprint arXiv:1906.09256.
Vovk, V. (2020), “Testing for Concept Shift Online,” arXiv preprint arXiv:2012.14246.
Vovk, V., Gammerman, A., and Shafer, G. (2005), Algorithmic Learning in a Random World, New York: Springer.
Vovk, V., Nouretdinov, I., and Gammerman, A. (2003), “Testing Exchangeability On-line,” in Proceedings of the 20th International Conference on Machine Learning, pp. 768–775.
Vovk, V., Petej, I., Nouretdinov, I., Ahlberg, E., Carlsson, L., and Gammerman, A. (2021), “Retrain or Not Retrain: Conformal Test Martingales for Change-Point Detection,” in Conformal and Probabilistic Prediction and Applications, pp. 191–210, PMLR.
Wasserman, L., and Roeder, K. (2009), “High Dimensional Variable Selection,” The Annals of Statistics, 37, 2178–2201. DOI: 10.1214/08-aos646.
Zheng, J. X. (2000), “A Consistent Test of Conditional Parametric Distributions,” Econometric Theory, 16, 667–691. DOI: 10.1017/S026646660016503X.
Zhu, J., and Hastie, T. (2005), “Kernel Logistic Regression and the Import Vector Machine,” Journal of Computational and Graphical Statistics, 14, 185–205. DOI: 10.1198/106186005X25619.
