Journal of the American Statistical Association Volume 116, 2021 - Issue 534

Theory and Methods Special Issue on Precision Medicine and Individualized Policy Discovery, Part II

More Efficient Policy Learning via Optimal Retargeting

Nathan Kallus, School of Operations Research and Information Engineering and Cornell Tech, Cornell University, New York, NY

https://orcid.org/0000-0002-2757-1570

Pages 646-658 | Received 19 Jun 2019, Accepted 23 Jun 2020, Published online: 03 Aug 2020

https://doi.org/10.1080/01621459.2020.1788948

References

Athey, S., and Wager, S. (2017), “Efficient Policy Learning,” arXiv no. 1702.02896.
Behaghel, L., Crépon, B., and Gurgand, M. (2014), “Private and Public Provision of Counseling to Job Seekers: Evidence From a Large Controlled Experiment,” American Economic Journal: Applied Economics, 6, 142–174. DOI: https://doi.org/10.1257/app.6.4.142.
Bennett, A., and Kallus, N. (2020), “Efficient Policy Learning From Surrogate-Loss Classification Reductions,” in Proceedings of the 37th International Conference on Machine Learning.
Bertsimas, D., Kallus, N., Weinstein, A. M., and Zhuo, Y. D. (2017), “Personalized Diabetes Management Using Electronic Medical Records,” Diabetes Care, 40, 210–217. DOI: https://doi.org/10.2337/dc16-0826.
Beygelzimer, A., and Langford, J. (2009), “The Offset Tree for Learning With Partial Labels,” in Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 129–138. DOI: https://doi.org/10.1145/1557019.1557040.
Bickel, P., Klaassen, C., Ritov, Y., and Wellner, J. (1993), Efficient and Adaptive Estimation for Semiparametric Models, New York: Springer.
Chen, G., Zeng, D., and Kosorok, M. R. (2016), “Personalized Dose Finding Using Outcome Weighted Learning,” Journal of the American Statistical Association, 111, 1509–1521. DOI: https://doi.org/10.1080/01621459.2016.1148611.
Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., and Robins, J. (2018), “Double/Debiased Machine Learning for Treatment and Structural Parameters,” The Econometrics Journal, 21, C1–C68. DOI: https://doi.org/10.1111/ectj.12097.
Cochran, W. G., and Rubin, D. B. (1973), “Controlling Bias in Observational Studies: A Review,” Sankhyā: The Indian Journal of Statistics, Series A, 35, 417–446.
Crump, R., Hotz, V. J., Imbens, G., and Mitnik, O. (2006), “Moving the Goalposts: Addressing Limited Overlap in the Estimation of Average Treatment Effects by Changing the Estimand,” Working Paper 330, National Bureau of Economic Research.
D’Amour, A., Ding, P., Feller, A., Lei, L., and Sekhon, J. (2017), “Overlap in Observational Studies With High-Dimensional Covariates,” arXiv no. 1711.02582.
Dehejia, R. H., and Wahba, S. (1999), “Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs,” Journal of the American Statistical Association, 94, 1053–1062. DOI: https://doi.org/10.1080/01621459.1999.10473858.
Dudík, M., Langford, J., and Li, L. (2011), “Doubly Robust Policy Evaluation and Learning,” in Proceedings of the 28th International Conference on Machine Learning, pp. 1097–1104.
Hahn, J. (1998), “On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects,” Econometrica, 66, 315–331. DOI: https://doi.org/10.2307/2998560.
Heckman, J. J., Ichimura, H., and Todd, P. E. (1997), “Matching as an Econometric Evaluation Estimator: Evidence From Evaluating a Job Training Programme,” The Review of Economic Studies, 64, 605–654. DOI: https://doi.org/10.2307/2971733.
Hirano, K., Imbens, G. W., and Ridder, G. (2003), “Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score,” Econometrica, 71, 1161–1189. DOI: https://doi.org/10.1111/1468-0262.00442.
Hirano, K., and Porter, J. R. (2009), “Asymptotics for Statistical Treatment Rules,” Econometrica, 77, 1683–1701.
Iacus, S. M., King, G., and Porro, G. (2011), “Multivariate Matching Methods That Are Monotonic Imbalance Bounding,” Journal of the American Statistical Association, 106, 345–361. DOI: https://doi.org/10.1198/jasa.2011.tm09599.
Ionides, E. L. (2008), “Truncated Importance Sampling,” Journal of Computational and Graphical Statistics, 17, 295–311. DOI: https://doi.org/10.1198/106186008X320456.
Kallus, N. (2016), “Generalized Optimal Matching Methods for Causal Inference,” arXiv no. 1612.08321.
——— (2017), “Recursive Partitioning for Personalization Using Observational Data,” in Proceedings of the 34th International Conference on Machine Learning, pp. 1789–1798.
——— (2018), “Balanced Policy Evaluation and Learning,” in Advances in Neural Information Processing Systems, pp. 8895–8906.
Kallus, N., and Zhou, A. (2018a), “Confounding-Robust Policy Improvement,” in Advances in Neural Information Processing Systems, pp. 9269–9279.
Kallus, N., and Zhou, A. (2018b), “Policy Evaluation and Optimization With Continuous Treatments,” in International Conference on Artificial Intelligence and Statistics, pp. 1243–1251.
Kallus, N., and Zhou, A. (2019), “Assessing Disparate Impacts of Personalized Interventions: Identifiability and Bounds,” arXiv no. 1906.01552.
Kitagawa, T., and Tetenov, A. (2018), “Who Should Be Treated? Empirical Welfare Maximization Methods for Treatment Choice,” Econometrica, 86, 591–616. DOI: https://doi.org/10.3982/ECTA13288.
Kosorok, M. R., and Laber, E. B. (2019), “Precision Medicine,” Annual Review of Statistics and Its Application, 6, 263–286. DOI: https://doi.org/10.1146/annurev-statistics-030718-105251.
Kube, A., Das, S., and Fowler, P. J. (2019), “Allocating Interventions Based on Predicted Outcomes: A Case Study on Homelessness Services,” in Proceedings of the AAAI Conference on Artificial Intelligence. DOI: https://doi.org/10.1609/aaai.v33i01.3301622.
Laber, E. B., Lizotte, D. J., Qian, M., Pelham, W. E., and Murphy, S. A. (2014), “Dynamic Treatment Regimes: Technical Challenges and Applications,” Electronic Journal of Statistics, 8, 1225. DOI: https://doi.org/10.1214/14-ejs920.
LaLonde, R. J. (1986), “Evaluating the Econometric Evaluations of Training Programs With Experimental Data,” The American Economic Review, 76, 604–620.
Li, F., Morgan, K. L., and Zaslavsky, A. M. (2018), “Balancing Covariates via Propensity Score Weighting,” Journal of the American Statistical Association, 113, 390–400. DOI: https://doi.org/10.1080/01621459.2016.1260466.
Li, L., Chu, W., Langford, J., and Wang, X. (2011), “Unbiased Offline Evaluation of Contextual-Bandit-Based News Article Recommendation Algorithms,” in Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 297–306. DOI: https://doi.org/10.1145/1935826.1935878.
Mandel, T., Liu, Y.-E., Levine, S., Brunskill, E., and Popovic, Z. (2014), “Offline Policy Evaluation Across Representations With Applications to Educational Games,” in Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems, pp. 1077–1084.
Pollard, D. (1990), “Empirical Processes: Theory and Applications,” in NSF-CBMS Regional Conference Series in Probability and Statistics.
Qian, M., and Murphy, S. A. (2011), “Performance Guarantees for Individualized Treatment Rules,” The Annals of Statistics, 39, 1180. DOI: https://doi.org/10.1214/10-AOS864.
Robins, J. M., Rotnitzky, A., and Zhao, L. P. (1994), “Estimation of Regression Coefficients When Some Regressors Are Not Always Observed,” Journal of the American Statistical Association, 89, 846–866. DOI: https://doi.org/10.1080/01621459.1994.10476818.
Rubin, D. B. (1980), “Comments on ‘Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment’,” Journal of the American Statistical Association, 75, 591–593. DOI: https://doi.org/10.2307/2287653.
Rubin, D. B. (2010), “On the Limitations of Comparative Effectiveness Research,” Statistics in Medicine, 29, 1991–1995.
Santacatterina, M., and Bottai, M. (2018), “Optimal Probability Weights for Inference With Constrained Precision,” Journal of the American Statistical Association, 113, 983–991. DOI: https://doi.org/10.1080/01621459.2017.1375932.
Smith, J. A., and Todd, P. E. (2005), “Does Matching Overcome Lalonde’s Critique of Nonexperimental Estimators?,” Journal of Econometrics, 125, 305–353.
Stoye, J. (2009), “Minimax Regret Treatment Choice With Finite Samples,” Journal of Econometrics, 151, 70–81. DOI: https://doi.org/10.1016/j.jeconom.2009.02.013.
Swaminathan, A., and Joachims, T. (2015a), “Counterfactual Risk Minimization: Learning From Logged Bandit Feedback,” in International Conference on Machine Learning, pp. 814–823.
Swaminathan, A., and Joachims, T. (2015b), “The Self-Normalized Estimator for Counterfactual Learning,” in Advances in Neural Information Processing Systems, pp. 3231–3239.
Tsiatis, A. (2007), Semiparametric Theory and Missing Data, New York: Springer.
Van der Vaart, A. W. (1998), Asymptotic Statistics, New York: Cambridge University Press.
Vapnik, V. (2000), The Nature of Statistical Learning Theory, New York: Springer.
Zhao, Y., Zeng, D., Rush, A. J., and Kosorok, M. R. (2012), “Estimating Individualized Treatment Rules Using Outcome Weighted Learning,” Journal of the American Statistical Association, 107, 1106–1118. DOI: https://doi.org/10.1080/01621459.2012.695674.
Zhao, Y.-Q., Zeng, D., Tangen, C. M., and Leblanc, M. L. (2019), “Robustifying Trial-Derived Optimal Treatment Rules for a Target Population,” Electronic Journal of Statistics, 13, 1717–1743. DOI: https://doi.org/10.1214/19-EJS1540.
Zhou, Z., Athey, S., and Wager, S. (2018), “Offline Multi-Action Policy Learning: Generalization and Optimization,” arXiv no. 1810.04778.
