1,596
Views
8
CrossRef citations to date
0
Altmetric
Theory and Methods Special Issue on Precision Medicine and Individualized Policy Discovery, Part II

More Efficient Policy Learning via Optimal Retargeting

ORCID Icon
Pages 646-658 | Received 19 Jun 2019, Accepted 23 Jun 2020, Published online: 03 Aug 2020

References

  • Athey, S., and Wager, S. (2017), “Efficient Policy Learning,” arXiv no. 1702.02896.
  • Behaghel, L., Crépon, B., and Gurgand, M. (2014), “Private and Public Provision of Counseling to Job Seekers: Evidence From a Large Controlled Experiment,” American Economic Journal: Applied Economics, 6, 142–174. DOI: https://doi.org/10.1257/app.6.4.142.
  • Bennett, A., and Kallus, N. (2020), “Efficient Policy Learning From Surrogate-Loss Classification Reductions,” in Proceedings of the 34th International Conference on Machine Learning.
  • Bertsimas, D., Kallus, N., Weinstein, A. M., and Zhuo, Y. D. (2017), “Personalized Diabetes Management Using Electronic Medical Records,” Diabetes Care, 40, 210–217. DOI: https://doi.org/10.2337/dc16-0826.
  • Beygelzimer, A., and Langford, J. (2009), “The Offset Tree for Learning With Partial Labels,” in Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 129–138. DOI: https://doi.org/10.1145/1557019.1557040.
  • Bickel, P., Klassen, C., Ritov, Y., and Wellner, J. (1993), Efficient and Adaptive Estimation for Semiparametric Models, New York: Springer.
  • Chen, G., Zeng, D., and Kosorok, M. R. (2016), “Personalized Dose Finding Using Outcome Weighted Learning,” Journal of the American Statistical Association, 111, 1509–1521. DOI: https://doi.org/10.1080/01621459.2016.1148611.
  • Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., and Robins, J. (2018), “Double/Debiased Machine Learning for Treatment and Structural Parameters,” The Econometrics Journal, 21, C1–C68. DOI: https://doi.org/10.1111/ectj.12097.
  • Cochran, W. G., and Rubin, D. B. (1973), “Controlling Bias in Observational Studies: A Review,” Sankhyā: The Indian Journal of Statistics, Series A, 35, 417–446.
  • Crump, R., Hotz, V. J., Imbens, G., and Mitnik, O. (2006), “Moving the Goalposts: Addressing Limited Overlap in the Estimation of Average Treatment Effects by Changing the Estimand,” Working Paper 330, National Bureau of Economic Research.
  • D’Amour, A., Ding, P., Feller, A., Lei, L., and Sekhon, J. (2017), “Overlap in Observational Studies With High-Dimensional Covariates,” arXiv no. 1711.02582.
  • Dehejia, R. H., and Wahba, S. (1999), “Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs,” Journal of the American Statistical Association, 94, 1053–1062. DOI: https://doi.org/10.1080/01621459.1999.10473858.
  • Dudík, M., Langford, J., and Li, L. (2011), “Doubly Robust Policy Evaluation and Learning,” in Proceedings of the 28th International Conference on International Conference on Machine Learning, pp. 1097–1104.
  • Hahn, J. (1998), “On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects,” Econometrica, 66, 315–331. DOI: https://doi.org/10.2307/2998560.
  • Heckman, J. J., Ichimura, H., and Todd, P. E. (1997), “Matching as an Econometric Evaluation Estimator: Evidence From Evaluating a Job Training Programme,” The Review of Economic Studies, 64, 605–654. DOI: https://doi.org/10.2307/2971733.
  • Hirano, K., Imbens, G. W., and Ridder, G. (2003), “Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score,” Econometrica, 71, 1161–1189. DOI: https://doi.org/10.1111/1468-0262.00442.
  • Hirano, K., and Porter, J. R. (2009), “Asymptotics for Statistical Treatment Rules,” Econometrica, 7, 1683–1701.
  • Iacus, S. M., King, G., and Porro, G. (2011), “Multivariate Matching Methods That Are Monotonic Imbalance Bounding,” Journal of the American Statistical Association, 106, 345–361. DOI: https://doi.org/10.1198/jasa.2011.tm09599.
  • Ionides, E. L. (2008), “Truncated Importance Sampling,” Journal of Computational and Graphical Statistics, 17, 295–311. DOI: https://doi.org/10.1198/106186008X320456.
  • Kallus, N. (2016), “Generalized Optimal Matching Methods for Causal Inference,” arXiv no. 1612.08321.
  • ——— (2017), “Recursive Partitioning for Personalization Using Observational Data,” in Proceedings of the 34th International Conference on Machine Learning, pp. 1789–1798.
  • ——— (2018), “Balanced Policy Evaluation and Learning,” in Advances in Neural Information Processing Systems, pp. 8895–8906.
  • Kallus, N., and Zhou, A. (2018a), “Confounding-Robust Policy Improvement,” in Advances in Neural Information Processing Systems, pp. 9269–9279.
  • Kallus, N., and Zhou, A. (2018b), “Policy Evaluation and Optimization With Continuous Treatments,” in International Conference on Artificial Intelligence and Statistics, pp. 1243–1251.
  • Kallus, N., and Zhou, A. (2019), “Assessing Disparate Impacts of Personalized Interventions: Identifiability and Bounds,” arXiv no. 1906.01552.
  • Kitagawa, T., and Tetenov, A. (2018), “Who Should Be Treated? Empirical Welfare Maximization Methods for Treatment Choice,” Econometrica, 86, 591–616. DOI: https://doi.org/10.3982/ECTA13288.
  • Kosorok, M. R., and Laber, E. B. (2019), “Precision Medicine,” Annual Review of Statistics and Its Application, 6, 263–286. DOI: https://doi.org/10.1146/annurev-statistics-030718-105251.
  • Kube, A., Das, S., and Fowler, P. J. (2019), “Allocating Interventions Based on Predicted Outcomes: A Case Study on Homelessness Services,” in Proceedings of the AAAI Conference on Artificial Intelligence. DOI: https://doi.org/10.1609/aaai.v33i01.3301622.
  • Laber, E. B., Lizotte, D. J., Qian, M., Pelham, W. E., and Murphy, S. A. (2014), “Dynamic Treatment Regimes: Technical Challenges and Applications,” Electronic Journal of Statistics, 8, 1225. DOI: https://doi.org/10.1214/14-ejs920.
  • LaLonde, R. J. (1986), “Evaluating the Econometric Evaluations of Training Programs With Experimental Data,” The American Economic Review, 76, 604–620.
  • Li, F., Morgan, K. L., and Zaslavsky, A. M. (2018), “Balancing Covariates via Propensity Score Weighting,” Journal of the American Statistical Association, 113, 390–400. DOI: https://doi.org/10.1080/01621459.2016.1260466.
  • Li, L., Chu, W., Langford, J., and Wang, X. (2011), “Unbiased Offline Evaluation of Contextual-Bandit-Based News Article Recommendation Algorithms,” in Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 297–306. DOI: https://doi.org/10.1145/1935826.1935878.
  • Mandel, T., Liu, Y.-E., Levine, S., Brunskill, E., and Popovic, Z. (2014), “Offline Policy Evaluation Across Representations With Applications to Educational Games,” in Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems, pp. 1077–1084.
  • Pollard, D. (1990), “Empirical Processes: Theory and Applications,” in NSF-CBMS Regional Conference Series in Probability and Statistics.
  • Qian, M., and Murphy, S. A. (2011), “Performance Guarantees for Individualized Treatment Rules,” The Annals of Statistics, 39, 1180. DOI: https://doi.org/10.1214/10-AOS864.
  • Robins, J. M., Rotnitzky, A., and Zhao, L. P. (1994), “Estimation of Regression Coefficients When Some Regressors Are Not Always Observed,” Journal of the American Statistical Association, 89, 846–866. DOI: https://doi.org/10.1080/01621459.1994.10476818.
  • Rubin, D. B. (1980), “Comments on ‘Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment’,” Journal of the American Statistical Association, 75, 591–593. DOI: https://doi.org/10.2307/2287653.
  • Rubin, D. B. (2010), “On the Limitations of Comparative Effectiveness Research,” Statistics in Medicine, 29, 1991–1995.
  • Santacatterina, M., and Bottai, M. (2018), “Optimal Probability Weights for Inference With Constrained Precision,” Journal of the American Statistical Association, 113, 983–991. DOI: https://doi.org/10.1080/01621459.2017.1375932.
  • Smith, J. A., and Todd, P. E. (2005), “Does Matching Overcome Lalonde’s Critique of Nonexperimental Estimators?,” Journal of Econometrics, 125, 305–353.
  • Stoye, J. (2009), “Minimax Regret Treatment Choice With Finite Samples,” Journal of Econometrics, 151, 70–81. DOI: https://doi.org/10.1016/j.jeconom.2009.02.013.
  • Swaminathan, A., and Joachims, T. (2015a), “Counterfactual Risk Minimization: Learning From Logged Bandit Feedback,” in International Conference on Machine Learning, pp. 814–823.
  • Swaminathan, A., and Joachims, T. (2015b), “The Self-Normalized Estimator for Counterfactual Learning,” in Advances in Neural Information Processing Systems, pp. 3231–3239.
  • Tsiatis, A. (2007), Semiparametric Theory and Missing Data, New York: Springer.
  • Van der Vaart, A. W. (1998), Asymptotic Statistics, New York: Cambridge University Press.
  • Vapnik, V. (2000), The Nature of Statistical Learning Theory, New York: Springer.
  • Zhao, Y., Zeng, D., Rush, A. J., and Kosorok, M. R. (2012), “Estimating Individualized Treatment Rules Using Outcome Weighted Learning,” Journal of the American Statistical Association, 107, 1106–1118. DOI: https://doi.org/10.1080/01621459.2012.695674.
  • Zhao, Y.-Q., Zeng, D., Tangen, C. M., and Leblanc, M. L. (2019), “Robustifying Trial-Derived Optimal Treatment Rules for a Target Population,” Electronic Journal of Statistics, 13, 1717–1743. DOI: https://doi.org/10.1214/19-EJS1540.
  • Zhou, Z., Athey, S., and Wager, S. (2018), “Offline Multi-Action Policy Learning: Generalization and Optimization,” arXiv no. 1810.04778.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.