References
- Athey, S., and Wager, S. (2017), “Efficient Policy Learning,” arXiv no. 1702.02896.
- Behaghel, L., Crépon, B., and Gurgand, M. (2014), “Private and Public Provision of Counseling to Job Seekers: Evidence From a Large Controlled Experiment,” American Economic Journal: Applied Economics, 6, 142–174. DOI: https://doi.org/10.1257/app.6.4.142.
- Bennett, A., and Kallus, N. (2020), “Efficient Policy Learning From Surrogate-Loss Classification Reductions,” in Proceedings of the 34th International Conference on Machine Learning.
- Bertsimas, D., Kallus, N., Weinstein, A. M., and Zhuo, Y. D. (2017), “Personalized Diabetes Management Using Electronic Medical Records,” Diabetes Care, 40, 210–217. DOI: https://doi.org/10.2337/dc16-0826.
- Beygelzimer, A., and Langford, J. (2009), “The Offset Tree for Learning With Partial Labels,” in Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 129–138. DOI: https://doi.org/10.1145/1557019.1557040.
- Bickel, P., Klassen, C., Ritov, Y., and Wellner, J. (1993), Efficient and Adaptive Estimation for Semiparametric Models, New York: Springer.
- Chen, G., Zeng, D., and Kosorok, M. R. (2016), “Personalized Dose Finding Using Outcome Weighted Learning,” Journal of the American Statistical Association, 111, 1509–1521. DOI: https://doi.org/10.1080/01621459.2016.1148611.
- Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., and Robins, J. (2018), “Double/Debiased Machine Learning for Treatment and Structural Parameters,” The Econometrics Journal, 21, C1–C68. DOI: https://doi.org/10.1111/ectj.12097.
- Cochran, W. G., and Rubin, D. B. (1973), “Controlling Bias in Observational Studies: A Review,” Sankhyā: The Indian Journal of Statistics, Series A, 35, 417–446.
- Crump, R., Hotz, V. J., Imbens, G., and Mitnik, O. (2006), “Moving the Goalposts: Addressing Limited Overlap in the Estimation of Average Treatment Effects by Changing the Estimand,” Working Paper 330, National Bureau of Economic Research.
- D’Amour, A., Ding, P., Feller, A., Lei, L., and Sekhon, J. (2017), “Overlap in Observational Studies With High-Dimensional Covariates,” arXiv no. 1711.02582.
- Dehejia, R. H., and Wahba, S. (1999), “Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs,” Journal of the American Statistical Association, 94, 1053–1062. DOI: https://doi.org/10.1080/01621459.1999.10473858.
- Dudík, M., Langford, J., and Li, L. (2011), “Doubly Robust Policy Evaluation and Learning,” in Proceedings of the 28th International Conference on International Conference on Machine Learning, pp. 1097–1104.
- Hahn, J. (1998), “On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects,” Econometrica, 66, 315–331. DOI: https://doi.org/10.2307/2998560.
- Heckman, J. J., Ichimura, H., and Todd, P. E. (1997), “Matching as an Econometric Evaluation Estimator: Evidence From Evaluating a Job Training Programme,” The Review of Economic Studies, 64, 605–654. DOI: https://doi.org/10.2307/2971733.
- Hirano, K., Imbens, G. W., and Ridder, G. (2003), “Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score,” Econometrica, 71, 1161–1189. DOI: https://doi.org/10.1111/1468-0262.00442.
- Hirano, K., and Porter, J. R. (2009), “Asymptotics for Statistical Treatment Rules,” Econometrica, 7, 1683–1701.
- Iacus, S. M., King, G., and Porro, G. (2011), “Multivariate Matching Methods That Are Monotonic Imbalance Bounding,” Journal of the American Statistical Association, 106, 345–361. DOI: https://doi.org/10.1198/jasa.2011.tm09599.
- Ionides, E. L. (2008), “Truncated Importance Sampling,” Journal of Computational and Graphical Statistics, 17, 295–311. DOI: https://doi.org/10.1198/106186008X320456.
- Kallus, N. (2016), “Generalized Optimal Matching Methods for Causal Inference,” arXiv no. 1612.08321.
- ——— (2017), “Recursive Partitioning for Personalization Using Observational Data,” in Proceedings of the 34th International Conference on Machine Learning, pp. 1789–1798.
- ——— (2018), “Balanced Policy Evaluation and Learning,” in Advances in Neural Information Processing Systems, pp. 8895–8906.
- Kallus, N., and Zhou, A. (2018a), “Confounding-Robust Policy Improvement,” in Advances in Neural Information Processing Systems, pp. 9269–9279.
- Kallus, N., and Zhou, A. (2018b), “Policy Evaluation and Optimization With Continuous Treatments,” in International Conference on Artificial Intelligence and Statistics, pp. 1243–1251.
- Kallus, N., and Zhou, A. (2019), “Assessing Disparate Impacts of Personalized Interventions: Identifiability and Bounds,” arXiv no. 1906.01552.
- Kitagawa, T., and Tetenov, A. (2018), “Who Should Be Treated? Empirical Welfare Maximization Methods for Treatment Choice,” Econometrica, 86, 591–616. DOI: https://doi.org/10.3982/ECTA13288.
- Kosorok, M. R., and Laber, E. B. (2019), “Precision Medicine,” Annual Review of Statistics and Its Application, 6, 263–286. DOI: https://doi.org/10.1146/annurev-statistics-030718-105251.
- Kube, A., Das, S., and Fowler, P. J. (2019), “Allocating Interventions Based on Predicted Outcomes: A Case Study on Homelessness Services,” in Proceedings of the AAAI Conference on Artificial Intelligence. DOI: https://doi.org/10.1609/aaai.v33i01.3301622.
- Laber, E. B., Lizotte, D. J., Qian, M., Pelham, W. E., and Murphy, S. A. (2014), “Dynamic Treatment Regimes: Technical Challenges and Applications,” Electronic Journal of Statistics, 8, 1225. DOI: https://doi.org/10.1214/14-ejs920.
- LaLonde, R. J. (1986), “Evaluating the Econometric Evaluations of Training Programs With Experimental Data,” The American Economic Review, 76, 604–620.
- Li, F., Morgan, K. L., and Zaslavsky, A. M. (2018), “Balancing Covariates via Propensity Score Weighting,” Journal of the American Statistical Association, 113, 390–400. DOI: https://doi.org/10.1080/01621459.2016.1260466.
- Li, L., Chu, W., Langford, J., and Wang, X. (2011), “Unbiased Offline Evaluation of Contextual-Bandit-Based News Article Recommendation Algorithms,” in Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 297–306. DOI: https://doi.org/10.1145/1935826.1935878.
- Mandel, T., Liu, Y.-E., Levine, S., Brunskill, E., and Popovic, Z. (2014), “Offline Policy Evaluation Across Representations With Applications to Educational Games,” in Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems, pp. 1077–1084.
- Pollard, D. (1990), “Empirical Processes: Theory and Applications,” in NSF-CBMS Regional Conference Series in Probability and Statistics.
- Qian, M., and Murphy, S. A. (2011), “Performance Guarantees for Individualized Treatment Rules,” The Annals of Statistics, 39, 1180. DOI: https://doi.org/10.1214/10-AOS864.
- Robins, J. M., Rotnitzky, A., and Zhao, L. P. (1994), “Estimation of Regression Coefficients When Some Regressors Are Not Always Observed,” Journal of the American Statistical Association, 89, 846–866. DOI: https://doi.org/10.1080/01621459.1994.10476818.
- Rubin, D. B. (1980), “Comments on ‘Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment’,” Journal of the American Statistical Association, 75, 591–593. DOI: https://doi.org/10.2307/2287653.
- Rubin, D. B. (2010), “On the Limitations of Comparative Effectiveness Research,” Statistics in Medicine, 29, 1991–1995.
- Santacatterina, M., and Bottai, M. (2018), “Optimal Probability Weights for Inference With Constrained Precision,” Journal of the American Statistical Association, 113, 983–991. DOI: https://doi.org/10.1080/01621459.2017.1375932.
- Smith, J. A., and Todd, P. E. (2005), “Does Matching Overcome Lalonde’s Critique of Nonexperimental Estimators?,” Journal of Econometrics, 125, 305–353.
- Stoye, J. (2009), “Minimax Regret Treatment Choice With Finite Samples,” Journal of Econometrics, 151, 70–81. DOI: https://doi.org/10.1016/j.jeconom.2009.02.013.
- Swaminathan, A., and Joachims, T. (2015a), “Counterfactual Risk Minimization: Learning From Logged Bandit Feedback,” in International Conference on Machine Learning, pp. 814–823.
- Swaminathan, A., and Joachims, T. (2015b), “The Self-Normalized Estimator for Counterfactual Learning,” in Advances in Neural Information Processing Systems, pp. 3231–3239.
- Tsiatis, A. (2007), Semiparametric Theory and Missing Data, New York: Springer.
- Van der Vaart, A. W. (1998), Asymptotic Statistics, New York: Cambridge University Press.
- Vapnik, V. (2000), The Nature of Statistical Learning Theory, New York: Springer.
- Zhao, Y., Zeng, D., Rush, A. J., and Kosorok, M. R. (2012), “Estimating Individualized Treatment Rules Using Outcome Weighted Learning,” Journal of the American Statistical Association, 107, 1106–1118. DOI: https://doi.org/10.1080/01621459.2012.695674.
- Zhao, Y.-Q., Zeng, D., Tangen, C. M., and Leblanc, M. L. (2019), “Robustifying Trial-Derived Optimal Treatment Rules for a Target Population,” Electronic Journal of Statistics, 13, 1717–1743. DOI: https://doi.org/10.1214/19-EJS1540.
- Zhou, Z., Athey, S., and Wager, S. (2018), “Offline Multi-Action Policy Learning: Generalization and Optimization,” arXiv no. 1810.04778.