1,297
Views
5
CrossRef citations to date
0
Altmetric
Applications and Case Studies

Discovering Heterogeneous Exposure Effects Using Randomization Inference in Air Pollution Studies

ORCID Icon, &
Pages 569-580 | Received 28 Feb 2018, Accepted 19 Dec 2020, Published online: 16 Feb 2021
 

Abstract

Several studies have provided strong evidence that long-term exposure to air pollution, even at low levels, increases risk of mortality. As regulatory actions are becoming prohibitively expensive, robust evidence to guide the development of targeted interventions to protect the most vulnerable is needed. In this article, we introduce a novel statistical method that (i) discovers subgroups whose effects substantially differ from the population mean, and (ii) uses randomization-based tests to assess discovered heterogeneous effects. Also, we develop a sensitivity analysis method to assess the robustness of the conclusions to unmeasured confounding bias. Via simulation studies and theoretical arguments, we demonstrate that hypothesis testing focusing on the discovered subgroups can substantially increase statistical power to detect heterogeneity of the exposure effects. We apply the proposed de novo method to the data of 1,612,414 Medicare beneficiaries in the New England region in the United States for the period 2000–2006. We find that seniors aged between 81 and 85 with low income and seniors aged 85 and above have statistically significant greater causal effects of long-term exposure to PM2.5 on 5-year mortality rate compared to the population mean.

Supplementary Materials

Appendix The online appendix contains proofs for Theorems 1 and 2, additional simulations for the discovery step. (.pdf file)

R code for application and simulations An R script illustrates our methods with a simulated dataset. Codes for both implementing the de novo method and producing simulation results are provided. (.R file)

Acknowledgments

We are grateful for helpful feedback from the editor, the associate editor, four anonymous referees, and session participants at JSM and European Causal Inference Meeting.

Additional information

Funding

This work was supported by NIH grants (R01GM111339, R01ES024332, R01ES026217, P50MD010428, DP2MD012722, R01ES028033, R01MD012769) and HEI grant (4953-RFA14-3/16-4). This work was also supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (2021R1C1C1012750).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.