1,357
Views
10
CrossRef citations to date
0
Altmetric
Technical Papers

The relationship between daily cardiovascular mortality and daily ambient concentrations of particulate pollutants (sulfur, arsenic, selenium, and mercury) and daily source contributions from coal power plants and smelters (individually, combined, and with interaction) in Phoenix, AZ, 1995–1998: A multipollutant approach to acute, time-series air pollution epidemiology: I

Pages 599-610 | Received 01 Oct 2014, Accepted 19 Mar 2015, Published online: 14 Apr 2015

Abstract

The objective of this paper is to estimate the increase in risk of daily cardiovascular mortality due to an increase in the daily ambient concentration of the individual particulate pollutants sulfur (S), arsenic (As), selenium (Se), and mercury (Hg) using single-pollutant models (SPMs) and to compare this risk to the combined increase in risk due to an increase in all four pollutants by including all four pollutants in the same model (multipollutant model, MPM) and to the risks from source contributions from power plants and smelters. Individual betas in a multipollutant model (MPM) were summed to give a combined beta. Interaction was investigated with a pollutant product term. SPMs (controlling for time trends, temperature, and relative humidity), for an interquartile range (IQR) increase in the pollutant concentration on lag day 0, gave these percent excess risks (±95% confidence levels): S, 6.9% (1.3–12%); As, 2.9% (0.4–5.5%); Se, 1.4% (–1.7 to 4.6); Hg, 9.6% (4.8–14.6%). The SPM beta for S (as sulfate) was higher than found in other studies. The SPM beta for Hg gave the largest t-statistic and beta per unit mass of any pollutant studied. An (IQR) increase in all four pollutants gave an excess risk of 15.4% (7.5–23.8%), slightly smaller than the combination of S and Hg, 16.7% (9.1–24.9%). The combined beta was 71% of the sum of the four individual SPM betas, indicating a reduction in confounding among pollutants in the combined model. As and Se were shown to be noncausal; their SPM betas could be explained as confounding by S.

Implications: The combined effect of several pollutants can be estimated by including the appropriate pollutants in the same statistical model, summing their individual betas to give a combined beta, and using a variance–covariance matrix to obtain the standard error. This approach identifies and reduces confounding among the species in the multipollutant model and can be used to identify confounded species that have no independent relationship with mortality. The effect of several pollutants acting together may be higher than that of one pollutant. Further work is needed to understand the strong relationship of mortality with particulate mercury and sulfate.

Introduction

Over the past decade there has been an effort to move air quality management from a single-pollutant approach to a “multipollutant” approach (National Research Council, Citation2004; Hidy, Citation2010; Chow, Citation2010; Mauderly et al., Citation2010). This has been accompanied by an effort to develop an air pollution research program suitable to provide a sound basis for multipollutant air quality management (Vedal, Citation2011; Dominici, Citation2010; Greenbaum and Shaikh, Citation2010). It has been fairly common to include two pollutants in the same statistical model and compare the individual betas and confidence intervals in an attempt to identify the more toxic pollutant or the more likely causal pollutant (Burnett, Citation2004; Roberts, Citation2006; Brook et al., Citation2007). However, it is now generally recognized that this approach is not reliable (Vedal, Citation2011; Tolbert et al., Citation2007; Ito et al., Citation2007). Ito et al. (Citation2007) argue that unless the pollutant variables are uncorrelated, pollutants in a multipollutant model (MPM) cannot be considered as independent variables. Therefore, multipollution approaches to air pollution epidemiology that use betas or t-statistics (or derived confidence intervals) as an indicator of the relative toxicity of the individual pollutants may provide indications but are not necessarily reliable (Burnett, Citation2004; Roberts, Citation2006; Brook et al., Citation2007). While the individual betas in an MPM may not be considered representative of the effects of the individual pollutants, the sum of the betas (combined beta) may be considered representative of the combined effect of the multiple pollutants. Thus, by putting several pollutants in the same statistical model, we can obtain the combined health effect of the pollutants in the model. Thurston et al. (Citation2013) have used a similar approach, combining many pollutants in an MPM to obtain a total risk index for long term exposure to air pollutants. This paper is apparently the first to apply this multipollutant approach to acute, time-series air pollution epidemiology. Source contributions obtained from source apportionment models also provide an approach to investigating the combined effects of multicomponent mixtures.

Methods

Air pollution data

Chemical composition data were obtained from the U.S. Environmental Protection Agency (EPA) National Exposure Research Laboratory (NERL) research monitoring site in central Phoenix. The site and measurements program are described in detail in Mar et al. (Citation2000, Citation2003) and in a EPA report (EPA, Citation1998). In brief, NERL investigators collected daily, gravimetric, integrated 24-hr (starting at 0700), fine particle samples (PM2.5) on Teflon filters. The concentrations of x-ray fluorescence (XRF) elements (including S, Hg, As, and Se) were determined at the EPA (Research Triangle Park, NC) with energy-dispersive x-ray fluorescence. Temperature and relative humidity measurements (1238 days from Feburary 9, 1995, until June 30, 1998) were obtained from measurements at the monitoring site, or in case of missing data, from the National Oceanic and Atmospheric Administration weather station at the Phoenix (AZ) Sky Harbor airport. Concentrations of XRF elements were available for 993 days. Pearson correlation coefficients between pollutants, temperature, and relative humidity are shown in . Except for the correlations between Se and As (r = 0.687), As and S (r = 0.400), and Se and S (r = 0.220), all correlations between pollutants were weak. Since the species differed greatly in concentrations, nanograms and micrograms per cubic centimeter (ng/cm3 and µg/cm3) concentration units were converted to interquartile range concentration units (IQR) prior to analysis. IQR values are given in . Betas are calculated in terms of IQR units; one IQR unit equals the IQR concentration. Plots of the pollutants versus time are provided in the supplemental data. Plots of S, As, and Se are similar, with higher concentrations during August to December. Hg concentrations did not show any seasonal variation.

Table 1. Pearson’s correlation coefficient, r

Table 2. Relationship of cardiovascular mortality with single and multiple pollutants (for an IQR increase in pollutant concentration)

Mortality data

Mortality data for all of Maricopa County from January 1, 1995, to June 30, 1998, were obtained from the Arizona Center for Health Statistics in Phoenix, AZ. Death certificate data included residence ZIP Code and the primary cause of death as identified by the International Classification of Diseases, Ninth Revision (ICD-9; World Health Organization, Geneva, Switzerland). In this study, relationships were investigated with cardiovascular mortality (CVM) only (ICD-9 codes 390–448.9) for persons 65 years and older. For the time period covered in the analysis, there were 4847 cardiovascular deaths or an average of 3.91 deaths per day. Autocorrelation in the natural logarithm of the daily death count was low; R2 never exceeded 0.01 out to lag 10.

Study area

Only the deaths of residents in ZIP Codes located near the monitoring site were included in this study. This ZIP Code region, hereafter called Central Phoenix, was recommended by the Arizona Department of Environmental Quality (Phoenix, AZ) as being a area in which the average community pollutant concentrations would be well represented by the concentrations measured at the monitoring site (Mar et al., Citation2000, Wilson et al., Citation2007). A map of Phoenix showing the ZIP Code areas used in this study and the location of the monitoring site may be found in Wilson et al. (Citation2007). Maps and additional information on the socioeconomic status of each ZIP Code are given in the supplemental data.

Statistical model

Mortality outcomes were modeled using Poisson log-link regression allowing for overdispersion in a generalized additive model (GAM) (Dominici et al., Citation2004) with a stringent convergence criterion (10e-10 with 1000 iteration steps). Mortality count data are distributed lognormally, so transformation from counts to the natural logarithm was appropriate. Autocorrelation in the residuals was checked for S and found to be low (R2 < 0.007) out to lag 10. The distribution of residuals appeared to be adequately random (plot shown in supplemental material). Analyses were conducted using R (R Core Team, Citation2012). The GAM-Exact technique, described by Dominici et al. (Citation2004), was used to obtain the final betas and the standard errors in betas. The base model used smoothing splines to control for time trends, temperature, and relative humidity. The number of degrees of freedom (df) for the time trend (df = 12) was selected by averaging the df obtained by identifying the minimum Akaike information criterion (AIC) (df = 15) and the df obtained by identifying the minimum absolute sum of the partial autocorrelation function (PACF) (df = 9). The potential confounding effects of weather were accounted for by including smooths for temperature (lag 2 days, df = 5) and relative humidity (lag 0 days, df = 2); lag days and df were based on minimization of the AIC. Indicator variables were used to adjust for day of the week. An extensive analysis of the sensitivity of the results for particulate matter (PM) to temperature and relative humidity controls is given in Mar et al. (Citation2000). In the earlier Phoenix studies (Mar et al., Citation2000, Citation2003), missing values were replaced with the average of all measured values. For this study, the Source Apportionment Workshop (Mar et al., Citation2006), and the more recent Phoenix study (Wilson et al., Citation2007), days with missing values were omitted from the analyses.

Single-pollutant models (SPMs) had only one pollutant variable in the model. In the multipollutant approach, variables representing two or more pollutants were included in the same statistical model. The individual betas calculated by the model were added to give a combined beta. The standard error for the combined beta was estimated using a variance/covariance matrix (T. Lumley, personal communication, 2008). Since the analysis used IQR units, using the combined beta in the equation [β = ln(rate ratio)] yields the increase in risk for one IQR increase in concentration of each pollutant in the model. To convert to gravimetric units, divide beta by the IQR (in ng/m3) and multiply by the number of units; that is, [(β × 10)/IQR = ln(rate ratio)] gives the relative risk for a 10-ng/m3 increase in the pollutant. With an SPM, we may think of the increase in risk being for a daily or a yearly average increase in the pollutant. For a combined beta, from an MPM, we must think in terms of a yearly increase in risk for an increase in the yearly average of each pollutant in the model.

Results and Discussion

Single-pollutant models

The beta, standard error, t-statistic, probability, percent increase in risk for an IQR increase in the pollutant concentration, and ±95% confidence intervals (CI) for the relationship of each pollutant with CVM are given in for lag day 0, the only lag day out to lag day 10 for which the relationship between mortality and pollutant was statistically significant.

Sulfate

In this study, the percent excess risk (95% confidence intervals) for an increase of 1 µg/m3 of sulfate (measured as XRF-S) was 8.0% (1.4–15%), higher than reported in most previous studies (EPA, Citation2009). Most studies of air pollution epidemiology use county-level mortality data. The use of ZIP-Code-level mortality data makes it possible to focus on a population in an area where the ambient concentration is well represented by the monitor. The exposure error for this population should be lower than the exposure error for the entire county (Mar et al., Citation2000). For Maricopa County, the value for sulfate is 1.8% (–1.6 to 5.2%), more in line with other results for county-level mortality (EPA, Citation2009). In Phoenix the selected ZIP-Code population also has a lower and more homogeneous socioeconomic status (SES) than the larger county population. Further work is needed to determine which population property is most important, a lower exposure error or a more homogeneous and lower SES.

Arsenic and selenium

As and Se were included in the NPACT study (Ito et al., Citation2013) and Se has been used as an indicator of coal combustion (Laden et al., Citation2000).

Mercury

The extremely strong relationship of CVM with particulate Hg was unexpected and surprising. The neurotoxic effects of Hg are well known. However, there is some evidence from toxicology to indicate that Hg may also have cardiovascular effects. Mozaffarian (Citation2009) lists the following “Experimentally-observed effects of mercury which may increase CVD risk” (p. 1899):

Systemic Effects

Promotion of free radicals and reactive oxygen species

Inhibition of antioxidant systems (glutathione peroxidase, catalase, superoxide dismutase)

Increased lipid (e.g., LDL cholesterol) peroxidation

Promotion of blood coagulation (clotting)

Inhibition of endothelial cell migration

Direct Cardiovascular Effects

Reduction in myocardial contractile force

Increased calcium release from myocardial sarcoplasmic reticulum

Reduction in left ventricular myosin ATPase activity

Decreased heart rate variability and increased blood pressure

Hence, it would not be possible, on the basis of toxicity alone, to rule out the possibility that particulate Hg, like other components of air pollution, may increase CVM.

However, it is not unreasonable to ask whether the relationship of CVM with Hg is real or spurious. There are several reasons to suspect that it may be spurious. Many of the values are negative, and the average is –0.13 ng/m3. However, this may reflect too large a correction for the filter blank or the XRF background. The confidence limit is about the same size as the IQR. Hg does not show any seasonal variation. Hg has an unusually large standard deviation for an air pollutant. Perhaps Hg comes from a local source that impacts the sampling site on some days and not on others. The large percent excess risk of 9.6% (4.8–14.6%) per increase in one IQR of 1.1 ng/m3, if true, would mean that for the same increase in mass, Hg would be 1000 times as toxic as sulfate. This suggests that Hg may be a surrogate for some other pollutant emitted from the same source in much larger quantities, perhaps total Hg (particulate + gaseous + semivolatile). (The semivolatile Hg may have evaporated from the filter by the time the filter was analyzed.) However, there is one reason to feel that Hg deserves further study. Hg’s t-statistic value of 3.99 gives a p of < 10−4, or, statistically speaking, there is less than 1 chance in 10,000 that the relationship is spurious.

As far as we are aware, this is the first study to look for a relationship between the concentrations of particulate Hg collected on a filter and CVM. Although the Hg concentration time series is not of the highest quality, the high beta and t-statistic suggest that the relationship might be real and that additional epidemiologic studies using particulate Hg would be useful. State monitoring agencies should report particulate Hg levels (readily available from the XRF analyses used to determine metal concentrations) to the EPA monitoring database. Studies of particulate Hg relationship with mortality should include populations near monitoring sites and populations with different SES in order to investigate exposure error and SES effects. In addition, some studies should be undertaken to obtain better measurements of both particulate and gaseous Hg over sufficiently long time periods to provide data suitable for epidemiologic analyses.

Combined model with all four pollutants

We would like to know the combined effect of S, As, Se, and Hg. However, we cannot just add the SPM betas because each SPM beta will represent not only the effect of the single pollutant in the model but also, due to confounding, part of the effect of other correlated pollutants. Including several pollutants in an MPM prevents confounding among the pollutants in the model. However, the combined beta may be confounded by other pollutants, not included in the model, that have an effect independent of the pollutants in the model and are correlated with one or more of the pollutants in the model. Thus, the combined beta may overestimate the effect of the pollutants in the model, but, if so, it will underestimate the effect of all pollutants present in the atmosphere. shows the results for a combined model containing S, As, Se, and Hg. The combined beta is almost identical to the combined beta for S and Hg. This suggests that the SPM relationship of As and Se with CVM can be attributed to their correlation with S. The combined beta for the four pollutants is only 71% of the sum of the betas from the four SPMs. Thus, the use of the combined model reduces confounding among the pollutants in the combined model and gives a more accurate estimate of the combined effects of the four pollutants.

Causal, or correlated but noncausal; confounder or confuser?

Confounding presents a major problem in interpreting the results of epidemiologic analyses. We would like to know the combined effect of S, As, Se, and Hg. However, because of confounding, the sum of the SPM betas would misrepresent the result. An extensive literature treats confounding in clinical studies, for example, Steyer and Schmitt (Citation1994 and references therein). Tests have been developed for using the SPM and co-pollutant model (CPM) to identify confounding in clinical epidemiology. First will describe these tests and show that they are not applicable to acute, time-series epidemiology (ATSE). Then we develop some tests to identify and quantify confounding in ATSE. More importantly, we show how to identify species that are noncausal but are correlated with a second pollutant. If the second pollutant is causal, or has an independent relationship with mortality, the noncausal pollutant, when included in a CPM, will change the relationship of second pollutant with mortality. We call such species confusers.

The test developed in Steyer and Schmitt (Citation1994) uses the following notation:

(1)
(2)
The test was: If α1 β2 = 0, not confounded; if α1 β2 ≠ 0, confounded. The test was applied to examples from clinical epidemiology. Thus, if α1 = 0, or near 0, X has no effect and will not confound; if β2 = 0, or near 0, W has no effect and X will not confound W. If α1 β2 ≠ 0, there will be confounding. This test does not account for what happens when X and W are correlated. In ATSE it would be rare to have an α or β near zero. Thus, this test is not adequate for ATSE.

Quantitative criteria are needed for ATSE. Lecture notes from a Boston University (Citation2015) online course give a different but quantitative test, also for clinical epidemiology.

A simple, direct way to determine whether a given risk factor caused confounding is to compare the estimated measure of association before and after adjusting for confounding. In other words, compute the measure of association both before and after adjusting for a potential confounding factor. If the difference between the two measures of association is 10% or more, then confounding was present. If it is less than 10%, then there was little, if any, confounding.

Using the preceding notation, the criterion is given as (α1 − β2)/β2. This test is also illustrated with examples from clinical epidemiology. This test would work if X and W had a 0 correlation with each other. But in that case the correlation coefficient would provide the test. It might also work if X and W were correlated and had the same correlation with their respective dose to the body. However, this is generally not the case for air pollutants. Thus, this test is also not adequate for ATSE.

Perhaps it will be useful here to recall that the definition of confounding is different for clinical epidemiology and ATSE.

Conditions necessary for confounding in clinical studies

  1. The confounding factor must be associated with both the risk factor of interest and the outcome.

  2. The confounding factor must be distributed unequally among the groups being compared.

  3. A confounder cannot be an intermediary step in the causal pathway from the exposure of interest to the outcome of interest.

Conditions necessary for confounding in ATSE

  1. The confounding variable must be associated (inversely or directly) with the outcome of interest.

  2. The confounding variable must be associated (inversely or directly) with the outcome of interest, independent of the variable of interest.

  3. The confounding variable must be associated (inversely or directly) with the variable of interest.

  4. The confounding variable is not in the causal pathway of the variable of interest to the outcome of interest.

As stated in condition 2, a confounder must have an association with the variable of interest and also with the outcome. In the following section, we develop a logical foundation for identifying and quantifying confounding in ATSE. We also show how to identify a variable that is not associated with the outcome and therefore is noncausal, but is associated with the variable of interest and therefore can lead to confusion.

Let us consider an ideal situation in which a population is exposed to only two pollutants with daily concentrations of P1 and P2 (given in the same units) and we have a time series of the daily pollutant concentrations and the number of deaths per day, Y. We assume a linear dependence of ln Y on P1 and P2 and express the expected values E(ln Y│P1) of variable ln Y as a function of P1 for a SPM or E(ln Y│P1, P2) for a CPM. In order to simplify notation, we follow the example of Steyer and Schmitt (Citation1994) and use α for the coefficient of pollutant concentration in SPMs and reserve β for CPMs and MPMs. Thus, we have for our ideal situation

(3)
(4)
If P1 and P2 are correlated, that is, R2 (P1P2) does not equal zero, P1 and P2 will confound each other. Part of the relationship of P1 with ln Y will be transferred to P2 and part of the relationship of P2 with ln Y will be transferred to P1. The sign of the quantity transferred is given by the sign, s, of the Pearson’s correlation coefficient, r. The amount of relationship depends on R2 (P1P2). Thus, we have for single-pollutant models
(5)
(6)
where αxT is the true, or unconfounded alpha.

For CPMs or MPMs we have

(7)
It will be convenient to define αS as the sum of the α terms from the SPMs and βC as the sum of the β terms from the CPM, which we refer to as the combined β, as
(8)
(9)
We can consider several cases. Since we can rarely prove causality with a single ATSE study, instead of referring to pollutants as causal, or possibly causal, we merely assume that the pollutant has a relationship with mortality that is independent of the relationship of the other pollutants in the model. We call such species independently related to mortality (IRM)—that is, pollutant P1 is IRM. An IRM pollutant could be causal or it could be noncausal but highly correlated with another pollutant not included in the model.

Case 1

P1 and P2 are both IRM but uncorrelated, that is, R2 (P1P2) = 0.

P1 will not confound P2 and P1 will not confound P1.

(10)
(11)
(12)
This important case occurs when a source apportionment model such as factor analysis or principal component analysis is used to obtain source contributions that are uncorrelated.

Case 2

P1 and P2 are correlated and both P1 and P2 are IRM. In SPMs, P1 and P2 are said to confound each other. We assume that mortality is not caused by ambient concentrations of pollution but by the dose of pollutant to the lung. Different pollutants may differ widely in the correlation between ambient concentration and dose to the lung. In our ideal situation, however, P1 and P2 both have the same degree of correlation with the amount of P1 and P2 deposited in the lung.

In the combined model, the statistical analysis will remove the confounding so that

(13)
(14)
(15)
(16)
If r is positive, βX will be less than αX. If r is negative, βX will be more than αX. We can use ((αX – βX)/βX) × 100 as a criterion to give some idea of how much confounding contributes to αX. Call this Criterion C1.

Case 3

This is like Case 2, but P1 and P2 have different degrees of correlation with their respective dose. Generally, one pollutant will be better correlated with the dose than the other. The statistical analysis will distribute more of the effect to the pollutant that has the better correlation with the dose. Therefore,

(17)

However, the sum of the β terms will still equal the sum of the true αT terms, that is,

(18)

Therefore, βC, the sum of the β terms in the CPM, will give the true effect on mortality of a unit increase in both pollutants.

It is not uncommon for analysts to control for confounding by P2 by putting P2 in a CPM model with P1 and either assuming that β1 will equal α1T or assuming that β1 and β2 will give the relative toxicities of P1 and P2. However, this will only be true in the unlikely situation described earlier when P1 and P2 have the same correlation with their dose.

For this case we cannot use C1, but we can use C2, ((αS – βC) × 100/βC), as a criterion for confounding or confusing. This gives the difference between the combined beta, βC, from the CPM, and the sum of the alpha’s from the SPMs. If C2 is positive we have positive confounding (r is positive) and the individual α terms are too large; if C2 is negative we have negative confounding (r is negative) and the individual α terms are too small. The individual β terms cannot be used in C1 because their values will depend on their correlation with each other and the correlation of the pollutant’s concentration with its dose.

Case 4

P1 and P2 are correlated but only P1 is IRM, that is, α2T = 0. α2 will be positive because P2 will be confounded by P1.

(17)
In this case, α2 is due solely to the correlation of P1 with P2 and P2 has no independent relationship with ln Y. In the CPM, all the relationship due to P1 will be captured in β1, and hence β2 will equal 0.
(18)
(19)
(20)
(21)
In the ideal case, βC would equal α1 + α2 . In a nonideal case, relationships will not be as exact but the criterion C3 =100 × (βC – α1)/ βC) will indicate how much βC has been increased and provide information regarding how likely P2 is to be causal. If C3 is near zero or negative, P2 is likely noncausal.

Case 5

P1 and P2 are correlated and P1 is IRM but P2 is noncausal. This is similar to Case 4. However, in this case, P1 is a multicomponent mixture and P2 is highly correlated with the causal component of P1. For example, in the case of S and As, As is highly correlated with the SO2 emitted from power plants and therefore likely to be highly correlated with the toxic metals neutralized and made soluble by reaction with sulfuric acid generated from the SO2 emissions. In this case, since both P1 and P2 are correlated with the toxic component of P1, the statistical analysis will distribute a portion of the effect to both P1 and P2. Since including P2 causes β1 to be less than α1, as happened in Case 4, we might think that P2 is a confounder of P1. However, since P2 is noncausal, it would not be considered a confounder so we call it a confuser. Since including P2 in the model will not increase the overall effect given by βC, βC will not be significantly greater than α1 and in the ideal case βC would equal α1. Again criterion C3 provides a useful test of whether P2 is causal or correlated, confounder or confuser. If the difference between βC and α1 relative to α1, given by C3 is very small, we can consider P2 to be noncausal and a confuser. Since C3 equal –6.26% for the S + As CPM, we can identify As as noncausal since it has no relationship with mortality except that due to its correlation with S. Again we cannot use C1 since the βs depend on the correlation of P1 and P2 with the causal component of P1.

Case 6

Let us add a third pollutant, P3. P2 and P3 are correlated with each other but not with P1. P1 and P2 are causal but P3 is not. Then the SPM for P3 is

(22)
(23)
Or since α3T = 0,
(24)
In a CPM with P1, P3 will appear to be causal and a confounder. However, including P3 in a CPM with P2 will show (Case 5) that P3 is noncausal and a confuser. The relationship between P3 and mortality is due to its correlation with the causal species P2.

Relationships in the real world are not as clean. We need to remember that

(25)
where the summation is over all PX’s known and measured, known but not measured, and unknown. (For PX= P1, s is positive and R2 = 1, so the first term in the summation is α1T∙P1.) Therefore, while the relationships we discuss are useful, they may not be exact because of the many additional possibilities for confounding.

Co- and multipollutant models

We are now ready to interpret the results of the CPMs and MPMs given in .

Table 3. Analysis of confounding and identification of noncausal species

Hg + S: An example of Case 1 with possible contributions from Case 2 and 3. Confounding is low, 1.47% for α1 (Hg), 3.46% for α2 (S), and 2.30% for βC. The contribution of S to βC is 40.65%, indicating that S is likely to be IRM. R2 is small and r is positive, so confounding slightly increases α1 over α1T, α2 over α2T, and αS over βC.

S + As: Example of Case 5. Confounding is high, >50% for α1 (S), α2 (As), and βC (S + As). However, as discussed under Case 5 earlier, the contribution of As to βC is a minus 6.26%, indicating that As is unlikely to be causal. The reduction in βC occurs because the As time series provides only error, that is, it is all noise with no signal, so it biases the βC to a lower value.

Hg + As: Example of Case 6. Since r is negative, β1 will be larger than α1 (Hg), and β2 will be larger than α2 (As). The contribution of As to βC is 26.88%, suggesting that As is likely to be IRM. However, we know from our analysis of S + As that As is not likely to be causal and that its contribution to βC is due to its correlation with S.

Hg + S + As: Extension of Case 5 to an MPM. In this three-pollutant model, the addition of As to Hg + S adds only 3.93% to βC, confirming that As is noncausal. Therefore, we would consider that As is a confuser, not a true confounder. The confounding of α2 (S), α3 (As), and βC (Hg + S + As) βC is high due to confounding of S and As by each other. The confounding of α1 (Hg) (–1.15%) is a combination of positive confounding by S and negative confounding by As. For testing Criterion 3 in a MPM, we would use β1 + β2 instead of α1.

The Steyer and Schmitt (Citation1994) test would indicate confounding in all these examples, incorrectly in the case of Hg + S. The Boston University (Citation2015) test would correctly indicate nonconfounding for Hg and S and, in the MPM case, correctly indicate that S and As were confounded but Hg was not. It would not work for the S + As example or another situation in which P1 and P2 were correlated. However, neither test would provide any indication that As and Se were likely noncausal.

Is sulfate causal?

It is unlikely that the sulfate anion (SO42-) is the biologic agent causing the toxicity associated with exposure to sulfate (Lippman et al., 1996). It is more likely that the cations (positive ions) associated with the sulfate anion (SO42-), determine the toxicity of sulfate containing particles. Sulfuric acid, formed from atmospheric oxidation of SO2, can react with basic metal oxides of toxic metals to convert insoluble oxides into soluble ions (Cd, Fe, Hg, Ni, Pb, V, Zn). These heavy metal ions are thought to be toxic (Amdur et al., Citation1978; Costa and Dreher, Citation1997; Ghio et al, Citation1999). Sulfate may also exist in the atmosphere as sulfuric acid aerosol (H2SO4) or, if neutralized by ammonia, as (NH4)2SO4 particles. Both types of particles yield potentially toxic acidic solutions in water (Lippmann et al., Citation2000; Gwynn et al., Citation2000).

In a review of the available information on the epidemiology and toxicology of air pollutants, the British Committee on the Medical Effects of Air Pollutants (COMEAP, 2009) concluded that

Sulphates could play a role in the capacity of PM2.5 to drive pulmonary inflammation via either increasing metal mobilization from PM, and hence increasing the ability to drive oxidative stress leading to inflammation, or decreasing the pH of lung-lining fluid, leading to decreased pathogen clearance and hence increased inflammation. Inflammation in the lung could then initiate changes in blood clotting, and/or activation of macrophages in atherosclerotic plaques increasing their instability.

While sulfate is likely not the biologically causal agent, sulfuric acid is in the causal pathway, for example:

Which link to call causal is perhaps more of a semantic than a scientific question. The key concern, however, is: “Will control of SO2 from power plants, and hence a reduction of sulfate in PM2.5, lead to a reduction in CVM?” This study, and many other studies that find an independent relationship of sulfate with CVM, suggest that the answer to the question is yes.

Source contributions

Sulfate is actually a multicomponent pollutant, having several fine particulate sources (power plants [oil and coal], smelters, possibly traffic [S in gasoline]) as well as coarse particulate sources (CaSO4 from wallboard and possibly other sulfates in resuspended soil). Since Hg is not correlated with sulfate and As and Se have been shown to be noncausal, we are left with sulfate as the only measured pollutant available to represent pollution from both coal power plants and smelters.

Source apportionment techniques may help separate sulfate into its various sources. Thurston et al. (Citation2011) have suggested that sulfate, or a group of associated species, may serve as a surrogate for particles from coal combustion. Using factor analysis in a study of 64 cities, they find a coal combustion factor including contributions from SO2, As, Se, and Hg. They did not include sulfate in their source apportionment analysis. It is likely that Hg, Se, and As are not the PM2.5 components causing the health effects associated with sulfate, but instead are serving, either individually or as a source signature, as a marker for other gaseous or particulate by-products of coal combustion.

Several types of source apportionment models (SAMs) have used the Phoenix pollutant database (Mar et al., Citation2000; Hopke et al., Citation2006; Mar et al., Citation2006). We next use factor analysis (FA), positive matrix factorization (PMF), and principal component analysis (PCA). In Phoenix, As, Se, and S are correlated but they are not correlated with SO2 or Hg. FA yielded one factor for regional sulfate. PMF splits regional sulfate into one coal combustion source and one smelter source. PCA finds one coal combustion and two smelter sources. SAMs appear to reduce the contribution from coarse sulfate in soil in PM2.5 as shown by the reduced correlation between of coal power plant and smelter sources and SOIL compared to the correlation between XRF-S and SOIL ().

Table 4. Pearson’s correlation coefficient, r, for source contributions with SOIL and S and for the coal power plant (PP) sources with the smelter sources

We can also combine sources into the same model in order to estimate the health effects of coal power plants and smelters together. gives results of SPM and MPM for these source contributions. It is interesting to see how the t-statistic and the percent excess risk both increase as we go from a single species sulfate (2.45 and 6.86%), to regional sulfate from FA (2.90 and 9.40%), to a coal power plant and a smelter from PMF (2.87 and 11.20%). Further splitting the smelter factor into two smelters (PCA) does not further improve the statistical significance (t-statistic) or increase the risk calculated for the combination of the two smelters and the coal power plant (PCA) (2.48 and 9.17%). This may be because PCA splits out Ni and V as having separate sources. Note that the negative correlation between the coal power plant and smelter 2 sources causes the β terms to be larger than the α terms. The increase in the t-statistic and the percent excess risk in going from XRF-S to sources may be due to removing the S in soil, as shown by the decrease in correlation in the source contributions with soil compared to that of XRF-S with SOIL ().

Table 5. Relationship of cardiovascular mortality with source contributions (for an IQR increase in the contribution of that source)

It would appear that an MPM containing species from coal power plants and smelters or a combination of source contributions from coal power plants and smelters could be used to estimate the health effects from these sources. Source contributions have the advantage of separating the two types of sources, while MPMs have the advantage of providing information on which species are noncausal.

Interaction

Hg and S

To investigate a possible interaction involving Hg and S, we use a variable obtained by multiplying Hg times sulfate and transforming the product into IQR units. (Hg and S are also in IQR units.) Results are shown in . In a SPM, Hg*S has a positive, statistically significant (SS) beta. However, when Hg*S is included with Hg and S in a MPM, its beta is negative and not SS. However, the individual betas for Hg and S in the MPM are both greater than in an SPM or a CPM. The combined beta from all three variables is higher than that from the CPM. This indicates that when both Hg and S concentrations are high, neither is as effective as when only one is high. Apparently they interfere with each other in some way (antagonism?). When Hg*S is included in the MPM, it controls for this interaction and gives the correct betas for Hg and S if only one species was present in the atmosphere. AICs were calculated but were not different enough to suggest that one model was better than another.

Table 6. Interaction of Hg and S and power plant (PP) and smelter sources

Power plant and smelter sources

As shown in , the interactions between coal power plant sources and smelter sources for both the PMF and PMC SAMs give results very similar to those found for Hg and S. When both source contributions are high, neither is as effective as when only one is high.

Sensitivity Analysis

Additional analyses were conducted to investigate how changes in the statistical model might affect the resulting betas and t-statistics.

Controls for temperature and relative humidity

In the statistical model, controls for temperature and relative humidity were based on minimization of AIC. We calculated beta for the relationship of the various pollutants with CVM with no controls, with controls used in the model based on minimization of AIC, and with controls based on inclusion of temperature and relative humidity as explanatory variables in a distributed lag model, for lag day 0 and lag days 0 + 1, 0 + 1 + 2, and 0 + 1 + 2 + 3. The betas were not very sensitive to changes in the temperature and relative humidity controls, suggesting that confounding by temperature or relative humidity was not a problem. Results are shown in the supplemental data.

Temporal control

In the statistical model, variations in beta with time were controlled by use of a smoothing spline with 12 df (about 4 df per year of study data) based on the average of the minimization of the AIC (df = 15) and the minimization of the PACF (df = 9). We calculated beta as a function of time df from 1 to 25. Using more degrees of freedom than 12 does not appreciably affect the beta for Hg but slightly increases the beta for XRF-S and slightly decreases the betas for As and Se. Using less than 9 degrees of freedom would decrease all species except Hg. Results are shown in the supplemental data.

Conclusions

The multipollutant approach to air pollution epidemiology, whether by use of MPMs or by use of source contributions, allows us to investigate the combined health effects of several pollutants. Since the use of an MPM reduces the amount of confounding among the species in the model, the combined beta gives a better indication of the combined effect than could be obtained if only SPM betas were available. Source contributions from SAMs also allow us to investigate the combined effect of the several pollutants included in the source signature. Further studies should examine other multipollutant groups, such as vehicle emissions and multipollutant groups based on other source categories.

The strong relationship of sulfate (measured as XRF S) with cardiovascular mortality (CVM) observed in this study indicates that the use of ZIP-Code-level mortality data provides higher and more statistically significant relative risks than obtained with county-level mortality data. Further work is needed to determine which population property is most important, a lower exposure error or a more homogeneous and lower SES.

The strong relationship of particulate Hg (collected on a filter) with CVM suggests that Hg may affect cardiovascular as well as neurological health. This relationship of particulate Hg with CVM is suspicious and needs to be confirmed by other studies. Therefore, further studies of the relationship of Hg with mortality are needed.

Comparison of the combined beta (βc) from a CPM containing concentrations of two pollutants with the sum of alphas (αs) from two SPMs of the same pollutants allows us to identify confounding and estimate how much the excess risk would be increased if we used αs instead of βc to calculate the relative risk. The difference between αx from a SPM and the corresponding βx allows us to identify confounding and estimate how much one pollutant is confounded by the other pollutant in the CPM. The difference between α1 and βc provides information on how much pollutant 2 contributes to βc from a CPM with pollutants 1 and 2. If the contribution to βc is small or negative, it is likely that pollutant 2 is noncausal. As and Se were found to be noncausal. Their SPM relationship with CVM may be attributed to their correlation with S. This study provides no support for an independent effect of As or Se on CVM.

Since As and Se are noncausal and Hg was not correlated with sulfate, sulfate is the only species left to represent emissions from both coal power plants and smelters. Using FA to yield a regional sulfate source reduces the error due to sulfate in soil and yields a higher t-statistic and percent excess risk. Further splitting regional sulfate into a coal power plant source and smelter sources further increases the t-statistic and percent excess risk when these sources are combined in a CPM (PMF) or an MPM (PCA).

Acknowledgments

Jane Koenig originated the technique of reducing exposure error by limiting the population in an epidemiologic study to a small area whose air pollutant concentrations were well represented by the monitoring site. Dr. Koenig also obtained from the state of Arizona the ZIP Code mortality data used in this study. Therese Mar conducted the statistical analyses for the earlier Phoenix studies and performed some preliminary analyses for this study. Larry Purdue, Robert Stevens, and Roy Zweidinger organized the EPA monitoring study that provided the pollutant and weather database used in this study. Thanks to Gary Norris for providing source contributions from factor analysis and positive matrix factorization and to Quingyu Meng for providing source contributions from principal component analysis. Thanks to Tom Lumly for writing a variance–covariance matrix code to calculate correct standard errors in the combined model. Thanks to David Svendsgaard, Lucas Neas, and Jason Sacks for helpful reviews. An early version of this paper was presented at the 11th International Congress on Combustion By-Products and their Health Effects, Research Triangle Park, NC, May 31–June 3, 2009.

Disclaimer

Some early work on this paper was done while the author was an employee or volunteer (Emeritus Program) at the U.S. Environmental Protection Agency. However, much of the work was done after the author retired. The views expressed in this paper are those of the author and do not necessarily reflect the views or policies of the U.S. Environmental Protection Agency.

Supplemental Materials

Supplemental data for this article can be accessed at http://dx.doi.org/10.1080/10962247.2015.1033067.

Supplemental material

Supplemental Data

Download PDF (1.1 MB)

Additional information

Notes on contributors

William E. Wilson

William E. Wilson retired in 2011 from the National Center for Environmental Assessment of the U.S. Environmental Protection Agency.

References

  • Amdur, M. J., Bayles, V. Ugro, and D.W. Underhill. 1978. Comparative irritant potency of sulfate salts. Environ. Res. 16:1–8. doi:10.1016/0013-9351(78)90135-4
  • Boston University. 2015. Confounding and effect measure modification. http://sphweb.bumc.bu.edu/otlt/MPH-Modules/BS/BS704-EP713_Confounding-EM/BS704-EP713_Confounding-EM_print.html (accessed March, 2015).
  • Brook J.R., R.T Burnett, T.F Dann, S. Cakmak, M.S. Goldberg, X. Fan, and A.J. Wheeler. 2007. Further interpretation of the acute effect of nitrogen dioxide observed in Canadian time-series studies. J. Expos. Sci. Environ. Epidemiol. Suppl. 2:S36–44. doi:10.1038/sj.jes.7500626
  • Burnett R.T., D. Stieb, J.R. Brook, S. Cakmak, R. Dales, M. Raizenne, R. Vincent, and T. Dann. 2004. Associations between short-term changes in nitrogen dioxide and mortality in Canadian cities. Arch. Environ. Health. 59:228–36. doi:10.3200/AEOH.59.5.228-236
  • Chow, J.C. 2010. Multipollutant air quality management. Introduction. J. Air Waste Manage. Assoc. 60(6):642–44. doi:10.3155/1047-3289.60.6.642
  • Dominici F., A. McDermott, and T.J. Hastie. 2004. Improved semi-parametric time series models of air pollution and mortality. J. Am. Stat. Assoc. 468:938–48. doi:10.1198/016214504000000656
  • Dominici, F., R.D. Peng, C.D. Barr, and M.L. Bell. 2010. Protecting human health from air pollution: Shifting from a single-pollutant to a multipollutant approach. Epidemiology 21:187–94. doi:10.1097/EDE.0b013e3181cc86e8
  • Costa, D.L., and K.L. Dreher. 1997. Bioavailable transition metals in particulate matter mediate cardiopulmonary injury in healthy and compromised animal models. Environ. Health Perspect. Suppl. 105:1053–60. doi:10.2307/3433509
  • Ghio, A.J., J. Stoneheurner, J.K. McGee, and J.S. Kinsey. 1999. Sulfate content correlates with iron concentrations in ambient air pollution particles. Inhal. Toxicol. 11:293–307.
  • Greenbaum, D., and R. Shaikh. 2010. First steps toward multipollutant science for air quality decisions. Epidemiology 21:195–97. doi:10.1097/EDE.0b013e3181ccc52a
  • Gwynn, R.C., R.T. Burnett, and G.D. Thurston. 2000. A time-series analysis of acidic particulate matter and daily mortality and morbidity in the Buffalo, New York, region. Environ. Health Perspect. 108(2):125–33. doi:10.1289/ehp.00108125
  • Hidy, G.M., and W.T. Pennell. 2010. Multipollutant air quality management. J. Air Waste Manage. Assoc. 60(6):645–74. doi:10.3155/1047-3289.60.6.645
  • Hopke, P.K., K. Ito, T. Mar, W.F. Christensen, D.J. Eatough, R.C. Henry, E. Kim, F. Laden, R. Lall, T.V. Larson, H. Liu, L. Neas, J. Pinto, M. Stolzel, H. Suh, P. Paatero, and G.D. Thurston. 2006. PM source apportionment and health effects: 1. Intercomparison of source apportionment results. J. Expos. Sci. Environ. Epidemiol. 16:275–86. doi:10.1038/sj.jea.7500458
  • Ito K., G.D. Thurston, and R.A. Silverman. 2007. Characterization of PM2.5, gaseous pollutants, and meteorological interactions in the context of time-series health effects models. J. Expos. Sci. Environ. Epidemiol. Suppl. 2:S45–60. doi:10.1038/sj.jes.7500627
  • Ito, K., Z. Ross, J. Zhou, A. Nádas, M. Lippmann, and G.D. Thurston. 2013. NPACT Study 3. Time-series analysis of mortality, hospitalizations, and ambient PM2.5 and its components. In National Particle Component Toxicity (NPACT) Initiative: Integrated Epidemiologic and Toxicologic Studies of the Health Effects of Particulate Matter Components, ed. M. Lippmann, L.C. Chen, T. Gordon, K. Ito, and G.D. Thurston. Health Effects Institute Report 177. www.healtheffects.org.
  • Laden, F., L.M. Neas, D.W. Dockery, and J. Schwartz. 2000. Association of fine particulate matter from different sources with daily mortality in six U.S. cities. Environ. Health. Perspect. 108:941–47. doi:10.2307/3435052
  • Lippmann, M., K. Ito, A. Nádas, and R.T. Burnett. 2000. Association of particulate matter components with daily mortality and morbidity in urban populations. Res. Rep. Health Effects Inst. 95:5–72.
  • Mar T.F., G.A. Norris, J.Q. Koenig, and T.V. Larson. 2000. Associations between air pollution and mortality in Phoenix, 1995–1997. Environ. Health. Perspect. 108:347–53. doi:10.2307/3454354
  • Mar T.F., G.A. Norris, T.V. Larson, W.E. Wilson, and J.Q. Koenig. 2003. Air pollution and cardiovascular mortality in Phoenix, 1995–1997. In Revised Analyses of Time-Series Studies of Air Pollution and Health. Special Report, 177–182. Boston, MA: Health Effects Institute. http:// www.healtheffects.org/Pubs/TimeSeries.pdf (accessed October, 2004).
  • Mar T.F., K. Ito, J.Q. Koenig, T.V. Larson, D.J. Eatough, R.C. Henry, E. Kim, F. Laden, R. Lall, L. Neas, M. Stolzel, P. Paatero, P.K. Hopke, and G.D. Thurston. 2006. PM source apportionment and health effects: 3. Investigation of inter-method variations in associations between estimated source contributions of PM25 and daily mortality in Phoenix, AZ. J. Expos. Sci. Environ. Epidemiology 16:311–20. doi:10.1038/sj.jea.7500465
  • Mauderly, J.L., R.T. Burnett, M. Castillejos, H. Ozkaynak, J.M. Samet, D.M. Stieb, S. Vedal, and R.E. Wyzga. 2010. Is the air pollution health research community prepared to support a multipollutant air quality management framework? Inhal. Toxicol. 22:1–19. doi:10.3109/08958371003793846
  • Mozaffarian, D. 2009. Fish, mercury, selenium and cardiovascular risk: Current evidence and unanswered questions. Int. J. Environ. Res. Public Health. 6:1894–916. doi:10.3390/ijerph6061894
  • National Research Council. 2004. Research Priorities For Airborne Particulate Matter: IV. Continuing Research Progress. Washington, DC: National Academy Press.
  • R Core Team. 2012. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. http://www.R-project.org
  • Roberts, S. 2006. A new model for investigating the mortality effects of multiple air pollutants in air pollution mortality time-series studies. J. Toxicol. Environ. Health A 69(6): 417–35. doi:10.1080/15287390500246761
  • Steyer, R., and T. Schmitt. 1994. The theory of confounding and its application in casual modeling with latent variables. In Latent Variable Analysis, ed. A. Von Eye and C.C. Clogg, chap. 2. Thousand Oaks, CA: Sage.
  • Thurston, G.D., K. Ito, T. Mar, W.F. Christensen, D.J. Eatough, R.C. Henry, E. Kim, F. Laden, R. Lall, T.V. Larson, et al. 2005. Workgroup report: Workshop on source apportionment of particulate matter health effects—Intercomparison of results and implications. Environ. Health Perspect. 113:1768–74. doi:10.1289/ehp.7989
  • Thurston, G.D., K. Ito, and R. Lall. 2011. A source apportionment of U.S. fine particulate matter air pollution. Atmos. Environ. (1994) 45(24):3924–36. doi:10.1016/j.atmosenv.2011.04.070
  • Thurston, G.D., K. Ito, R. Lall, R.T. Burnett, M.C. Turner, D. Krewski, Y. Shi, M. Jerrett, S.M. Gapstur, W.R. Diver, and C.A. Pope. 2013. NPACT Study 4. Mortality and long-term exposure to PM2.5 and its components in the American Cancer Society’s Cancer Prevention Study II Cohort. Appendix I. Total risk index measures to assess effects of multiple particulate and gaseous air pollutants. In National Particle Component Toxicity (NPACT) Initiative: Integrated Epidemiologic and Toxicologic Studies of the Health Effects of Particulate Matter Components, ed. M. Lippmann, L.C. Chen, T. Gordon, K. Ito, and G.D. Thurston. Health Effects Institute Report 177. www.healtheffects.org; Appendix I available at http://pubs.healtheffects.org/getfile.php?u=943 ( accessed March, 2015).
  • Tolbert, P.E., M. Klein, J.L. Peel, S.E. Sarnat, and J.A. Sarnat. 2007. Multipollutant modeling issues in a study of ambient air quality and emergency department visits in Atlanta. J. Expos. Sci. Environ. Epidemiol. Suppl. 2:S29–35. doi:10.1038/sj.jes.7500625
  • U.S. Environmental Protection Agency. 1998. Data report: NERL PM research monitoring platforms, Baltimore, Fresno, and Phoenix. Research Triangle Park, NC: U.S. Environmental Protection Agency, National Exposure Research Laboratory, report EPA/600/R-98/138. Available from NTIS, Springfield, VA.
  • U.S. Environmental Protection Agency. 2009. Integrated Science Assessment for Particulate Matter (Final Report). Washington, DC: U.S. Environmental Protection Agency, EPA/600/R-08/139F.
  • Vedal, S., and J.D. Kaufman. 2011. What does multi-pollutant air pollution research mean? Am. J. Respir. Crit. Care Med. 183(1): 4–6. doi:10.1164/rccm.201009-1520ED
  • Wilson, W.E., T.F. Mar, and J.Q. Koenig. 2007. Influence of exposure error and effect modification by socioeconomic status on the association of acute cardiovascular mortality with particulate matter in Phoenix. J. Expos. Sci. Environ. Epidemiol. 17:S11–19. doi:10.1038/sj.jes.7500620

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.