803
Views
6
CrossRef citations to date
0
Altmetric
Technical Papers

The accuracy of two- and three-way positive matrix factorization models: Applying simulated multisite data sets

, , , , , , & show all
Pages 1122-1129 | Received 06 Mar 2014, Accepted 15 May 2014, Published online: 16 Sep 2014

Abstract

The application of three-way data sets (combined multisite data sets) for source apportionment has become common, but its influence on the performance of receptor modeling techniques has not yet been explored systematically. To study the influence of site-to-site correlations of source contributions and the spatial variability of source profiles on two- and three-way positive matrix factorization (PMF), simulated three-way data sets were constructed and modeled by different applications of PMF (PMF2 for each site individually, PMF2 for data sets combining all sites together, and PMF3 for all sites). In addition, the performance of PMF was evaluated under conditions of collinearity and different source categories at two sites. The results indicated that if the sites were contributed by sources with identical profiles, the site-to-site correlations of source contributions would not influence the PMF2, and the three-way blocks could be used by PMF2. However, the PMF2 using three-way data sets was sensitive to the spatial variability of source profiles. For the three-way model, PMF3 could perform well only when all of the sources exhibited strong site-to-site associations among all sites, and at the same time, the spatial variability of source profiles were sufficiently small. It might due to the algorithm that, for each source, PMF3 produces the same source profile and the same temporal variation in daily contributions among all sites.

Implications: The application of multisite data sets for source apportionment has become common. However, limited work investigated the accuracy of two- and three-way PMFs when using multisite data sets. If the application of PMFs using multisite data sets were not appropriate, the results would be unreasonable. The unreasonable results would supply confused information for PM control strategies. In this work, simulated multisite data sets were modeled by different applications of PMFs. The effort to assess and compare the performance of two- and three-way PMFs using multisite data sets is very limited. The findings could provide information for multisite source apportionment.

Introduction

Atmospheric particulate matter (PM) has been considered one of the major pollutants in many countries (Zheng et al., Citation2007; Shi et al., Citation2011a; Shen et al., Citation2012; Gu et al., Citation2013; Pokorna et al., Citation2013; Zhao et al., Citation2013a; Titos et al., Citation2014). Understanding the sources of PM is of great significance for policymakers to design effective programs and strategies for the control and reduction of PM (Zheng et al., Citation2011; Shen et al., Citation2014). Receptor models are successfully applied in different scientific fields, especially when it comes to source apportionment of atmospheric particulate matter (PM) (Russell and Brunekreef, Citation2009; Lowenthal et al., Citation2012; Zhao et al., Citation2013b). Receptor-based source apportionment techniques have become powerful tools to understand the sources of PM (Kelly et al., Citation2013). Positive matrix factorization (PMF), one of the receptor models, is a factor analysis method for the modeling of environmental data sets, constraining factor loadings and scores to nonnegative values (Paatero and Tapper, Citation1994; Watson and Chow, Citation2001; Ofosu et al., Citation2013). It is sanctioned by the U.S. Environmental Protection Agency (EPA) and has frequently been employed in different fields of scientific research throughout the world (Paatero, Citation1999; Shi et al., Citation2009a; Chan et al., Citation2011; Green et al., Citation2012). In most studies applying two-way PMF (PMF2), numerous ambient samples obtained from a single sampling site are introduced into the PMF model and the input data set is generally characterized by two dimensions (temporal variability and chemical species). PMF2, based on a two-way factor analysis method (Paatero and Tapper, Citation1994; Paatero, Citation2007; Chen et al., Citation2010), extracts source profiles and quantifies contributions based on the temporal variation of chemical species because species from the same source have the same temporal variation (Pant and Harrison, Citation2012).

With the increasing number of monitoring sites organized for air quality monitoring networks all over the world, three-way blocks (i.e., multisite data sets) would be produced that assess chemical species, temporal variability, and spatial variability. In several studies (Escrig et al., Citation2009; Beuck et al., Citation2011; Mooibroek et al., Citation2011; Larsen et al., Citation2012; Minguillón et al., Citation2012), data sets combining data from multiple sites into a single input matrix (three-way blocks) have been introduced into PMF2 to enhance the number of receptor samples (Bruinen de Bruin et al., Citation2006; Beuck et al., Citation2011) and the representativeness of estimating the source contributions of PM at a regional scale (Xie et al., Citation2012; Keuken et al., Citation2013). In this context, a three-way factor analysis model, PARAFAC (Parallel Factor Analysis), which was developed to solve problems associated with three-way blocks, shows its suitability in the source apportionment of multiple sites (Paatero, Citation1999; Faber et al., Citation2003; Abdollahi et al., Citation2010). PMF3 (three-way PMF) is an algorithm commonly employed to solve three-way PARAFAC (Paatero, Citation1997, Citation2007).

Although the application of the three-way data set has become common, some associated problems have not yet been explored directly and systematically. Many studies have investigated single-site source apportionment in terms of the collinearity of source profiles and the source-to-source correlation of contributions (Watson et al., Citation2008; Shi et al., Citation2009b; Habre et al., Citation2011). However, for three-way source apportionment, two unique problems must be faced. First, for one source category, the profiles may vary from site to site in the real atmospheric system (Kim et al., Citation2005), but the PMF2 and PMF3 using the three-way data set produce only one extracted profile among different sites, inconsistent with the usual condition. Second, an assumption of PMF3 is that the source emission patterns should be the same for all sites (Zeng and Hopke, Citation1992). However, in the real world, the contributions of one source category at different sites may exhibit different temporal variation (Hwang et al., Citation2008). Thus, the site-to-site correlations of daily contributions for each source may also influence the solution of the PMF when applying three-way blocks. However, the impact of the aforementioned correlations of the daily source contributions and the spatial variability of source profiles on three-way source apportionment has not been clearly understood, and the problems just mentioned tend to be ignored by environmental scientists when they use multisite data sets. Due to the presence of these two problems, we should know whether the three-way data set used in the PMF is appropriate; if not, the results of the PMF should be rejected (Öztürk et al., Citation2012). Importantly, the spurious results in such cases would provide inconsistent information to policymakers attempting to devise PM control strategies.

Thus, for the purpose of studying different performances of PMF when running with three-way data sets, simulated multisite data sets were constructed and modeled by different applications of two- and three-way PMF. Our goal was to evaluate how the two problems already mentioned would impact the performance of two- and three-way PMF according to their respective algorithms. In addition, the performance of PMF was explored when collinearity and different source categories at two sites occurred. Although previous works have used synthetic data sets to assess the accuracy of PMF results (Brinkman et al., Citation2006; Henry and Christensen, Citation2010; Habre et al., Citation2011), few studies have focused on the difference between two- and three-way PMF. To our knowledge, there is very limited work that has conducted such an assessment for three-way data sets and to compare the performance of two- and three-way PMF using multisite data sets. The findings in our work could provide valuable information for the application of two- and three-way PMF when using multisite data set.

Methodology

PMF2 and PMF3

The PMF2 attempts to apportion the source profile matrix F and the source contribution matrix G based on observation matrix X at the receptor site (Paatero and Tapper, Citation1994; Paatero, Citation2007):

1
where E is the matrix of residuals.

The three-way PMF algorithm (PMF3) can be best described in the form of the following matrix:

2
where X is also the concentration matrix at the receptor site, A relates to the source contributions, B relates to the source profiles, and C is the fractional contribution of each site. PMF3 was first presented by Paatero (Citation1997), but the application of PMF3 has been relatively infrequent. Detailed descriptions of PMF2 and PMF3 are available in Paatero (Citation2007). The principles of two- and three-way PMF and the parameters used in this work are supplied in the Supplemental Materials.

Synthetic data-set development

The site-to-site relationship of both source contributions and source profiles might influence the performance of two- and three-way PMF. Understanding the appropriateness of applying these PMF methods in different cases is imperative. Thus, two types of synthetic data sets were developed in this section: (1) data sets used to analyze site-to-site correlations of daily source contributions between different sites and (2) data sets used to investigate the influence of spatial variability in source profiles. Considering the feasibility of experiments and following prior studies (Javitz et al., Citation1988; Habre et al., Citation2011), the collinearity of source profiles and the source-to-source correlations between source contributions at a single site were not taken into account in this section, and the results of related tests are provided in the Supplemental Materials.

Soil dust (SDUST), ammonium nitrate (AMNITR), and light-duty gasoline vehicles (LDGV) were selected as three ambient particle sources and their source profiles are referred to Marmur et al. (Citation2005) and Shi et al. (Citation2011b). The condition index (CI) and the variance-decomposition proposition (VDP) of these sources were introduced to investigate collinearity (Belsley et al., Citation2005). As listed in Table S1, only one CI value was greater than 2; however, for each CI, only one VDP exceeded 0.5. Thus, no near collinearity problem is detected in the source profile matrix used in this section (Shi et al., Citation2009b). The source profiles and CI values are listed in Table S1 in detail. Furthermore, the source-to-source correlations between contributions at each site were set to zero. Two sites, SI and SII, were investigated. At each site, the receptor concentration matrix was defined as the following:

3
where Gm×p was the source contribution matrix (m was the number of daily samples and was 300 in this study; p was the number of sources, which was 3 for each site); Fp×n was the source profile matrix (n was the number of species, which was 22 here); and Xm×n was the concentration matrix. The receptor concentrations were randomly perturbed to simulate analytical and measurement errors. Following Javitz et al. (Citation1988), the coefficient of variation (CV), defined as the standard deviation of the species concentration divided by its mean, was used to perturb receptor concentrations. A CV of 10% was used in all of the concentration matrices, and random uncertainties were generated from a log-normal distribution. The contributions of each source at both sites were simulated from a normal distribution with the set mean and standard deviations, which are summarized in Table S1. For the method used to construct simulated data sets here, refer to Habre et al. (Citation2011).

To investigate the site-to-site correlations of daily source contributions, and in consideration of both the representative and the feasibility, seven scenarios were investigated: (1) for each source, the site-to-site correlations of daily contributions between two sites were set to zero (case 1); (2) one source exhibited correlations between two sites, and the correlations were set to 0.3, 0.6,and 0.9 (cases 2–4) and (3) three sources exhibited site-to-site correlations between two sites, and the correlations were set to 0.3, 0.6, and 0.9 (cases 5–7). The correlations were selected based on Habre et al. (Citation2011). The detailed R matrixes (the matrixes of site-to-site correlation coefficients) for each case are listed in Table S2 (Supplementary Materials). The R matrix was denoted as the following:

4
where CORREL is the symbol of correlation coefficients; gijSI was the simulated contribution of the jth source to the ith sample at site SI; and gijSII was the simulated contribution of the jth source to the ith sample at site SII. For each scenario, 10 simulated data sets were generated, resulting in a total of 70 data sets.

To investigate whether the spatial variability of source profiles influences PMF, synthetic data sets of another type were developed. Based on the seven scenarios already described, one scenario was chosen for this study. The same method for perturbation was used as described previously. Different CVs for source profiles were introduced to change the spatial variability of source profiles. The CVs were set to 10%, 20%, 30%, 40%, and 50% to increase the spatial variability of source profiles. For each CV, 10 data sets were generated, producing a total of 50 data sets.

Data analysis

In this study, three applications of PMF were investigated: (1) PMF2 was individually applied to the data sets of each site (called PMF2-I); (2) PMF2 was applied on the data sets that combined the two sites together (PMF2-T); and (3) PMF3 was applied to the data sets that consisted of the two sites (PMF3).

Two indicators—the correlation coefficients (S and P) between the simulated (true) and estimated daily contributions and the average absolute error (AAE)—were used to evaluate the results of the PMF. Two kinds of correlation coefficients were calculated by SPSS 16.0 and were denoted as follows:

5
6
where S is the Spearman correlation coefficient and P is the Pearson correlation coefficient. To investigate a comprehensive condition, the S and P of each site used in the following discussion are defined as the sum of correlation coefficients for all of the sources and the total concentrations (TC) of PM (sum of contributions from the three sources) at one site.

The AAE was defined as follows (Javitz et al., Citation1988):

7
where n was the number of samples (300 in this work); Ei was the estimated contribution of a source at one site; and Ti was the simulated contribution (namely, true contribution) that corresponded to Ei.

Results and Discussion

Site-to-site correlations of source contributions

To investigate the site-to-site correlations of daily source contributions, 70 simulated data sets were computed using PMF2-I, PMF2-T, and PMF3. All simulated cases converged in each method of PMF. The mean estimated contributions for each source, the total concentration (TC) of PM (sum of contributions from the tree sources), and the standard deviations for the mean contributions of the 70 data sets estimated by the three PMF applications at both sites are summarized in Table S3 and Figure S1. The results of TC were consistent with the true mean TC, and the standard deviations were relatively small for all the PMF, ranging from 0.49 to 1.73. However, for the individual sources at different sites, the results of PMF2-I, PMF2-T, and PMF3 were different. The results using PMF3 were rather divergent and had high standard deviations. The results of PMF2-I and PMF2-T were consistent with the simulated data, and the standard deviations were low. These results indicated that the three-way data set could be used by PMF2 when the sites were affected by sources with identical profiles.

To evaluate whether the PMF methods could function well in cases with different site-to-site correlations of daily source contributions and to investigate the influence of individual cases on PMF2-I, PMF2-T, and PMF3, we calculated the Pearson and Spearman correlation coefficients and the AAEs between estimated contributions obtained by different PMF methods and corresponding true daily contributions. The P and S correlation coefficients of each site are shown in and Figure S2, respectively. shows that in all cases, the Pearson correlation coefficients of PMF2-I and PMF2-T approximately equaled 4, indicating that the estimated contributions of PMF2-I and PMF2-T were consistent with the true contributions, and both methods performed well in all cases. However, the results of PMF3 were obviously unstable. In most cases, the estimated contributions of PMF3 diverged from the true values. Notably, the PMF3 performed well in case 7, where the site-to-site correlations of daily contributions for each source between two sites were 0.9. From the case 5 to case 7, the results showed a trend toward convergence. The Spearman correlation coefficients showed consistent results.

Figure 1. The P (Pearson correlation between true and estimated daily contributions) of each site (defined as the sum of P for all the sources and TC at one site).

Figure 1. The P (Pearson correlation between true and estimated daily contributions) of each site (defined as the sum of P for all the sources and TC at one site).

Furthermore, a three-dimensional heat map of AAEs is shown in to further demonstrate the performance of PMF for each source in the different cases. The results of this heat map were in good agreement with results of correlation coefficients. According to the heat map, it can be found that the performances of PMF2-I and PMF2-T were stable for each source at both sites for all of the site-to-site correlations. The AAEs of PMF3 are worth noting: Significant variability can be observed. PMF3 performed relatively better in cases 6 and 7, while poor results were derived in cases 1–5. These results might suggest that the site-to-site correlations of source contributions would have a strong influence on the performance of PMF3, wherein PMF3 could perform well only when all sources exhibited strong site-to-site associations between both of the investigated sites. In addition, in case 4, relatively low AAEs between the estimated and true contributions of SDUSTSI and SDUSTSII were observed, which might resulted from high site-to-site correlations that equaled 0.9 for daily contributions of SDUST at the two sites (Table S2).

Figure 2. Three-dimensional heat map of AAEs (%) between true and estimated daily contributions with different site-to-site correlations of daily source contributions by PMF2-I, PMF2-T, and PMF3.

Figure 2. Three-dimensional heat map of AAEs (%) between true and estimated daily contributions with different site-to-site correlations of daily source contributions by PMF2-I, PMF2-T, and PMF3.

These results can be explained by the different principles underlying PMF2 and PMF3. According to eq 2, PMF3 extracts only one Matrix A (related to source contributions) for all sites; that is, for each source, the site-to-site correlations of contributions should be unity. Therefore, PMF3 could perform well only if all sources exhibited strong site-to-site correlations among all investigated sites. In contrast, spatial variation was not considered in the solution of PMF2 ( eq 1). Thus, site-to-site correlations of source contributions do not influence the results of PMF2.

Spatial variability of source profiles

As discussed with and earlier, the site-to-site correlations of daily source contributions strongly influenced the performance of PMF3, but only weakly influenced PMF2-I and PMF2-T. In addition, the spatial variability of source profiles at different sites might also affect the performance of the PMF methods. To study the influence of spatial variability of source profiles, PMF2-I, PMF2-T, and PMF3 were calculated using different CVs of source profiles, based on case 7 where all of the PMF methods provided consistent results.

The CVs of source profiles were set to 10%, 20%, 30%, 40%, and 50%; 50 data sets were computed using PMF2-I, PMF2-T, and PMF3. The P and S correlation coefficients between estimated and simulated contributions at two sites with different CVs are exhibited in Figure S3, for PMF2-I, PMF2-T, and PMF3, respectively. Figure S3 indicates that variation in source profiles played an important role in the performance of PMF2-T and PMF3, but weakly influenced PMF2-I. The performance of PMF2-T and PMF3 declined when the CVs of source profiles were high. The mean AAEs of each source at two sites with different spatial variability of source profiles are shown in . A trend of increasing AAEs with increasing CVs was observed for PMF2-T and PMF3, while low, stable AAEs were observed for PMF2-I, which was consistent with the results of P and S correlation coefficients.

Table 1. Mean AAEs (%) of each source category at site SI and SII under different spatial variability of source profiles, estimated by PMF2-I, PMF2-T, and PMF3

To further investigate the source profiles and contributions estimated by different PMF applications, a case study (two synthetic tests with CVs of 10% and 30%, respectively) was carried out. The normalized simulated (true) and estimated source profiles of SDUST for the two experiments are shown in . For CV = 10%, the simulated source profiles showed little spatial variability for different sites. Source profiles estimated by PMF2-I, PMF2-T, and PMF3 showed similar trends with the simulated profiles, although the source profiles calculated by PMF2-T and PMF3 were similar for different sites. However, for CV = 30%, high spatial variability was observed among the true source profiles. Under this condition, the results of PMF2-I resembled the simulated profiles, while the performance of PMF2-T and PMF3 was poor, which might be because they can only estimate the same source profiles for different sites whose source profiles had high spatial variability. Figure S4 shows the true and estimated source contributions of SDUST for two sites, which were calculated using PMF, for which CV = 10% and 30% (Supplemental Materials). The performance of PMF was in agreement with the preceding discussion. For CV = 10%, the estimated contributions calculated using all PMF methods were consistent with the simulated contributions; however, differences were observed for CV = 30%.

Figure 3. Normalized simulated and estimated source profiles of SDUST for site SI and SII by PMF2-I, PMF2-T, and PMF3.

Figure 3. Normalized simulated and estimated source profiles of SDUST for site SI and SII by PMF2-I, PMF2-T, and PMF3.

Because PMF2-I was run independently among sites, it was not surprising to observe that spatial variability in the source files weakly influenced PMF2-I. However, eqs 1 and 2 predicted that PMF2-T and PMF3 would produce the same source profiles (matrix F in eq 1 and matrix B in eq 2), so spatial variability in the source profiles significantly influenced PMF2-T and PMF3. In conclusion, if the spatial variability of source profiles was high, PMF2-T and PMF3 could not work well, and only PMF2 for each site individually could be used reliably. Thus, the results in this study indicated that PMF2 and PMF3 would not perform well and would be inconsistent with empirical data if the spatial variability of source profiles were too large, even though the same common source categories were permitted, resolving for different sites by individual analyses.

In addition, scenarios of collinearity in the source profiles and source contributions and different source categories at two sites were explored. The results and discussion are provided in the Supplemental Materials, as shown in Tables S4–S8 and Figures S5 and S6.

Conclusion

Different applications of two- and three-way PMF showed that both the site-to-site correlations of daily source contributions and the spatial variability of source profiles would influence the performance of different applications of PMF, along with the accuracy of the estimated results. For PMF2, if the investigated sites were contributed by sources with identical profiles, it would not be influenced by the site-to-site correlations of source contributions, so PMF2 could perform well using a three-way data set that contained the same source profiles. However, PMF2 was sensitive to the spatial variability of source profiles. When the spatial variability was high, PMF2 using the three-way data set could not function well. For the three-way model, the application of PMF3 was limited by both the site-to-site correlations of daily source contributions and the spatial variability of source profiles. PMF3 could perform well only when all of the sources exhibited strong site-to-site associations among all of the investigated sites and simultaneously, the spatial variability of source profiles was small. These conclusions may provide valuable information to environmental scientists when they carry out multisite source apportionment.

Funding

This study is supported by the National Natural Science Foundation of China (21207070 and 41375132), Special Funds for Research on Public Welfare of the Ministry of Environmental Protection of China (201409003), and the Combined Laboratory of the Tianjin Meteorological Bureau.

Supplemental Material

Supplemental data for this article can be accessed at http://dx.doi.org/10.1080/10962247.2014.926300.

Supplemental material

Supplemental_Material.doc

Download MS Word (454.5 KB)

Additional information

Notes on contributors

Ying-Ze Tian

Ying-Ze Tian, Guo-Liang Shi, Xiao-Yu Zhou, Jiao Wang, and Yin-Chang Feng are with the State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, at Nankai University, Tianjin, China.

Guo-Liang Shi

Ying-Ze Tian, Guo-Liang Shi, Xiao-Yu Zhou, Jiao Wang, and Yin-Chang Feng are with the State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, at Nankai University, Tianjin, China.

Bo Han

Bo Han and Wei Wang are with the College of Software, Nankai University, Tianjin, China.

Wei Wang

Bo Han and Wei Wang are with the College of Software, Nankai University, Tianjin, China.

Xiao-Yu Zhou

Ying-Ze Tian, Guo-Liang Shi, Xiao-Yu Zhou, Jiao Wang, and Yin-Chang Feng are with the State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, at Nankai University, Tianjin, China.

Jiao Wang

Ying-Ze Tian, Guo-Liang Shi, Xiao-Yu Zhou, Jiao Wang, and Yin-Chang Feng are with the State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, at Nankai University, Tianjin, China.

Xiang Li

Xiang Li is with the Department of Computer Science, University of Georgia, Athens, GA, USA.

Yin-Chang Feng

Ying-Ze Tian, Guo-Liang Shi, Xiao-Yu Zhou, Jiao Wang, and Yin-Chang Feng are with the State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, at Nankai University, Tianjin, China.

References

  • Abdollahi, H., and S.M. Sajjadi. 2010. On rotational ambiguity in parallel factor analysis. Chemom. Intell. Lab. Syst. 103:144–51. doi:10.1016/j.chemolab.2010.06.010
  • Belsley, D.A., E. Kuh, and R.E. Welsch. 2005. Regression Diagnostics: Identifying Influential Data and Sources of Collinearity. New York, NY: John Wiley & Sons.
  • Beuck, H., U. Quass, O. Klemm, and T.A.J. Kuhlbusch. 2011. Assessment of sea salt and mineral dust contributions to PM10 in NW Germany using tracer models and positive matrix factorization. Atmos. Environ. 45:5813–21. doi:10.1016/j.atmosenv.2011.07.010
  • Brinkman, G., G. Vance, M.P. Hannigan, and J.B. Milford. 2006. Use of synthetic data to evaluate positive matrix factorization as a source apportionment tool for PM2.5 exposure data. Environ. Sci. Technol. 40:1892–901. doi:10.1021/es051712y
  • Bruinen de Bruin, Y., K. Koistinen, T. Yli-Tuomi, S. Kephalopoulos, and M. Jantunen. 2006. Source apportionment techniques and marker substances available for identification of personal exposure, indoor and outdoor sources of chemicals. EUR 22349 EN, Joint Research Center JRC, Ispra, Italy. ISBN 92-79-02981-9.
  • Chan, Y.C., O. Hawas, D. Hawker, P. Vowles, D.D. Cohen, E. Stelcer, R. Simpson, G. Golding, and E. Christensen. 2011. Using multiple type composition data and wind data in PMF analysis to apportion and locate sources of air pollutants. Atmos. Environ. 45:439–49. doi:10.1016/j.atmosenv.2010.09.060
  • Chen, L.W.A., D.H. Lowenthal, J.G. Watson, D. Koracin, N. Kumar, E.M. Knipping, N. Wheeler, K. Craig, and S. Reid. 2010. Toward effective source apportionment using positive matrix factorization: Experiments with simulated PM2.5 data. J. Air Waste Manage Assoc. 60:43–54. doi:10.3155/1047-3289.60.1.43
  • Escrig, A., E. Monfort, I. Celades, X. Querol, F. Amato, M.C. Minguillón, and P.K. Hopke. 2009. Application of optimally scaled target factor analysis for assessing source contribution of ambient PM10. J. Air Waste Manage Assoc. 59:1296–307. doi:10.3155/1047-3289.59.11.1296
  • Faber, N.M., R. Bro, and P.K. Hopke. 2003. Recent developments in CANDECOMP/PARAFAC algorithms: A critical review. Chemom. Intell. Lab. Syst. 65:119–37. doi:10.1016/S0169-7439(02)00089-8
  • Green, M.C., L.W.A. Chen, D.W. DuBois, and J.V. Molenar. 2012. Fine particulate matter and visibility in the Lake Tahoe Basin: Chemical characterization, trends, and source apportionment. J. Air Waste Manage Assoc. 62: 53–65. doi:10.1080/10962247.2012.690362
  • Gu, J., J. Schnelle-Kreis, M. Pitz, J. Diemer, A. Reller, R. Zimmermann, J. Soentgen, A. Peters, and J. Cyrys. 2013. Spatial and temporal variability of PM10 sources in Augsburg, Germany. Atmos. Environ. 71:131–39. doi:10.1016/j.atmosenv.2013.01.043
  • Habre, R., B. Coull, and P. Koutrakis. 2011. Impact of source collinearity in simulated PM2.5 data on the PMF receptor model solution. Atmos. Environ. 45:6938–46. doi:10.1016/j.atmosenv.2011.09.034
  • Henry, R.C., and E.R. Christensen. 2010. Selecting an appropriate multivariate source apportionment model result. Environ. Sci. Technol. 44: 2474–2481. doi:10.1021/es9018095
  • Hwang, I., P.K. Hopke, and J.P. Pinto. 2008. Source apportionment and spatial distributions of coarse particles during the regional air pollution study. Environ. Sci. Technol. 42:3524–30. doi:10.1021/es0716204
  • Javitz, H.S., J.G. Watson, and N. Robinson. 1988. Performance of the chemical mass balance model with simulated local-scale aerosol. Atmos. Environ. 22:2309–22. doi:10.1016/0004-6981(88)90142-4
  • Kelly, K.E., R. Kotchenruther, R. Kuprov, and G.D. Silcox. 2013. Receptor model source attributions for Utah’s Salt Lake City airshed and the impacts of wintertime secondary ammonium nitrate and ammonium chloride aerosol. J. Air Waste Manage Assoc. 63:575–90. doi:10.1080/10962247.2013.774819
  • Keuken, M.P., M. Moerman, M. Voogt, M. Blom, E.P. Weijers, T. Röckmann, and U. Dusek. 2013. Source contributions to PM2.5 and PM10 at an urban background and a street location. Atmos. Environ. 71:26–35. doi:10.1016/j.atmosenv.2013.01.032
  • Kim, E., P.K. Hopke, J.P. Pinto, and W. Wilson. 2005. Spatial variability of fine particle mass, components, and source contributions during the regional air pollution study in St. Louis. Environ. Sci. Technol. 39:4172–79. doi:10.1021/es049824x
  • Larsen, B.R., S. Gilardoni, K. Stenström, J. Niedzialek, J. Jimenez, and C.A. Belis. 2012. Sources for PM air pollution in the Po Plain, Italy: II. Probabilistic uncertainty characterization and sensitivity analysis of secondary and primary sources. Atmos. Environ. 50:203–13. doi:10.1016/j.atmosenv.2011.12.038
  • Lowenthal, D.H., J.G. Watson, D. Koracin, L.W.A. Chen, D. Dubois, R. Vellore, N. Kumar, E.M. Kumar, N. Wheeler, and K. Craig. 2012. Evaluation of regional-scale receptor modeling. J. Air Waste Manage Assoc. 60:26–42. doi:10.3155/1047-3289.60.1.26
  • Marmur, A., A. Unal, J.A. Mulholland, and A.G. Russell. 2005. Optimization-based source apportionment of PM2.5 incorporating gas-to-particle ratios. Environ. Sci. Technol. 39:3245–54. doi:10.1021/es0490121
  • Minguillón, M.C., X. Querol, U. Baltensperger, and A.S.H. Prévôt. 2012. Fine and coarse PM composition and sources in rural and urban sites in Switzerland: Local or regional pollution. Sci. Total Environ. 427–28: 191–202. doi:10.1016/j.scitotenv.2012.04.030
  • Mooibroek, D., M. Schaap, E.P. Weijers, and R. Hoogerbrugge. 2011. Source apportionment and spatial variability of PM2.5 using measurements at five sites in the Netherlands. Atmos. Environ. 45:4180–91. doi:10.1016/j.atmosenv.2011.05.017
  • Ofosu, F.G., P.K. Hopke, I.J.K. Aboh, and S.A. Bamford. 2013. Biomass burning contribution to ambient air particulate levels at Navrongo in the Savannah zone of Ghana. J. Air Waste Manage Assoc. 63:1036–45. doi:10.1080/10962247.2013.783888
  • Öztürk, F., A. Zararsız, V.A. Dutkiewicz, L. Husain, P.K. Hopke, and G. Tuncel. 2012. Temporal variations and sources of Eastern Mediterranean aerosols based on a 9-year observation. Atmos. Environ. 61:463–75. doi:10.1016/j.atmosenv.2012.07.051
  • Paatero, P., and U. Tapper. 1994. Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values. Environmetrics 5:111–26. doi:10.1002/(ISSN)1099-095X
  • Paatero, P. 1997. A weighted non-negative least squares algorithm for three-way “PARAFAC’ factor analysis. Chemom. Intell. Lab. Syst. 38:223–42. doi:10.1016/S0169-7439(97)00031-2
  • Paatero, P. 1999. The Multilinear Engine—A table-driven least squares program for solving multilinear problems, including the n-way parallel factor analysis model. J. Comput. Graph. Stat. 8:854–88. doi:10.2307/1390831
  • Paatero, P. 2007. User’s Guide for Positive Matrix Factorization Programs PMF2 and PMF3, Part 1–2: Tutorial. Helsinki, Finland: University of Helsinki.
  • Pant, P., and R.M. Harrison. 2012. Critical review of receptor modelling for particulate matter: A case study of India. Atmos. Environ. 49:1–12. doi:10.1016/j.atmosenv.2011.11.060
  • Pokorna, P., J. Hovorka, J. Krouzek, and P.K. Hopke. 2013. Particulate matter source apportionment in a village situated in industrial region of Central Europe. J. Air Waste Manage Assoc. 63:1412–21. doi:10.1080/10962247.2013.825215
  • Russell, A.G., and B.A. Brunekreef. 2009. A focus on particulate matter and health. Environ. Sci. Technol. 43:4620–25. doi:10.1021/es9005459
  • Shen, G.F., S. Wei, W. Wei, Y. Zhang, Y. Min, B. Wang, R. Wang, W. Li, H. Shen, Y. Huang, Y. Yang, W. Wang, X. Wang, X. Wang, and S. Tao. 2012. Emission factors, size distributions, and emission inventories of carbonaceous particulate matter from residential wood combustion in rural China. Environ. Sci. Technol. 46:4207–14. doi:10.1021/es203957u
  • Shen, G.F., M. Xue, S.Y. Yuan, J. Zhang, Q.Y. Zaho, B. Li, H.S. Wu, and A.J. Ding. 2014. Chemical compositions and reconstructed light extinction coefficients of particulate matter in a mega-city in the western Yangtze River Delta, China. Atmos. Environ. 83:14–20. doi:10.1016/j.atmosenv.2013.10.055
  • Shi, G.L., X. Li, Y.C. Feng, Y.Q. Wang, J.H. Wu, J. Li, and T. Zhu. 2009a. Combined source apportionment, using positive matrix factorization-chemical mass balance and principal component analysis/multiple linear regression-chemical mass balance models. Atmos. Environ. 43:2929–37. doi:10.1016/j.atmosenv.2009.02.054
  • Shi, G.L., Y.C. Feng, G. Zeng, X. Li, Y.F. Zhang, Y.Q. Wang, and T. Zhu. 2009b. Use of a nonnegative constrained principal component regression chemical mass balance model to study the contributions of nearly collinear sources. Environ. Sci. Technol. 43:8867–73. doi:10.1021/es902785c
  • Shi, G.L., F. Zeng, X. Li, Y.C. Feng, Y.Q. Wang, G.X. Liu, and T. Zhu. 2011a. Estimated contributions and uncertainties of PCA/MLR-CMB results: Source apportionment for synthetic and ambient data sets. Atmos. Environ. 45:2811–19. doi:10.1016/j.atmosenv.2011.03.007
  • Shi, G.L., Y.Z. Tian, Y.F. Zhang, W.Y. Ye, X. Li, X.X. Tie, Y.C. Feng, and T. Zhu. 2011b. Estimation of the concentrations of primary and secondary organic carbon in ambient particulate matter: Application of the CMB-Iteration method. Atmos. Environ. 45:5692–98. doi:10.1016/j.atmosenv.2011.07.031
  • Titos, G., H. Lyamani, M. Pandolfi, A. Alastuey, and L. Alados-Arboledas. 2014. Identification of fine (PM1) and coarse (PM10-1) sources of particulate matter in an urban environment. Atmos. Environ. 89:593–602. doi:10.1016/j.atmosenv.2014.03.001
  • Watson, J.G., and J.C. Chow. 2001. Source characterization of major emission sources in the Imperial and Mexicali Valleys along the US–Mexico border. Sci. Total Environ. 276:33–47. doi:10.1016/S0048-9697(01)00770-7
  • Watson, J.G., L.W. Antony Chen, J.C. Chow, P. Doraiswamy, and D.H. Lowenthal. 2008. Source apportionment: Findings from the U.S. Supersites program. J. Air Waste Manage Assoc. 58:265–88. doi:10.3155/1047-3289.58.2.265
  • Xie, M., T.L. Coons, J.G. Hemann, S.J. Dutton, J.B. Milford, J.L. Peel, S.L. Miller, S.Y. Kim, S. Vedal, L. Sheppard, and M.P. Hannigan. 2012. Intra-urban spatial variability and uncertainty assessment of PM2.5 sources based on carbonaceous species. Atmos. Environ. 60:305–15. doi:10.1016/j.atmosenv.2012.06.036
  • Zeng, Y.S., and P.K. Hopke. 1992. A new receptor model: A direct trilinear decomposition followed by a matrix reconstruction. J. Chemom. 6:65–83. doi:10.1002/(ISSN)1099-128X
  • Zhao, X.J., P.S. Zhao, J. Xu, W. Meng, W.W. Pu, F. Dong, D. He, and Q.F. Shi. 2013a. Analysis of a winter regional haze event and its formation mechanism in the North China Plain. Atmos. Chem. Phys. 13:5685–96. doi:10.5194/acp-13-5685-2013
  • Zhao, P.S., F. Dong, Y.D. Yang, D. He, X.J. Zhao, W.Z. Zhamng, Q. Yao, and H.Y. Liu. 2013b. Characteristics of carbonaceous aerosol in the region of Beijing, Tianjin, and Hebei, China. Atmos. Environ. 71:389–98. doi:10.1016/j.atmosenv.2013.02.010
  • Zheng, M., G.R. Cass, L. Ke, F. Wang, J.J. Schauer, E.S. Edgerton, and A.G. Russell. 2007. Source apportionment of daily fine particulate matter at Jefferson Street, Atlanta, GA, during summer and winter. J. Air Waste Manage Assoc. 57:228–42. doi:10.1080/10473289.2007.10465322
  • Zheng, M., F. Wang, G.S.W. Hagler, X. Hou, M. Bergin, Y. Cheng, L.G. Salmon, J.J. Schauer, P.K.K. Louie, L. Zeng, and Y.H. Zhang. 2011. Sources of excess urban carbonaceous aerosol in the Pearl River Delta Region, China. Atmos. Environ. 45:1175–82. doi:10.1016/j.atmosenv.2010.09.041

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.