820
Views
10
CrossRef citations to date
0
Altmetric
Technical Paper

Compositional data analysis of smoke emissions from debris piles with low-density polyethylene

ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Pages 834-845 | Received 02 Apr 2020, Accepted 12 Jun 2020, Published online: 03 Aug 2020

ABSTRACT

Data describing the composition of smoke are inherently multivariate and always non-negative parts of a whole. The data are relative and the information is contained in the ratios between parts of the composition. A prior analysis of smoke emissions produced from the burning of manzanita wood mixed with low-density polyethylene plastic applied traditional statistical methods to the compositional data and found no effect. The current paper applies compositional data techniques to these smoke emissions to determine if the prior analysis was accurate. Analysis of variance of the isometric log-ratios showed that LDPE significantly affected the CO2 emission ratio for 8 of the 191 trace gases; this analysis showed none of the gases identified in the previous analysis were affected by LDPE. LDPE did not affect the CO2 emission ratios for the alkanes, alkenes, alkynes, aldehydes, cycloalkanes, cycloalkenes, diolefins, ketones, MAHs, and PAHs. Compositional data analysis should be used to analyze smoke emissions data. Burning contaminant-free LDPE should produce emissions like wood.

Implications

Reanalysis of impact of burning LDPE plastic in silvicultural debris piles using appropriate statistical techniques confirmed previously published results from inappropriate techniques that LDPE did not change the composition of the smoke emissions. Being able to dispose of these LDPE-covered forest debris by burning can save thousands of dollars in labor costs annually. Disposal of pesticide-free agricultural LDPE plastic by burning should only produce wood-like smoke emissions. This applies to LDPE/total mass ratios of 0.25– 2.5% as studied.

Introduction

Shrubs and small diameter trees exist in the understories of many western forests in the United States. They are important from an ecological perspective; however, this vegetation also presents a potential hazard as “ladder fuels” or as a heat source to damage the overstory during prescribed burns. Cutting and piling of this material to burn under safe conditions is a common silvicultural practice. To improve ignition success and flammability of the piled debris, polyethylene plastic (LDPE) is often used to cover a portion of the pile. While burning of piled forest debris is an acceptable practice in southern California from an air quality perspective, inclusion of LDPE in the piles changes these debris piles to rubbish piles (a legal term) which should not be burned. A review of available literature (Wrobel and Reinhardt Citation2003) suggested that emissions resulting from burning piles with LDPE would be similar to emissions from debris piles without LDPE. The literature review was based on bench-scale tests of LDPE pyrolysis and combustion. With support from the four National Forests in southern California, we conducted a laboratory experiment to determine if the presence of LDPE in a pile of burning wood changed the smoke emissions. Using conventional statistical analysis techniques, we concluded that the presence of LDPE did not have an effect on the emissions (Hosseini et al. Citation2014). A field study in Oregon examined emissions from burning debris piles and similarly reported no significant effect due to the presence of LDPE (Aurell et al. Citation2016). In our original analysis, the chemical compounds comprising the gaseous and particulate components of the smoke were analyzed as isolated variables containing separable information. In actuality, the individual parts are intrinsically interrelated and multicollinear, with their values being relative to each other.

Recently, Weise et al. (Citation2020) have shown that emissions data are compositional data because they describe the parts of some whole. They contain only relative information which is contained in ratios of the individual emissions. Emissions data are typically expressed in some form of ratio (emission factor, emission ratio, concentration, mole ratio, etc.). Furthermore, using a stoichiometric argument, modified combustion efficiency (MCE) was shown to be linearly linked to all other wildfire emissions thus making its use in linear regression as a predictor variable for other emissions problematic (Weise et al. Citation2020). Using correlation to describe relationships between parts of a composition can lead to spurious conclusions because of the relative nature of the data (Aitchison Citation1986; Lovell et al. Citation2015; Pearson Citation1896). A well-principled methodological body to analyze compositional data has been developed in the past 30 years and this is an active area of statistical research so we have chosen to apply it to fire emissions data. Interestingly, it has been successfully applied in varied fields but appears to have seldom been applied to combustion or emissions data (Bandeen-Roche Citation1994; Billheimer Citation2001; Buccianti and Pawlowsky-Glahn Citation2006; Speranza et al. Citation2018).

The relative nature of emissions data defines emissions data as compositional data which are coherently analyzed using Compositional Data Analysis (CoDA) (Aitchison Citation1986; Filzmoser, Hron, and Templ Citation2018; Pawlowsky-Glahn, Egozcue, and Tolosana-Delgado Citation2015; van den Boogaart and Tolosana-Delgado Citation2013). CoDA methodology has three underlying principles. The first is scale invariance – vectors with proportional positive components represent the same composition, and form what is known as an equivalence class. This means changing the units should not change relative relationships between the parts nor affect results and scientific conclusions. The second is that inferences about subcompositions, i.e. smaller compositions formed from subsets of parts, must not contradict the inferences from the full composition. The third principle states that the order of the parts of the composition must not affect the inferences. While the initial work on CoDA explicitly assumed in the definition of a composition that the parts sum (are closed) to a constant total, theory has developed to show that this is only a particular representation of the data in a simplex and equivalent non-closed compositions carry the same relative information (Barceló-Vidal and Martín-Fernández Citation2016). The field of compositional data analysis is a well-developed body of statistical methodology that provides models and methods equivalent to traditional ones yet accounts for these special constraining features of relative data. The approach has been used for decades to analyze analogous types of data in the geosciences and, more recently, in other disparate areas such as microbiome studies, animal green-house gases emissions or physical activity epidemiology (Chastin et al. Citation2015; Fernandes et al. Citation2014; Palarea-Albaladejo et al. Citation2017).

In CoDA today, in order to use familiar statistical techniques such as exploratory data analysis, linear regression, multivariate analysis of variance and other multivariate techniques, the mainstream approach is to transform the parts from the simplex to the real numbers space using isometric log-ratio (ilr) coordinates (Egozcue and Pawlowsky-Glahn Citation2005; Mateu-Figueras, Pawlowsky-Glahn, and Egozcue Citation2011; Pawlowsky-Glahn, Egozcue, and Tolosana-Delgado Citation2015; van den Boogaart and Tolosana-Delgado Citation2013). Software to perform compositional data analysis is available, including the stand-alone point-and-click CoDaPack package (Thió-Henestrosa and Comas Citation2016) (http://www.compositionaldata.com/codapack.php) and comprehensive libraries on the open-source R statistical computing system (R Core Team Citation2018): compositions (van den Boogaart and Tolosana-Delgado Citation2013), robCompositions (Templ, Hron, and Filzmoser Citation2011) and zCompositions (Palarea-Albaladejo, Martín-Fernández, and Buccianti Citation2014). This paper reports the reanalysis of the data previously reported (Hosseini et al. Citation2014) using compositional data analysis techniques that are consonant with the nature of the data in order to improve the scientific rigor of the previous conclusions.

Methods

Data set and original analysis

Mean values of the emissions data were presented originally (Hosseini et al. Citation2014). The original compositional data set consisting of a total of 9 observations composed of 191 compounds resulting from the LDPE study is available (Hosseini et al. Citation2018). The 9 observations resulted from a randomized block design consisting of three blocks (in time) each with one replication of three masses of LDPE (0, 5, 50 g) incorporated into 2 kg piles of manzanita (Arctostaphylos sp.) wood yielding mass ratios of LDPE to wood of 0%, 0.25% and 2.5%. We estimated the typical LDPE mass ratio in debris piles to be 0.1% (Hosseini et al. Citation2014). In the western U.S., debris piles are created by cutting woody shrubs and small trees present in the understory of forests to interrupt vertical fuel continuity to reduce the ability of a wildland fire to spread vertically from the ground into the elevated canopies of the trees (Weise, Cobian-Iñiguez, and Princevac Citation2018). The blocking was used to account for differences caused by changes in the ambient atmosphere in the lab facility which had no control of air temperature and relative humidity. A Horiba PG-250Footnote1 multigas analyzer sampled NOx, CO and CO2; carbonyls, volatile organic compounds (VOC), polyaromatic hydrocarbons (PAH), particulate matter (PM) mass and composition were collected using a variety of methods and filters. See Hosseini et al. (Citation2014) for a schematic of the experimental setup and detailed description of the specific measurements and calibrations for the PG-250 and the other methods. The quantity of each part (gasj) of the composition was expressed as an emission ratio with CO2 ERCO2gasj=ΔgasjΔCO2 where Δ is the mass of the gas above background (ambient) values (Delmas, Lacaux, and Brocard Citation1995); the original data units were g of emission per kg CO2.

Our original analysis of emission factors treated LDPE as a fixed effect (factor) and we performed both parametric (analysis of variance) and nonparametric (Friedman – based on ranks) tests to determine if LDPE significantly affected the emission ratio for each compound individually as well as the sum of the emission ratios for each class of compound (alkanes, diolefins, monocyclic aromatic hydrocarbons, PAHs, aldehydes, etc.). Recall that parametric tests assume that the data can be described by an underlying statistical distribution such as the normal or log-normal distribution with assumed parameters. Nonparametric tests either assume no underlying statistical distribution or assume a distribution with unspecified parameters (Conover Citation1999; Mason, Gunst, and Hess Citation1989; Mendenhall, Schaeaffer, and Wackerly Citation1981). We assumed agreement between the parametric and nonparametric approaches strengthened our conclusions. Modified combustion efficiency (MCE, Ward and Hao Citation1991) was included as a covariate to account for differences in the burning characteristics of each test which we have subsequently shown to be an inappropriate approach to analysis of emission factors since MCE is part of the composition of gases (Weise et al. Citation2020). Due to the large number of tests, the Bonferroni method was used to control type I error when comparing the means; that is, the false-positive error. Using multivariate analysis of variance (MANOVA) was not possible because the number of observations (9) was less than the number of gases (190), i.e. the rank of the matrix was not sufficient in order to estimate the effects (Mardia, Kent, and Bibby Citation1979). At the time of the original analysis, the authors were not aware of a nonparametric multivariate method (PERMANOVA) similar to MANOVA that does not require that the number of observations be greater than the number of gases (Anderson Citation2017). Analyzing each gas individually resulted in three statistical comparisons that found a significant difference between the LDPE treatment means for 3 M-octane, pyrene and fluoranthene; the lack of significant difference for the other 180+ compounds supported the hypothesis that burning LDPE with wood would not have an effect on the composition of the smoke (Wrobel and Reinhardt Citation2003). Means and standard deviations for each emission ratio and for the classes were presented; however, we subsequently discovered that a few of the compounds were assigned to the incorrect chemical class (alkane, alkene, etc.) and these errors were corrected. In the original analysis, values below detection level (BDL) were assigned the value of 0 to calculate totals for each of the classes. The reported results for the individual emissions reflected this approach to handling BDL values.

Statistical methods

In the original data, BDL values occurred for the trace compounds. BDL values are common in emissions research and represent a form of left-censored data in compositional data sets (Palarea-Albaladejo and Martín-Fernández Citation2013). Several different compositional methods are available to replace BDL values with an estimate; model-based methods are preferred when the covariance structure of the data is rich and the proportion of parts with values BDL exceeds about 10–15% (Palarea-Albaladejo and Martín-Fernández Citation2013, Citation2015, Citation2008). For this data set, the overall percentage of values that were BDL was 9.3%. Multiplicative lognormal replacement, as implemented by the multLN function in the zCompositions package (Palarea-Albaladejo and Martín-Fernández Citation2015), was applied to replace the BDL values with a fraction of the DL (1 E-07) while adjusting the other parts of the composition multiplicatively to maintain the ratios between parts in accordance with their compositional nature. The interested reader is referred to the documentation of the zCompositions package and the references therein for more details.

Forming log-ratios from the original emission ratios eliminated the numerical influence of CO2:

(1) logERCO2gasjERCO2gask=logΔgasjΔCO2/ΔgaskΔCO2=logΔgasjΔgask.(1)

The effect of LDPE on smoke emissions was analyzed using each gas singly as done originally. While the entire composition could be analyzed in a multivariate approach using PERMANOVA which is a distribution-free permutation technique (Anderson Citation2017), this was not done due to the relatively small sample size in the present study. For the single gas analysis, the data were transformed into log-ratio coordinates using a special ilr transformation (Egozcue et al. Citation2003) known as pivot coordinates (Filzmoser, Hron, and Templ Citation2018) using the robCompositions package (Templ, Hron, and Filzmoser Citation2011). It is important to note that an infinite number of ilr transformations are possible for the data, changing from one another by applying orthogonal rotations of the ilr coordinate system, but it is the nature of the ilr transformation that this preserves the multivariate Euclidean distances within the data set. The pivot coordinate transformation has the characteristic that one part (used as pivot) is isolated in the numerator of the first of the D-1 ilr coordinates associated with a D-part composition, where D is the number of parts (191 in this data set). The first coordinate

(2) z1=D1Dlngas1k=2DgaskD1=1DD1lngas1gas2+lngas1gas3++lngas1gasD(2)

is the normalized log-ratio of a gas species (gas1), placed in the first position of the composition, with the geometric mean of all other gases, which can be interpreted as the relative dominance of that gas within the composition. For each gas in the composition, the order of the parts was permuted, which is equivalent to applying an orthogonal rotation of the ilr coordinate representation, so that each gas was placed once in the first position, and hence in the numerator of the first pivot coordinatez1, and an analysis of variance testing the effect of LDPE was individually performed on these first pivot coordinates sequentially. Multiple post hoc pair-wise comparison tests (glht and mcp functions) assessed the differences between the three levels of LDPE using Tukey contrasts; with p-values being adjusted for multiplicity using the default single-step method in the multcomp package (Bretz, Hothorn, and Westfall Citation2011). It is important to note that the pivot coordinate representations are only useful to make inferences individually about the relative dominances of the gases relative to the others in the composition. That is, the collection of all first pivot coordinates could not be used jointly as representative of the entire original composition nor to investigate associations between pairs of gases (in fact, all first pivot coordinates considered jointly would form a system of linearly dependent coordinates).

In order to test the effect of LDPE on the relative amounts of the chemical classes (alkane, alkene, alkyne, cycloalkane, cycloalkene, aldehyde, ketone, diolefin, MAH, PAH), another type of ilr coordinates called compositional balances z˜k defined as

(3) z˜k=rkskrk+sklnxi1xi2xirk1/rkxj1xj2xjsk1/sk,k=1,,D1,(3)

were used to form scientifically meaningful subsets of parts of the composition (Egozcue and Pawlowsky-Glahn Citation2005). In a balance, the log-ratio compares in relative terms the geometric mean of rk parts in one subset (class) with the geometric mean of sk parts in another subset (class). Five balances were defined as detailed in . The first balance, alkanes vs alkenes and alkynes, examined if the relative reactivity represented by these classes of compounds changed with LDPE. Alkanes tend to be more stable and less reactive than alkenes and alkynes. The next three balances compare the relative importance of these closely related classes of compounds which have are composed of larger molecules and have a wide variety of characteristics. The last balance compared the relative amounts of non-hydrocarbon gases produced by the burning of manzanita with and without LDPE. The emissions data were transformed into balance coordinates and analysis of variance was used to test if the balance means differed between LDPE levels. Individual t-tests of equality of the balances to 0 were also conducted for each LPDE level, with balance equal to 0 here meaning equilibrium between the two classes depicted in numerator and denominator of the log-ratio, respectively. The gases assigned to each class can be found in .

Table 1. Geometric mean emission ratios (ERCO2)a for combustion products produced by burning manzanita (Arctostaphylos sp.) wood-polyethylene (LDPE) plastic mixtures. Values differ from (Hosseini et al. Citation2014) because geometric mean μg used instead of arithmetic mean. Gray shading identifies the chemical species that were significantly related to LDPE. Geometric mean and geometric standard deviation σgbased on three replicates.

Table 2. Summary of analysis of variance testing the effect of LDPE plastic on pivot coordinates (isometric log-ratios) for individual smoke emissions. When the effect of LDPE was statistically significant (p-value ≤ 0.05), the p-values associated with the t-tests for the multiple pair-wise comparisons of the mean values for the levels of LDPE are provided.

Table 3. Summary of analyses of variance examining effect of LDPE plastic on selected compositional balances between classes of smoke emissions.

Results

After fixing the assignment errors in the original data, the geometric mean of each part of the composition for each level of LDPE is contained in . These values supersede the values reported in Tables 3–5 of Hosseini et al. (Citation2014) which had been calculated as arithmetic means. Recall that the original data were relative to the amount of CO2 produced. If ERCO2 is multiplied by 1.8 to estimate an emission factor based on fuel mass (Hosseini et al. Citation2014), the log-ratio of any two gases or groups of gases will not change due to compositional equivalence (van den Boogaart and Tolosana-Delgado Citation2013).

Determining if LDPE influenced emissions by analyzing the ilr pivot coordinate of each gas, i.e. the normalized log-ratio of each gas with the geometric mean of all other gases, resulted in 191 analyses of variance. The mean pivot coordinates were found to differ statistically significantly between levels of LDPE for 8 of the 191 compounds () as indicated by the F-statistic p-values. For 5 of the 8 compounds, the mean pivot coordinates for the 50 g LDPE differed significantly from the 5 g LDPE. With the exception of 3M-1-pentene, whose pivot coordinate decreased in mean from 5 g LDPE to 50 g LDPE indicating that the relative dominance of 3M-1-pentene in the composition decreased as LDPE increased, the relative dominance of the other 4 gases increased as LDPE increased from 5 to 50 g. When the 50 g treatment was compared to the 0 g treatment, 2 of the four significant pivot coordinates increased and 2 decreased. In 1 of the 2 significant comparisons between the 0 and 5 g LDPE there was an increase, the other was a decrease.

Comparisons of the effects of LDPE on the defined balances between relevant classes of compounds did not find any statistically meaningful differences (as indicated by the non-significant F-statistics in ), while the individual t-tests indicated that the balances for the cycloalkanes vs cycloalkenes, aldehydes vs diolefins and ketones, and NOx vs CO were significantly different from 0 in the piles without LDPE. Note that the signs of the balances were unchanged as the amount of LDPE increased. The relative amounts of cycloalkanes compared to cycloalkenes, and NOx compared to CO were significantly smaller in mean as indicated by the negative sign of the respective balances for the no LDPE level. Similarly, the relative amount of aldehydes compared to diolefins and ketones was consistently larger in mean.

A large suite of chemicals was analyzed as part of this study to identify whether the toxicity of the smoke plume from burning silvicultural wood piles would be increased by the use of non-pesticide agricultural LDPE plastic coverings. Hosseini et al. (Citation2014) provide the detailed sampling/analysis protocols. The detailed hydrocarbon analysis protocol SAE 930142HP (Siegl et al. Citation1993) was used to identify a wide array of C4-C12 gas-phase species typically present in combustion emissions, inclusive of many monocyclic aromatic species including the subset of benzene-toluene-ethyl benzenes-xylenes (BTEX) and 1,3-butadiene. The SAE 930142HP protocol for DNPH-based carbonyl methods was also used to quantify emissions of light carbonyl species, which include harmful species such as formaldehyde, acetaldehyde, and acrolein. Further detailed quantification of PAHs following the modified EPATO13A protocol (Winberry and Jungclaus Citation1999) provided insight into changes in PAH emissions.

The detailed and target chemical analysis yielded no statistically significant changes to PAHs, BTEX, 1,3-butadiene, and carbonyls along with the hundreds of other species analyzed demonstrates that an increase in the toxicity of the plume is not expected due to the use of the pesticide-free agricultural LDPE. Further, no statistical differences were observed for alkanes, alkenes, alkynes, aldehydes, cycloalkanes, cycloalkenes, diolefins, ketones, MAHs, and PAHs. Emission rates of non-hydrocarbon toxic species (PM, CO, NO2) were also not impacted by the LDPE.

It is impractical to measure every potential harmful species in the smoke plume. Given that the combustion of LDPE begins by evaporation of light hydrocarbons, it is unlikely that large molecular weight compounds needed additional study. Analysis was not required for harmful metals (LDPE is hydrocarbon-based) and was not performed. Further analysis for nitro- and oxy-PAHs and dioxins were beyond the scope of this study; however, it is very unlikely that these species are enhanced by combusting LDPE, confirmed in part by the lack of change of PAH emissions. Chlorine, sulfur and nitrogen-containing compounds are not expected in the emissions given that LDPE does not contain these precursors. Similarly, reduced species are not expected during oxidation of the LDPE. Therefore, it can be reasonably concluded that none of the compounds included on the US EPA list of 187 Hazardous Air Pollutants (HAPs) (U.S. Environmental Protection Agency Citation2020) identified by Section 112 of the Clean Air Act are expected to increase.

While the chemical composition of wood and its relative amounts of cellulose, hemicellulose and lignin can vary somewhat (Pettersen Citation1984), manzanita wood was assumed to be representative of the several hundred species of woody plants found in western forests. Several studies have documented the composition of smoke over the past several decades resulting from the burning of forest fuels. These fuels often contain foliage, fruits such as cones, decomposing wood and organic soil. Many of these fuels, while hydrocarbon-based, different from wood in terms of trace elements and the relative amounts of cellulose, hemicellulose, and lignin (e.g., Jolly et al. Citation2012; Li et al. Citation2011; Matt, Dietenberger, and Weise Citation2020). Thus, the composition of smoke from a complex mixture of different fuels as typically reported will likely differ slightly compared to the present study. In the original paper, ranges of previously reported emission factors were provided for comparison. Those data points are still valid; however, comparison of those results with the LDPE results should be done in light of the relative nature of the emissions data. The cited data should first be transformed into log-ratios to honor its compositional nature and then appropriate, commonly used, mainstream statistical methods could be applied to compare the present results with the cited data. Such an analysis is worthy but beyond the scope of the present paper.

A method denoted “positive matrix factorization” (Paatero and Tapper Citation1994) has been applied to emissions data (Sekimoto et al. Citation2018; Ulbrich et al. Citation2009). This method recognizes the nonnegative nature of emissions data which is a desirable characteristic. Like principal component analysis on which it is based, the method identifies linear combinations of the parts of a composition. However, the solution is constrained so that the fitted values are all non-negative. This imposed constraint is not a result of the nature of the data, but a means to provide a solution. In contrast, Aitchison (Citation1983) showed that principal component analysis can be used with compositional data with no such constraints on the solution once the data have been transformed appropriately using log-ratios. With this approach, each principal component becomes a log-linear combination of the parts that acknowledges the non-negative nature of the data. The compositional data approach matches analysis techniques to the data type instead of artificially imposing numerical constraints to enable solution.

Summary

Smoke emissions data describe the composition of the smoke produced by the burning of biomass and wildland fuels. As such, the data are inherently multivariate and relative in nature. This has been recognized for many years; however, the statistical techniques commonly used to analyze this type of data have ignored these characteristics. Reanalysis of our previously published study using compositional data techniques confirmed that the composition or toxicity of smoke produced by the burning of low-density polyethylene plastic mixed with manzanita wood was not affected by the presence of the plastic. Measured composition of the wood smoke with plastic did not differ significantly from the composition of the wood smoke alone. The result of the present analysis agrees with the earlier published result; however, the original means as reported were in error due to wrong class assignment as well as inappropriate use of the arithmetic mean instead of geometric mean for ratios. When the individual gases were considered singly, a different set of affected gases was identified compared to the original paper. The original analysis did not account for the relative nature of the data, so the original single gas results are in error and the present results are derived from statistical methods based on the nature of the data. To paraphrase Lovell et al. (Citation2015), various statistical measures and algorithms will produce numbers. It is incumbent upon the analyst to ensure that the methods and measures are appropriate for the data. The present approach, based on CoDA, has matched the methodology and measures to the relative nature of smoke emissions. This reanalysis further strengthens the conclusion that use of LDPE in silvicultural practices does not introduce pollutants in addition to what is produced by burning wood alone. The simulated debris piles represented near-optimum burning conditions. The manzanita wood was dry and there was no dirt and other material which might reduce the combustion efficiency thus altering the composition of the smoke emissions. While the log-ratios between compounds may change as a result (Weise et al. Citation2020), the LDPE portion of the fuel bed should respond as the wood does. If the woody fuels contain significant moisture, the LDPE component may actually burn better than the wood since it does not easily adsorb water and the emissions produced in that situation should be similar to the results in the present study. Recent measurements of burning debris piles covered by LDPE plastic burned operationally (Aurell et al. Citation2016) reported limited effect of LDPE on measured emissions; however, these data were not treated as compositional data.

The compositional data analysis techniques used in this study should be applied to other smoke emissions data as well as other compositional data related to wildland fire. While many of the compounds measured are gases at flame temperatures, the data set also contained particulate data (see the MAH in ). The techniques presented here can be applied to any emissions data describing composition. Particle size class distributions are an example of discrete compositional data that can be analyzed using techniques based on CoDA (Huston and Schwarz Citation2012). Citations in the Introduction illustrate the wide-ranging nature of data that are collected to describe the composition of “things.” Analyzing smoke emissions data after transformation into ilr coordinates enables correct application of methods such as linear regression, analysis of variance, time series models and generalized linear models to examine and predict the complex nature of smoke emissions. The references provided herein to the literature and to easily acquired software should permit the appropriate statistical analysis of wildland fire data to provide an improved scientific basis for fire and smoke management decisions.

Acknowledgments

The data used in this paper resulted from the DOD/DOE/EPA Strategic Environmental Research and Development Program project RC-1648. Support for this study was also provided by the Angeles, Cleveland, Los Padres, and San Bernardino National Forests of the USDA Forest Service. J. P.-A. was supported by the Spanish Ministry of Science, Innovation and Universities under the project CODAMET (RTI2018-095518-B-C21, 2019-2021).

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was supported by the DOD/DOE/EPA Strategic Environmental Research and Development Program [RC-2640]; Spanish Ministry of Science, Innovation, and Universities [CODAMET (RTI2018-095518-B-C21, 2019-2021)].

Notes on contributors

David R. Weise

David R. Weise is a Research Forester at the USDA Forest Service Pacific Southwest Research Station, Riverside, CA, USA.

Heejung Jung

Heejung Jung is a Professor, Department of Mechanical Engineering, University of California, Riverside, CA, USA.

Javier Palarea-Albaladejo

Javier Palarea-Albaladejo is Principal Statistical Scientist at Biomathematics and Statistics Scotland, Edinburgh, Scotland, UK.

David R. Cocker

David R. Cocker is a Professor and Chair, Department of Chemical and Environmental Engineering, University of California, Riverside, CA, USA.

Notes

1 The use of trade or firm names in this publication is for reader information and does not imply endorsement by the U.S. Department of Agriculture of any product or service.

References

  • Aitchison, J. 1983. Principal component analysis of compositional data. Biometrika 70 (1):57–65. doi:10.1093/biomet/70.1.57.
  • Aitchison, J. 1986. The statistical analysis of compositional data. Monographs on Statistics and Applied Probability. London; New York: Chapman and Hall.
  • Anderson, M. J. 2017. Permutational multivariate analysis of variance (PERMANOVA). In Wiley StatsRef: Statistics reference online, ed. N. Balakrishnan, T. Colton, B. Everitt, W. Piegorsch, F. Ruggeri, and J. L. Teugels, 1–15. Chichester, UK: John Wiley & Sons, Ltd. doi: 10.1002/9781118445112.stat07841.
  • Aurell, J., B. K. Gullett, D. Tabor, and N. Yonker. 2016. Emissions from prescribed burning of Timber Slash piles in Oregon. Atmos. Environ. November. doi: 10.1016/j.atmosenv.2016.11.034.
  • Bandeen-Roche, K. 1994. Resolution of additive mixtures into source components and contributions: A compositional approach. J. Am. Stat. Assoc. 89 (428):1450–58. doi:10.1080/01621459.1994.10476883.
  • Barceló-Vidal, C., and J.-A. Martín-Fernández. 2016. The mathematics of compositional analysis. Austrian J. Stat. 45 (4):57. doi:10.17713/ajs.v45i4.142.
  • Benjamini, Y., and Y. Hochberg. 1995. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B Methodol. 57 (1):289–300.
  • Billheimer, D. 2001. Compositional receptor modeling. Environmetrics 12 (5):451–67. doi:10.1002/env.472.
  • Bretz, F., T. Hothorn, and P. H. Westfall. 2011. Multiple comparisons using R. Boca Raton, FL: CRC Press.
  • Buccianti, A., and V. Pawlowsky-Glahn. 2006. Statistical evaluation of compositional changes in volcanic gas chemistry: A case study. Stoch. Environ. Res. Risk Assess. 21 (1):25–33. doi:10.1007/s00477-006-0041-x.
  • Chastin, S. F. M., J. Palarea-Albaladejo, M. L. Dontje, and D. A. Skelton. 2015. Combined effects of time spent in physical activity, sedentary behaviors and sleep on obesity and cardio-metabolic health markers: A novel compositional data analysis approach. Edited by J. Devaney. PLoS One 10 (10):e0139984. doi: 10.1371/journal.pone.0139984.
  • Conover, W. J. 1999. Practical nonparametric statistics. 3rd ed. Wiley Series in Probability and Statistics. Applied Probability and Statistics Section. New York: Wiley.
  • Delmas, R., J. P. Lacaux, and D. Brocard. 1995. Determination of biomass burning emission factors: Methods and results. In African greenhouse gas emission inventories and mitigation options: Forestry, land-use change, and agriculture, ed. J. F. Fitzgerald, B. V. Braatz, S. Brown, A. O. Isichei, E. O. Odada, and R. J. Scholes, 75–98. Dordrecht: Springer Netherlands. http://www.springerlink.com/index/10.1007/978-94-009-1637-1_6.
  • Egozcue, J. J., and V. Pawlowsky-Glahn. 2005. Groups of parts and their balances in compositional data analysis. Math. Geol. 37 (7):795–828. doi:10.1007/s11004-005-7381-9.
  • Egozcue, J. J., V. Pawlowsky-Glahn, G. Mateu-Figueras, and C. Barceló-Vidal. 2003. Isometric logratio transformations for compositional data analysis. Math. Geol. 35 (3):279–300. doi:10.1023/A:1023818214614.
  • Fernandes, A. D., J. N. Reid, J. M. Macklaim, T. A. McMurrough, D. R. Edgell, and G. B. Gloor. 2014. Unifying the analysis of high-throughput sequencing datasets: Characterizing RNA-Seq, 16S RRNA gene sequencing and selective growth experiments by compositional data analysis. Microbiome 2 (1):15. doi:10.1186/2049-2618-2-15.
  • Filzmoser, P., K. Hron, and M. Templ. 2018. Applied compositional data analysis: With worked examples in R. New York, NY: Springer Berlin Heidelberg.
  • Hosseini, S., M. Shrivastava, D. R. Cocker, H. S. Jung, and D. R. Weise. 2018. Low-density polyethylene smoke emissions from simulated debris piles. Research data set RDS-2018-0037. Fort Collins, CO: USDA Forest Service Research Data Archive. doi: 10.2737/RDS-2018-0037.
  • Hosseini, S., M. Shrivastava, L. Qi, D. R. Weise, D. R. Cocker, J. W. Miller, and H. S. Jung. 2014. Effect of low-density polyethylene on smoke emissions from burning of simulated debris piles. J. Air Waste Manag. Assoc. 64 (6):690–703. doi:10.1080/10962247.2014.882282.
  • Huston, C., and C. Schwarz. 2012. Hierarchical Bayesian strategy for modeling correlated compositional data with observed zero counts. Environ. Ecol. Stat. 19 (3):327–44. doi:10.1007/s10651-012-0189-0.
  • Jolly, W. M., R. A. Parsons, A. M. Hadlow, G. M. Cohn, S. S. McAllister, J. B. Popp, R. M. Hubbard, and J. F. Negron. 2012. Relationships between moisture, chemistry, and ignition of Pinus Contorta needles during the early stages of mountain Pine Beetle attack. For. Ecol. Manag. 269 (April):52–59. doi:10.1016/j.foreco.2011.12.022.
  • Li, G.-Y., L.-H. Huang, C.-Y. Hse, and T.-F. Qin. 2011. Chemical compositions, infrared spectroscopy, and X-Ray diffractometry study on brown-rotted woods. Carbohydr. Polym. 85 (3):560–64. doi:10.1016/j.carbpol.2011.03.014.
  • Lovell, D., V. Pawlowsky-Glahn, J. J. Egozcue, S. Marguerat, and J. Bähler. 2015. Proportionality: A valid alternative to correlation for relative data. PLoS Comput. Biol. 11 (3):e1004075. doi:10.1371/journal.pcbi.1004075.
  • Mardia, K. V., J. T. Kent, and J. M. Bibby. 1979. Multivariate analysis. Probability and Mathematical Statistics. London; New York: Academic Press.
  • Mason, R. L., R. F. Gunst, and J. L. Hess. 1989. Statistical design and analysis of experiments: With applications to engineering and science. Wiley Series in Probability and Mathematical Statistics. New York: Wiley. https://www.cs.ubc.ca/~hutter/EARG.shtml/earg/papers05/Stat_Analysis_of_Exp_Gunst_Hess.pdf.
  • Mateu-Figueras, G., V. Pawlowsky-Glahn, and J. J. Egozcue. 2011. The principle of working on coordinates. In Compositional data analysis, ed. V. Pawlowsky-Glahn and A. Buccianti, 31–42. Chichester, UK: John Wiley & Sons, Ltd. doi: 10.1002/9781119976462.ch3.
  • Matt, F. J., M. A. Dietenberger, and D. R. Weise. 2020. Summative and ultimate analysis of live leaves from southern U.S. forest plants for use in fire modeling. Energy Fuels 34 (March):4703–20. doi:10.1021/acs.energyfuels.9b04107.
  • Mendenhall, W., R. L. Schaeaffer, and D. D. Wackerly. 1981. Mathematical statistical with applications. Belmont, CA: Wadsworth.
  • Paatero, P., and U. Tapper. 1994. Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values. Environmetrics 5 (2):111–26. doi:10.1002/env.3170050203.
  • Palarea-Albaladejo, J., and J. A. Martín-Fernández. 2008. A modified EM Alr-algorithm for replacing rounded zeros in compositional data sets. Comput. Geosci. 34 (8):902–17. doi:10.1016/j.cageo.2007.09.015.
  • Palarea-Albaladejo, J., and J. A. Martín-Fernández. 2013. Values below detection limit in compositional chemical data. Anal. Chim. Acta 764 (February):32–43. doi:10.1016/j.aca.2012.12.029.
  • Palarea-Albaladejo, J., and J. A. Martín-Fernández. 2015. zCompositions — R package for multivariate imputation of left-censored data under a compositional approach. Chemom. Intell. Lab. Syst. 143 (April):85–96. doi:10.1016/j.chemolab.2015.02.019.
  • Palarea-Albaladejo, J., J. A. Martín-Fernández, and A. Buccianti. 2014. Compositional methods for estimating elemental concentrations below the limit of detection in practice using R. J. Geochem. Explor. 141 (June):71–77. doi:10.1016/j.gexplo.2013.09.003.
  • Palarea-Albaladejo, J., J. A. Rooke, I. M. Nevison, and R. J. Dewhurst. 2017. Compositional mixed modeling of methane emissions and ruminal volatile fatty acids from individual cattle and multiple experiments1. J. Anim. Sci. 95 (6):2467–80. doi:10.2527/jas.2016.1339.
  • Pawlowsky-Glahn, V., J. J. Egozcue, and R. Tolosana-Delgado. 2015. Modelling and analysis of compositional data. Statistics in practice. Chichester, West Sussex, U.K.: Wiley.
  • Pearson, K. 1896. Mathematical contributions to the theory of evolution.–on a form of spurious correlation which may arise when indices are used in the measurement of organs. Proc. R. Soc. Lond. 1854–1905 60 (1):489–98. doi:10.1098/rspl.1896.0076.
  • Pettersen, R. C. 1984. The chemical composition of wood. In The chemistry of solid wood, ed. R. Rowell, Vol. 207, 57–126. Advances in Chemistry 207. Washington, DC: American Chemical Society. doi: 10.1021/ba-1984-0207.ch002.
  • R Core Team. 2018. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. http://www.R-project.org/(version 3.5.0).
  • Sekimoto, K., A. R. Koss, J. B. Gilman, V. Selimovic, M. M. Coggon, K. J. Zarzana, B. Yuan, B. M. Lerner, S. S. Brown, C. Warneke, et al.. 2018. High- and low-temperature pyrolysis profiles describe volatile organic compound emissions from western US wildfire fuels. Atmospheric Chem. Phys. 18 (13):9263–81. doi:10.5194/acp-18-9263-2018.
  • Siegl, W. O., J. F. O. Richert, T. E. Jensen, D. Schuetzle, S. J. Swarin, J. F. Loo, A. Prostak, D. Nagy, and A. M. Schlenker. 1993. Improved emissions speciation methodology for phase II of the auto/oil air quality improvement research program – hydrocarbons and oxygenates. In, 930142. doi:10.4271/930142.
  • Speranza, A., R. Caggiano, G. Pavese, and V. Summa. 2018. The study of characteristic environmental sites affected by diverse sources of mineral matter using compositional data analysis. Condens. Matter 3 (2):16. doi:10.3390/condmat3020016.
  • Templ, M., K. Hron, and P. Filzmoser. 2011. RobCompositions: An R-package for robust statistical analysis of compositional data. In Compositional data analysis, ed. V. Pawlowsky-Glahn and A. Buccianti, 341–55. Chichester, UK: John Wiley & Sons, Ltd. doi: 10.1002/9781119976462.ch25.
  • Thió-Henestrosa, S., and M. Comas. 2016. CoDaPack v2 User’s Guide. University of Girona, Dept. of Computer Science and Applied Mathematics. http://ima.udg.edu/codapack/assets/codapack-manual.pdf.
  • U.S. Environmental Protection Agency. 2020. Initial list of hazardous air pollutants with modifications. https://www.epa.gov/haps/initial-list-hazardous-air-pollutants-modifications.
  • Ulbrich, I. M., M. R. Canagaratna, Q. Zhang, D. R. Worsnop, and J. L. Jimenez. 2009. Interpretation of organic components from positive matrix factorization of aerosol mass spectrometric data. Atmospheric Chem. Phys. 9 (9):2891–918. doi:10.5194/acp-9-2891-2009.
  • van den Boogaart, K. G., and R. Tolosana-Delgado. 2013. Analyzing compositional data with R. Use R! Heidelberg: Springer.
  • Ward, D. E., and W. M. Hao. 1991. Projections of Emissions from Burning of Biomass for Use in Studies of Global Climate and Atmospheric Chemistry. In, 19. Vancouver, British Columbia, Canada: Air and Waste Management Association. http://www.treesearch.fs.fed.us/pubs/43258.
  • Weise, D. R., J. Cobian-Iñiguez, and M. Princevac. 2018. Surface to crown transition. In Encyclopedia of wildfires and wildland-urban interface (WUI) fires, ed. S. L. Manzello, 1–5. Cham: Springer International Publishing. doi: 10.1007/978-3-319-51727-8_24-1.
  • Weise, D. R., J. Palarea‐Albaladejo, T. J. Johnson, and H. Jung. 2020. Analyzing wildland fire smoke emissions data using compositional data techniques. J. Geophys. Res. Atmospheres 125 (6):e2019JD032128. doi:10.1029/2019JD032128.
  • Winberry, W. T., Jr., and G. Jungclaus. 1999. Compendium of methods for the determination of toxic organic compounds in ambient air, second edition: Compendium method TO-13A determination of polycyclic aromatic hydrocarbons (PAHs) in ambient air using gas chromatography/mass spectrometry (GC/MS). Method EPA/625/R-96/010b. Cincinnati, OH: U.S. Environmental Protection Agency. www.nepis.epa.gov.
  • Wrobel, C., and T. Reinhardt. 2003. Review of potential air emissions from burning polyethylene plastic sheeting with piled forest debris. Final Report Prepared for USDA Forest Service Pacific Northwest Research Station, Seattle, WA. Seattle, WA: URS Corporation. http://www.oregon.gov/odf/fire/docs/smp/finalpereportoct2803.pdf.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.