913
Views
2
CrossRef citations to date
0
Altmetric
Original Articles

Quantifying errors in the aerosol mixing-state index based on limited particle sample size

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon show all
Pages 1527-1541 | Received 17 Apr 2020, Accepted 07 Jul 2020, Published online: 03 Sep 2020

Abstract

This study evaluates the error that is introduced in quantifying observed aerosol mixing states due to a limited particle sample size. We used the particle-resolved model PartMC-MOSAIC to generate a scenario library that encompasses a large number of reference particle populations with a wide range of mixing states quantified by the mixing-state index χ. We stochastically sub-sampled these particle populations using sample sizes of 10 to 10,000 particles and recalculated χ based on the sub-samples. The finite sample size led to a consistent overestimation of χ, with the 95% confidence intervals ranging from −70 to 30 percentage points for sample sizes of 10 particles, and decreasing to ±10 percentage points for sample sizes of 10,000 particles. These findings were experimentally confirmed with single-particle measurements from the Pittsburgh area using a soot-particle aerosol mass spectrometer.

Copyright © 2020 American Association for Aerosol Research

EDITOR:

1. Introduction

Atmospheric aerosols are evolving mixtures of different chemical species (Prather, Hatch, and Grassian Citation2008). The term “aerosol mixing state” is commonly used to describe how different chemical species are distributed throughout a particle population (Winkler Citation1973; Riemer et al. Citation2019). Aerosol mixing state influences the particles’ reactivity (Ryder et al. Citation2014), their optical properties (Moffet and Prather Citation2009; Lesins, Chylek, and Lohmann Citation2002), their hygroscopicity (Sullivan et al. Citation2009; Ching et al. Citation2017), and their propensity to serve as ice nuclei (Beydoun, Polen, and Sullivan Citation2017; Knopf and Alpert Citation2013). Hence, to predict aerosol impacts on atmospheric chemistry and climate, it is important to account for mixing state (Riemer et al. Citation2019); and this motivates efforts to determine mixing state from ambient observations (Healy et al. Citation2014; O’Brien et al., 2015; Ye et al. Citation2018) and via modeling (Riemer et al. Citation2009; Riemer and West Citation2013; Ching, Riemer, and West Citation2016).

The terms “internal” and “external” mixture qualitatively describe mixing state. A population is considered fully internally mixed if each individual particle consists of the same species mixtures, while an external mixture implies that the different aerosol species reside in separate particles. Most ambient aerosol populations do not fit into either category, but resemble both internal and external mixtures to a degree. Often the term “mixing state” is applied to particles in a given size range, for example the accumulation mode or the coarse mode. This choice depends on the aerosol sampling instrumentation specifications or the science question being addressed. It is also important to be aware that mixing state as defined here does not capture the full potential diversity of particle populations, as the particle morphology (e.g., core-shell, well-mixed) can add additional diversity. In this article, we will only consider mixing state as defined above, which is also termed the “chemical mixing state” in Riemer et al. (Citation2019).

For a quantitative description of aerosol mixing state, the mixing-state index χ has been introduced (Riemer and West Citation2013), which can be calculated based on the particles’ species mass fractions. This scalar quantity ranges from 0 to 100% for fully external to internal mixtures, respectively. Several field studies have used this index to quantify mixing states for different ambient environments using sophisticated single-particle measurement techniques, including electron microscopy and X-ray spectroscopy (O’Brien et al., 2015), and mass spectrometry (Healy et al. Citation2013). These observations confirm that mixing states in the ambient atmosphere are neither completely internally nor externally mixed but rather exist on a spectrum in between those limiting cases with characteristic temporal and spatial variability. For example, a study by Healy et al. (Citation2014) for the MEGAPOLI campaign in Paris, France, revealed that the aerosol in Paris was estimated to be 59% internally mixed on average, with more external mixtures during the daytime when primary emissions from traffic and woodburning were present, and more internal mixtures during the night when ammonium nitrate formation prevailed. Healy et al. (Citation2014) used single-particle aerosol time-of-flight mass spectrometry (ATOFMS) data for their study. Ye et al. (Citation2018) quantified the spatial variation of χ for the city of Pittsburgh on a neighborhood scale with a mobile measurement platform using single-particle measurements from a soot-particle aerosol mass spectrometer (SP-AMS) and found the lowest values (36%) close to an interstate highway and the highest values (76%) in rural or suburban regions.

The mixing-state index χ is based on species diversity measures (Riemer and West Citation2013), which have been extensively used and developed in ecology and related fields. See Daly, Baetens, and Baets (Citation2018) for an excellent overview and Sherwin and Prat I Fornells (Citation2019) for a discussion of the history. Within ecology it is well-known that undersampling can result in inaccurate and biased estimates of diversity measures (Beck and Schwanghart Citation2010; Beck, Holloway, and Schwanghart Citation2013) and the performance of different statistical methods has been studied for realistic scenarios (Butturi-Gomes et al. Citation2017; Brocklehurst, Day, and Fröbisch Citation2018). In response, better estimators have been proposed that are based on the discovery rate of new species (Chao, Wang, and Jost Citation2013; Chao and Jost Citation2015; Haegeman et al. Citation2013), measures such as pairwise dissimilarities (Marion Citation2016) have been used as alternatives, and Bayesian estimators have been developed (Marion, Fordyce, and Fitzpatrick Citation2018).

Just as undersampling is a problem in ecology, measurements of atmospheric aerosols inherently sample a finite number of particles to estimate the mixing state and associated metrics. These finite samples range from a few hundred to many thousands of particles, depending on the instrumentation. For example, O’Brien et al. (2015) utilized electron microscopy and X-ray spectroscopy methods to analyze particle samples from the 2010 Carbonaceous Aerosols and Radiative Effects study with sample sizes ranging from several hundred to several thousand particles. Ye et al. (Citation2018) used single-particle mass spectrometry with sample sizes on the order of tens of thousand of particles. Unfortunately, we cannot directly apply the improved diversity estimators developed in ecology (Chao and Jost Citation2015) because they use the fact that species abundance is measured by sampling individuals in the species, a concept that does not readily transfer to chemical measurements.

The question arises of how large a particle sample size must be to adequately represent the mixing state of an atmospheric aerosol. Using a large ensemble of simulated aerosol populations generated with a particle-resolved model, the goal of this article is to quantify errors in determining χ introduced by limited-size particle samples. Since the “true” value of χ is not known when making observations in practice, we also provide confidence intervals around the measured χ values for different sample sizes.

2. Methods

2.1. Calculating mixing-state index χ

The mixing-state index χ by Riemer and West (Citation2013) provides a rigorous definition of aerosol mixing state. It is given by the affine ratio of the diversity metrics Dα and Dγ as (1) χ=Dα1Dγ1.(1)

The diversity metrics, in turn, are defined as follows. First, the particle mixing entropies Hi need to be calculated for each particle based on the particle species mass fractions (2) Hi=a=1Apialnpia,(2) where A is the number of distinct aerosol species, and pia is the mass fraction of species a in particle i. The particle diversities Di=exp(Hi) give the effective number of species in each particle, which is equal to the number of physical species if they are all present in equal proportions, and less otherwise.

The particle Hi values are then averaged (mass-weighted) over the entire population to give Hα, and finally the average particle species diversity Dα, by (3) Hα=i=1NppiHi,(3) (4) Dα=eHα,(4) where Np is the total number of particles in the population, and pi is the mass fraction of particle i in the population. Dα is the mean particle diversity, which gives the mean effective number of species over all particles in the population.

Lastly, the bulk diversity Dγ is defined by (5) Hγ=a=1Apalnpa,(5) (6) Dγ=eHγ.(6)

This is the total diversity of the population, which is the effective number of species in the aerosol bulk.

Importantly, the definition of “species” depends on the application or the instrumentation used to determine mass fractions. In some previous studies, elemental species have been used (O’Brien et al., 2015; Fraund et al. Citation2017; Bondy et al. Citation2018), while others used molecular species (Healy et al. Citation2014; Ye et al. Citation2018) or species groups (Dickau et al. Citation2016; Ching et al. Citation2017; Hughes et al. Citation2018). To make our results comparable to Ye et al. (Citation2018) (who observed organics, nitrate, sulfate, chloride and black carbon, as detected by the soot-particle aerosol mass spectrometer), we chose the aerosol model species that constitute the dry aerosol mass, such as ammonium, sulfate, nitrate, black carbon, as well as several organic species, with the addition of dust and sodium chloride. Note that the soot-particle aerosol mass spectrometer cannot measure dust and sodium chloride. Aerosol water is excluded from our calculations. While our scenario library includes coarse-mode particles, we only included sub-micron particles in our calculations for χ, since this is the relevant size range for ambient measurements using AMS instruments.

2.2. PartMC-MOSAIC model description

PartMC-MOSAIC is a unique modeling tool to simulate aerosol mixing state and its impacts under a wide variety of conditions. With this tool, each particle can be represented explicitly, allowing for accurate calculations of single-particle species mass fraction This is in contrast to other common aerosol modeling techniques which represent averages of particle composition over certain size ranges rather than per-particle composition (Riemer et al. Citation2019). A detailed description of PartMC-MOSAIC is given in Riemer et al. (Citation2009). In brief, PartMC (Particle-resolved Monte Carlo) is a box model that explicitly resolves composition on a per-particle level within a well-mixed computational volume representative of a much larger air parcel. The evolution of the particle population—due to Brownian coagulation, emission and dilution—is tracked throughout the simulation using the Monte Carlo approach. PartMC is coupled to MOSAIC (Model for Simulating Aerosol Interactions and Chemistry) (Zaveri et al. Citation2008) which models gas-phase chemistry and gas-particle partitioning (condensation processes). MOSAIC consists of four modules: (1) the gas-phase photochemistry mechanism CBM-Z (Zaveri and Peters Citation1999), (2) the Multicomponent Taylor Expansion Method (MTEM) (Zaveri, Easter, and Wexler Citation2005b), (3) the Multicomponent Equilibrium Solver for Aerosols (MESA) for intraparticle solid-liquid partitioning (Zaveri, Easter, and Peters Citation2005a), and (4) the Adaptive Step Time-split Euler Method (ASTEM) for dynamic gas-particle partitioning (Zaveri et al. Citation2008). To simulate secondary organic aerosol (SOA) the SORGAM scheme is used (Schell et al. Citation2001). MOSAIC treats all locally and globally important gas and aerosol species including a total of 77 gaseous species and 19 aerosol species including SO42,HSO4,NO3,Cl,CO3,NH4+,Na+,Ca2+, other inorganic mass (representing crustal material), black carbon (BC), primary organic aerosol (POA) and SOA.

2.3. Ensemble of scenarios and sampling technique

To generate confidence intervals for χ for a wide range of conditions, we made use of a scenario library comprising 1000 different PartMC-MOSAIC scenarios (Hughes et al. Citation2018). All scenarios used a simulation time of 24 h, starting at 6:00 AM local time, with output saved every hour. Each simulation was run using 100,000 computational particles, producing a high-resolution representation of aerosol mixing state. Twenty-four input parameters (temperature, relative humidity, latitude, gas phase emission rates, emission rates, size parameters and composition of primary aerosol particles, including carbonaceous particles, sea salt, and mineral dust) were varied between scenarios to allow for large variations in mixing-state evolution. The scenario inputs were generated using Latin hypercube sampling to provide efficient sampling across the high-dimensional input parameter space. The entire scenario library generated 24,000 distinct reference particle populations (24 h × 1000 scenarios) (Gasparik et al. Citation2020). For each reference population we calculated the reference value χref of the mixing-state index.

To mimic the sampling process used in single-particle field measurements, we subsampled each reference population without replacement using different sample sizes (N=10,100,1000,10,000). To determine confidence intervals we repeated each subsampling 1000 times, which resulted in 24,000,000 sampled particle populations (24 h × 1000 scenarios × 1000 samples) for each sample size N. For each sampled population we recalculated χ using only the particles in the sample and we denoted these χ values as χsampN. Similarly, Dα,sampN and Dγ,sampN are the α- and γ-diversities computed used only the sampled particles.

It is important to note that even the large number of 100,000 computational particles still represents an—albeit large—subsample of the “true” limiting population with a (near-)infinite number of particles. The key point is that the error between the 100,000 particle sample and the true population is expected to be much smaller than the error between our largest subsample (10,000 particles) and the true population. This is a reasonable assumption as the error scales with 1/N, meaning that the error for the 100,000-particle sample is a factor of 10 smaller than that of the 10,000-sample. To maintain this 10 factor, we limit the largest sample to 10% of the size of our reference population.

illustrates this process for one single reference population. In this case, χref was 46%. Sampling this population with only 10 particles produced a broad range of χsamp10 values from 20 to 100%. This range narrowed progressively as N increased, resulting in a range of 41 to 50% for a sample size of 10,000 particles. Another important result is that for small sample sizes, χsampN overestimates χref. In Section 3, we will see that this positive bias is a consistent result of the sampling process that can be explained with the fact that on average a sub-population overestimates Dα and underestimates Dγ. A rigorous proof is provided in Section 4.

Figure 1. Frequency distribution of χsampN for one example population (χref=46%) and different sample sizes. For each sample size, the sampling process was repeated 1000 times. The dashed colored lines correspond to the average χ¯sampN for the specified sample size.

Figure 1. Frequency distribution of χsampN for one example population (χref=46%) and different sample sizes. For each sample size, the sampling process was repeated 1000 times. The dashed colored lines correspond to the average χ¯sampN for the specified sample size.

2.4. Experimental determination of χ using observations in Pittsburgh

To provide experimental confirmation for the particle-resolved modeling results, a similar analysis was conducted using field data from Ye et al. (Citation2018). In this study, aerosol samples were collected using the single-particle mode of a soot-particle aerosol mass spectrometer on a mobile platform throughout the city of Pittsburgh, PA.

The seven major particle clusters or types identified in the Pittsburgh mobile sampling data set were classified as: a sulfate-rich inorganic class that also contained OA and nitrate measured in the summer, a nitrate-domoinated class with small amounts of sulfate and little OA measured in the winter, less-oxidized hydrocarbon-like organics (HOA) associated with vehicle emissions, less-oxidized cooking-like organic aerosol (COA) associated with restaurant emissions, black carbon-dominated with small amounts of OA or inorganics, more-oxygeneated OA rich, and less-oxygeneated OA rich (Ye et al. Citation2018). The later two OA-rich classes contained small amounts of inorganics. Differences in particle composition and mixing state and of these properties as a function of particle size were observed in different sampled environments that included: in highly trafficked tunnels, on highways, an urban area with high traffic density, and on a road through a large park. Other specific environments that produced unique particle mixing states included: inside a restaurant plume (COA dominated), downtown with high restaurant density (mixture of COA, inorganics, and HOA), and a suburban residential area with low restaurant density (diverse mixture of inorganics, OA, COA, and HOA) (Ye et al. Citation2018).

Three aerosol samples were used for the analysis here including the sample collected in Pittsburgh downtown (∼11,000 particles), at the Carnegie Mellon University (CMU) urban campus in the summer (∼15,000 particles) and at the CMU campus in the winter (∼47,000 particles). Aerosols collected in Pittsburgh downtown were chosen to represent samples from regions that are in close proximity to sources of primary particles, while the CMU campus represents an urban background. Aerosol populations collected in Pittsburgh downtown were a combination of six visits to downtown, and the sampling time for each visit ranged between 30 min and 1 h. Aerosol populations collected on CMU campus were aggregated over several hours to a day. Based on the single-particle spectra of the three populations, per-particle mass fractions were determined and mixing state indices χref were calculated for each population. Five species are considered for calculations of the mixing state index: organics, nitrate, sulfate, chloride and black carbon. Ammonium is not considered due to the large interference from the water signal in the mass spectra in the single-particle aerosol mass spectrometer. For more details about the method of using a soot-particle aerosol mass spectrometer to determine per-particle mass fractions and the mixing state index, please see Ye et al. (Citation2018). Populations of 10, 100, and 1000 particles were stochastically sampled from the full particle samples from each location. Since the full particle samples are on the order of tens of thousand particles, our largest subsample is 1000, with the rationale explained in Section 2.3. The mixing state indices χsamp were calculated for each of the limited sample sizes and used for comparative analysis in this article.

3. Numerical and observational results

The sampling results are presented as two-dimensional histograms that, for each sample size N, include all 1000 samples of the 24,000 simulated reference populations (resulting in 24,000,000 data points for each sample size). shows χsampN versus χref for the four sample sizes. Estimating χref based on a particle sample of only 10 particles led to results that can greatly overestimate or underestimate χref. As the sample size increased from 10 to 10,000 particles the points converged on the one-to-one line, meaning that sampled mixing-state values more accurately approximate the associated reference values.

Figure 2. Distribution of sampled population mixing-state index χsampN and reference mixing-state index χref for increasing sample sizes based on the simulated scenario library described in Section 2.3. The solid black line is the 1:1 line.

Figure 2. Distribution of sampled population mixing-state index χsampN and reference mixing-state index χref for increasing sample sizes based on the simulated scenario library described in Section 2.3. The solid black line is the 1:1 line.

shows the corresponding plot using the field data from Pittsburgh. Each vertical cluster of points corresponds to data from a location or time period where the sampling occurred. Each of these clusters has a different χref value, calculated from the full samples discussed in Section 2.4. For the downtown location, the χref value was lowest (40%), consistent with the expectation for an area where fresh emissions mix with more aged aerosol. The aerosol at the CMU campus during winter (χref=51%) was more externally mixed compared to the summer period (χref=72%). This can be explained by the more extensive photochemical oxidation that drives the production of condensible secondary components, which condense onto preexisting particles and thus homogenize aerosol composition. A more detailed discussion on the spatial and temporal variability of aerosol mixing state in the Pittsburgh area can be found in Ye et al. (Citation2018). Similar to the procedure used to analyze the modeled results, each χref population was stochastically sub-sampled, which resulted in a range of χsampN values that converged to the reference value as N increased.

Figure 3. Sampled χsampN as a function of χref from data obtained in Pittsburgh, PA. The one-to-one line is drawn for reference.

Figure 3. Sampled χsampN as a function of χref from data obtained in Pittsburgh, PA. The one-to-one line is drawn for reference.

Both the simulation () and field data () results show that the sampled χsampN has a positive bias. That is, it tends to be larger than the reference χref. To investigate this more precisely, we computed the average χ¯sampN for the simulation data, where the average is taken over all 1000 repeats and is mass-weighted (see Section 4.2 for details). This quantity is plotted versus χref in . In contrast to which displayed all individual 1000 × 24,000 data points, shows the averages over the 1000 repeats. This lets us see the patterns more clearly and confirms that χsampN is positively biased (above the one-to-one line). As expected, the bias vanished as the sample size increased from 10 to 10,000 particles. The question arises of how this bias can be explained. In particular, since χ is the affine ratio of the average particle species diversity Dα and the bulk diversity Dγ (EquationEquation (1)), we need to determine the effect of sub-sampling on estimating these quantities individually before calculating the ratio.

Figure 4. Distribution of average sampled χ¯sampN (averaged over the 1000 repeats) and χref for increasing sample sizes based on the simulated scenario library described in Section 2.3. The one-to-one line is drawn for reference.

Figure 4. Distribution of average sampled χ¯sampN (averaged over the 1000 repeats) and χref for increasing sample sizes based on the simulated scenario library described in Section 2.3. The one-to-one line is drawn for reference.

The positive bias is a result of biases in the average sampled diversity metrics, D¯α,sampN and D¯γ,sampN. and show the averaged sampled diversities versus reference diversities for the simulated populations. That is, shows the quantity needed for the numerator in EquationEquation (1), and shows the quantity needed for the denominator in EquationEquation (1). These show that, on average, sampling overestimates the mean particle diversity (Dα) and underestimates the total diversity (Dγ). Said another way, using a sample of particles will lead us to think that the average particle has more effective species than are really present, but that the bulk has less effective species than it actually does. Because χ is the affine ratio of mean particle to total diversity (see EquationEquation (1)), χ is thus overestimated. Writing these numerical observations in equations, we have (7) D¯α,sampNDα,ref,[sampled populationsoverestimatemean particle diversity](7) (8) D¯γ,sampNDγ,ref,[sampled populationsunderestimatetotal diversity](8) (9) χ¯sampNχref.[sampled populationsoverestimatemixingstate index](9)

Figure 5. Distribution of average sampled D¯α,sampN (mean particle diversity, averaged over the 1000 repeats) and Dα,ref (reference mean particle diversity) for increasing sample sizes based on the simulated scenario library described in Section 2.3. The one-to-one line is drawn for reference.

Figure 5. Distribution of average sampled D¯α,sampN (mean particle diversity, averaged over the 1000 repeats) and Dα,ref (reference mean particle diversity) for increasing sample sizes based on the simulated scenario library described in Section 2.3. The one-to-one line is drawn for reference.

Figure 6. Distribution of average sampled D¯γ,sampN (total diversity, averaged over the 1000 repeats) and Dγ,ref (reference total diversity) for increasing sample sizes based on the simulated scenario library described in Section 2.3. The one-to-one line is drawn for reference.

Figure 6. Distribution of average sampled D¯γ,sampN (total diversity, averaged over the 1000 repeats) and Dγ,ref (reference total diversity) for increasing sample sizes based on the simulated scenario library described in Section 2.3. The one-to-one line is drawn for reference.

Section 4 provides rigorous definitions of the sampling and averaging procedures and proves certain aspects of the above results.

To understand how many particles we should sample to be confident that the error is likely below some threshold, it is helpful to think about confidence intervals for the reference χ. The 95% confidence intervals for reference values, χref, are shown in as a range Δχ about the sampled values χsampN. This means that, if we measure a χsampN value from a sampled particle population, 95% of the time the true χref value will fall within the range χsampN+Δχ.

Figure 7. 95%-Confidence intervals for sampled χsampN values for sample sizes of N = 10, 100, 1000, and 10,000 particles based on the simulated scenario library described in Section 2.3.

Figure 7. 95%-Confidence intervals for sampled χsampN values for sample sizes of N = 10, 100, 1000, and 10,000 particles based on the simulated scenario library described in Section 2.3.

For small sample sizes N, the confidence interval is highly asymmetric and larger for populations that appear more internally mixed. For example, assuming that a sample of 10 particles is used to compute a sampled χsamp10 value of 20%, the 95% confidence interval for the reference χref for the whole population extends from 15% to 45% (i.e., Δχ ranges from −5 to + 25 percentage points). In contrast, for a χsamp10 value of 90%, the confidence interval extends from 20% to 95% (Δχ from −70 to + 5 points).

For large sample sizes (e.g., N = 10,000), the confidence interval is narrow for large and small sampled χ values. It broadens for intermediate χ values, but remains within ±10 percentage points. It is reasonable that both highly diverse (low χ) and highly homogeneous (high χ) populations can have their mixing state measured well from a reasonably small sample. More complex distributions with intermediate χ require the sampling of more particles to obtain accurate estimates of the mixing state index.

By the central limit theorem, we expect that the sampling error should decrease proportionally to the square root of the number of particles. We can observe this in , where going from N = 1000 to N = 10,000 particles reduced the 95% confidence interval bound by a factor of 3 (from 30 percentage points to 10 points), which is approximately 10, the square root of the increase in the number of samples.

Considering that the true χ value of a population is not known a priori, our results suggest that a sample size of 1000 particles is needed to obtain an estimate of χ within 30 percentage points and 10,000 particles are needed to determine χ within 10 percentage points for any mixing state.

4. Mathematical proofs

In Section 3 we have seen that χ¯sampN is positively biased, which was caused by a positive bias in D¯α,sampN, a negative bias in D¯γ,sampN, and the fact that χ=(Dα1)/(Dγ1). In this section, we will show that the overestimation of Dα and underestimation of Dγ are both consequences of the entropy averaging procedures combined with convexity of the exponential function (for Dα) and concavity of the entropy function (for Dγ). To do this, we will start by precisely defining what we mean by sampling and averaging, and then we will prove the results themselves.

4.1. Notation for sampled particle populations

We consider a reference population of particles to be a set π={μ1,,μN}. We use uppercase letters I, J to denote reference particle indices in π. Each particle is a vector μIRA with coordinates μI=(μI1,,μIA), where each coordinate μIa is the mass of species a in particle I. We use superscripts for species indexes and subscripts for particle indexes.

Consider sampled populations {π1,,πS} from π. Each sampled population πs={μs,1,,μs,Ns} has Ns particles corresponding to reference particle indices Is,1,,Is,Ns. We use lowercase letters i, j for sampled particle indices and we write μs,i for the i-th particle in sample s. The mass of species a in particle i of sample s is thus μs,ia. The sampled particle μs,i is equal to reference particle μI with I=Is,i, so the set of all reference particle indexes in a given sampled population is Is={Is,i|i=1,,Ns}.

Given a per-particle quantity XI, we write Xs,i=XI when the reference index matches the sampled particle index: I=Is,i. To understand sampled diversities versus reference-population diversities, we want to compare mass-weighted averages computed over the reference population, and over sampled populations. To do this, we will now introduce the Iverson bracket and the key averaging lemma (Lemma 1).

The Iverson bracket gives a binary indicator of set membership: (10) [IIs]={1if IIs,0otherwise.(10)

Using the Iverson bracket we can translate between local and global indexes: (11) i=1NsXs,i=I=1N[IIs]XI.(11)

In this article we consider particle populations to be sets, which means that all particles in the population must have at least one species with a different mass. In particular, this excludes monodisperse populations. To overcome this limitation we could consider populations to be multisets in the sense of (Knuth Citation1998, p. 473). Roughly speaking, a multiset is an extension of a set to allow elements to appear multiple times and for which the set union and set union operators have been appropriately extended. For multisets, the equivalent to the Iverson bracket is the multiplicity operator which gives the integer number of occurrences of any particle in the population. All the theoretical results in this article carry through for multisets, but we restrict ourselves to regular sets for convenience.

4.2. Mass fractions and mass-weighted probability distributions

Given a reference population π={μ1,,μN} and sampled populations πs={μs,1,,μs,Ns} for s=1,,S, as described in Section 4.1, we define the total masses: (12) μI=a=1AμIa,  [total mass of reference particleI](12) (13) μtot=I=1NμI,  [total mass of reference population](13) (14) μs,i=a=1Aμs,ia,  [total mass of particleiin sampled populations](14) (15) μs,tot=i=1Nsμs,i,  [total mass of sampled populations](15) (16) μsamp,tot=s=1Sμs,tot.  [total mass of all sampled populations](16)

From this we define the mass fractions (or probabilities): (17) pI=μIμtot,  [mass fraction of reference particleI](17) (18) ps,i=μs,iμs,tot,  [mass fraction of particleiin sampled populations](18) (19) ps,tot=μs,totμsamp,tot.  [mass fraction of sampled populations](19)

Interpreting these mass fractions as probabilities gives mass-weighted probability distributions over particle populations. For example, we can define the distribution pI to be the distribution over reference particles so that the probability IpI of reference particle I is pI. Doing this similarly for ps,i and ps,tot gives us probability distributions over sampled particles and the set of sampled populations. Note that we are using roman-letter subscripts to denote probability distributions.

Consider a quantity X that can be indexed by either the reference particle index, XI, the sampled particle indexes, Xs,i, or the sampled population index, Xs,tot. Then we can compute expected values with respect to the mass-weighted probability distributions by averaging over the corresponding sets: (20) EIpI[XI]=I=1NpIXI=I=1NμIμtotXI,  [reference average](20) (21) Eips,i[Xs,i]=i=1Nsps,iXs,i=i=1Nsμs,iμs,totXs,i,  [sample average](21) (22) Esps,tot[Xs,tot]=s=1Sps,totXs,tot=s=1Sμs,totμsamp,totXs,tot.  [average over samples](22)

4.3. Entropy and diversity measures of mixing state

The entropy H associated with a vector p of mass fractions (equivalently, probabilities) is (23) H(p)=a=1Apalog(pa).(23)

The diversity D is the exponential of the entropy: (24) D(p)=exp(H(p)).(24)

Importantly, entropy is a concave function, which is fundamental to our results on over- and under-estimation of mixing-state measures. In fact, entropy is also log-concave in low dimensions (Alirezaei and Mathar Citation2018), which will be used in the proof of Theorem 4.

The mass fractions in a particle (equivalently, species probability vector) can be indexed by either reference or sampled particle indexes. These are thus: (25) pI=μIμI,  [species mass fractions in reference particleI](25) (26) ps,i=μs,iμs,i.  [species mass fractions in particleiin sampled populations](26) Using the particle species mass fraction vectors, we denote the particle entropy by HI=H(pI) with reference particle indexes and Hs,i=H(ps,i) with sampled particle indexes. Similarly, we write DI and Ds,i for the corresponding diversities.

The α-entropy of a particle population is the mass-weighted average of the particle entropies, with the corresponding α-diversity. That is, α-entropies and α-diversities for our populations are (27) Hα,ref=I=1NpIHI=EIpI[HI],  [α-entropy of reference population](27) (28) Hα,s=i=1Nsps,iHs,i=Eips,i[Hs,i],    [α-entropy of sampled populations](28) (29) Dα,ref=exp(Hα,ref),      [α-diversity of reference population](29) (30) Dα,s=exp(Hα,s).        [α-diversity of sampled populations](30)

The γ-entropy of a population is the entropy of the mass-weighted average composition vector, with the corresponding γ-diversity. That is, while α-entropy is the average of entropies, γ-entropy is the entropy of the average. This gives: (31) Hγ,ref=H(I=1NpIpI)=H(EIpI[pI]),    [γ-entropy of reference population](31) (32) Hγ,s=H(i=1Nsps,ips,i)=H(Eips,i[ps,i]),    [γ-entropy of sampled populations](32) (33) Dγ,ref=exp(Hγ,ref),  [γ-diversity of reference population](33) (34) Dγ,s=exp(Hγ,s).    [γ-diversity of sampled populations](34)

From the population diversities we define the overall mixing-state index of a population to be the affine ratio: (35) χref=Dα,ref1Dγ,ref1,      [mixing-state index of reference population](35) (36) χs=Dα,s1Dγ,s1.      [mixing-state index of sampledpopulations](36)

We are particularly interested in the average sampled entropies and diversities, where the mass-weighted average is taken over all sampled populations. This gives (37) H¯α,samp=Esps[Hα,s],    [average sampledα-entropy](37) (38) H¯γ,samp=Esps[Hγ,s],    [average sampledγ-entropy](38) (39) D¯α,samp=Esps[Dα,s],    [average sampledα-diversity](39) (40) D¯γ,samp=Esps[Dγ,s],    [average sampledγ-diversity](40) (41) χ¯samp=Esps[χs].    [average sampled mixing-state index](41)

4.4. Fair sampling and a fundamental lemma

The sampled populations may all contain the same number of particles, in which case Ns is independent of s, or they may be of different sizes. Our theoretical results apply in either case, so long as we assume that the sampled populations sample all particles fairly. To make this precise, we define NI to be the number of sampled populations containing particle I: (42) NI=s=1S[IIs].(42)

Using this, we make our assumption precise as follows.

Assumption 1 (Uniform sampling). Each particle in the population appears in the same number of sampled populations, so NI = NJ for any I,J=1,,N.

Lemma 1.

Given a per-particle quantity X indexed both by the reference index, XI, and sampled population indexes, Xs,i, the mass-weighted average of X over all particles in all sampled populations is the same as the mass-weighted average over all reference particles: (43) Esps,tot[Eips,i[Xs,i]Xs,tot]=EIpI[XI].(43)

Proof.

Starting from the left hand side (LHS) of (43), we compute: (44) LHS=s=1Sμs,totμsamp,toti=1Nsμs,iμs,totXs,i  [Equations (21)and (22)](44) (45) =1μsamp,tots=1Si=1Nsμs,iXs,i(45) (46) =1s=1Si=1Nsμs,is=1Si=1Nsμs,iXs,i[Equations (15)and (16)](46) (47) =1s=1SJ=1N[JIs]μJs=1SI=1N[IIs]μIXI [Equation (11)](47) (48) =1J=1NμJs=1S[JIs]I=1NμIXIs=1S[IIs](48) (49) =1J=1NμJNJI=1NμIXINI[Equation (42)](49) (50) =1J=1NμJI=1NμIXI[Assumption 1](50) (51) =EIpI[XI].[Equations (13)and (20)](51)

4.5. Theoretical sampling results for mixing-state measures

We are now ready to prove our main results, which describe how the average sampled entropies and diversities relate to the reference entropies and diversities. To summarize, we will show that (52) H¯α,samp=Hα,ref,  [exactly estimateα-entropy from samples](52) (53) H¯γ,sampHγ,ref,  [underestimateγ-entropy from samples](53) (54) D¯α,sampDα,ref,  [overestimateα-diversity from samples](54) (55) D¯γ,sampDγ,ref.  [underestimateγ-diversity fromsamples(only with2or3species)](55)

See and for numerical simulations that confirm (54) and (55), respectively.

The mixing-state index χ is the affine ratio of Dα to Dγ, so overestimating Dα and underestimating Dγ makes it plausible that the average sampled χ¯samp values will overestimate the reference χref. As shown in , numerical simulations on atmospherically relevant particle populations show that χ¯samp values do indeed overestimate χref. However, because Dα,s is correlated with Dγ,s, it is not straightforward to prove a precise relationship between χ¯samp and χref.

In all of the following results we are using mass-weighted averages, as defined in Section 4.2, which is natural because α- and γ-entropy are mass-weighted quantities. We begin by showing that mass-weighted averaging results in reference and sampled α-diversities being equal on average.

Theorem 1

(Sampled α-entropy). If the sampled populations are drawn uniformly from the reference population (Assumption 1) then the average sampled α-entropy is equal to the reference α-entropy: (56) H¯α,samp=Hα,ref.(56)

Proof.

We compute (57) H¯α,samp=Esps[Hα,s]  [Definition of H¯α,samp](57) (58) =Esps[Eips,i[H(ps,i)]]  [Definition of Hα,s](58) (59) =EIpI[H(pI)]  [Lemma 1](59) (60) =Hα,ref.    [Definition of Hα,ref](60)

Next, we show that concavity of entropy means that γ-entropy is consistently underestimated from sampled populations.

Theorem 2

(Sampled γ-entropy). If the sampled populations are drawn uniformly from the reference population (Assumption 1) then the average sampled γ-entropy is less than or equal to the reference γ-entropy: (61) H¯γ,sampHγ,ref.(61)

Proof.

Similarly to Theorem 1, but with expectations and entropy reversed, we compute (62) H¯γ,samp=Esps[Hγ,s]  [Definition of H¯γ,samp](62) (63) =Esps[H(Eips,i[ps,i])]  [Definition of Hγ,s](63) (64) H(Esps[Eips,i[ps,i]])[Jensen's inequality and concavity ofH](64) (65) =H(EIpI[pI])    [Lemma 1](65) (66) =Hγ,ref.    [Definition of Hγ,ref](66)

Having established the average behavior of sampled entropies, we now turn our attention to sampled diversities. We begin by showing that the α-diversity is consistently overestimated from sampled populations, as we saw in . The reason for this overestimation is that the exponential function is convex, as illustrated in .

Figure 8. Schematic to illustrate Theorem 3. Here we consider two sampled populations with α-diversities of Hα,1 and Hα,2, which we assume have average exactly equal to the reference value Hα,ref (this is true on average, as we saw from Theorem 1). The exponential function maps entropies H to diversities D, and because it is convex the average sampled value, D¯α,samp, will be greater than the reference value, Dα,ref.

Figure 8. Schematic to illustrate Theorem 3. Here we consider two sampled populations with α-diversities of Hα,1 and Hα,2, which we assume have average exactly equal to the reference value Hα,ref (this is true on average, as we saw from Theorem 1). The exponential function maps entropies H to diversities D, and because it is convex the average sampled value, D¯α,samp, will be greater than the reference value, Dα,ref.

Theorem 3

(Sampled α-diversity). If the sampled populations are drawn uniformly from the reference population (Assumption 1) then the average sampled α-diversity is greater than or equal to the reference α-diversity: (67) D¯α,sampDα,ref.(67)

Proof.

Using Theorem 1 gives (68) D¯α,samp=Esps[Dα,s]    [Definition of D¯α,samp](68) (69) =Esps[exp(Hα,s)]    [Diversity is the exponential of entropy](69) (70) exp(Esps[Hα,s])  [Jensen's inequality and convexity of exp](70) (71) =exp(Hα,ref)    [Theorem 1](71) (72) =Dα,ref.  [Diversity is the exponential of entropy](72)

Finally, we consider the sampled γ-diversity. Because entropy is only log-concave in dimensions 2 and 3, we are only able to prove a consistent relationship when we have these number of species. As shown in , however, even in higher dimensions we see from numerical simulations that sampled populations tend to underestimate Dγ. illustrates how concavity of the diversity function leads to underestimation.

Figure 9. Schematic to illustrate Theorem 4 in the case of a two-species aerosol population, where p is the mass fraction of the first species. We consider two sampled populations with first-species mass-fractions of p1 and p2, which we assume have average exactly equal to the first-species reference mass fraction of pref (this is exactly true on average by (Equation43)). The diversity function is concave (for 2 or 3 species) so the average sampled value, D¯γ,samp, will be less than the reference value, Dγ,ref.

Figure 9. Schematic to illustrate Theorem 4 in the case of a two-species aerosol population, where p is the mass fraction of the first species. We consider two sampled populations with first-species mass-fractions of p1 and p2, which we assume have average exactly equal to the first-species reference mass fraction of pref (this is exactly true on average by (Equation43(43) Es∼ps,tot[Ei∼ps,i[Xs,i]︸Xs,tot]=EI∼pI[XI].(43) )). The diversity function is concave (for 2 or 3 species) so the average sampled value, D¯γ,samp, will be less than the reference value, Dγ,ref.

Theorem 4

(Sampled γ-diversity). If the sampled populations are drawn uniformly from the reference population (Assumption 1) and the number of species is 2 or 3, then the average sampled γ-diversity is less than or equal to the reference γ-diversity: (73) D¯γ,sampDγ,ref.(73)

Proof.

This proof is almost identical to that of Theorem 2, except we use the fact that D is concave in dimension 2 or 3. Because D(·)=exp(H(·)), or equivalently H(·)=log(D(·)), concavity of D is equivalent to log-concavity of H. As shown in Alirezaei and Mathar (Citation2018, Theorem 16), H is log-concave if and only if the dimension (number of species) is 2 or 3.

Assuming we have 2 or 3 species and thus concave D, we compute (74) D¯γ,samp=Esps[Dγ,s]  [Definition of D¯γ,samp](74) (75) =Esps[D(Eips,i[ps,i])]  [Definition of Dγ,s](75) (76) D(Esps[Eips,i[ps,i]])  [Jensen's inequality and concavity ofD](76) (77) =D(EIpI[pI])  [Lemma 1](77) (78) =Dγ,ref.  [Definition of Dγ,ref](78)

As we see from the above proofs, we now understand the source of the over/under-estimation of the average sampled mixing state index and diversity metrics that we observed from simulations and experimental data in Section 3. The consistent overestimation of Dα is a consequence of the exact averaging of Hα (Theorem 1) combined with the convexity of the exponential function (Theorem 3). The underestimation of Dγ (and Hγ), on the other hand, is due to the concavity of the (log-)entropy function (Theorems 2 and 4). By the central limit theorem, all of these over/under-estimations will decrease at a rate of 1/N as the number of sampled particles, N, increases.

5. Conclusions

Single-particle instruments necessarily use finite particle samples to determine population-level quantities, with the sample size being determined by practical considerations of data acquisition. In this study we developed a method to determine confidence intervals for a population-level quantity, the mixing-state index χ, that is determined from particle-level information. We accomplished this by using model-generated particle populations as a reference, which were subsequentially subsampled. Both numerical and mathematical analyses revealed that finite particle samples introduce a positive bias in the estimation of the diversity metric Dα (the average particle species diversity), and a negative bias in Dγ (the bulk diversity), which overall results in a positive bias in the estimation of χ. These results are consistent with the measurements using the aerosol samples of the Ye et al. (Citation2018) study of Pittsburgh. The confidence interval for χ, not surprisingly, depends on the mixing state itself.

A sample size of 1000 particles allows an estimate of χ within 30 percentage points, and 10,000 particles are needed to determine χ within 10 percentage points, for any mixing state. This approach could be extended to the measurement of other population-level quantities that are estimated based on particle samples. Furthermore, it may be important in practice to consider measurement error (not just undersampling) when calculating χ, or to extend χ to include species similarity (Leinster and Cobbold Citation2012) or functional diversity (Scheiner et al. Citation2017).

Data availability

The output of the particle-resolved modeling scenario library can be accessed at https://doi.org/10.13012/B2IDB-2774261_V1.

Additional information

Funding

We acknowledge funding from NSF AGS-1254428, NSF AGS-1543786, NSF CHE-1807530, and DOE ASR DE-SC0019192. This publication was developed under assistance agreement RD83587301 awarded by the U.S. Environmental Protection Agency as part of the Center for Air, Climate, and Energy Solutions. It has not been formally reviewed by EPA. The views expressed in this document are solely those of authors and do not necessarily reflect those of the Agency. EPA does not endorse any products or commercial services mentioned in this publication. This research is part of the Blue Waters sustained petascale computing project, which is supported by the National Science Foundation (awards OCI-0725070 and ACI-1238993) and the state of Illinois. Blue Waters is a joint effort of the University of Illinois at Urbana-Champaign and its National Center for Supercomputing Applications.

References

  • Alirezaei, G., and R. Mathar. 2018. On exponentially concave functions and their impact in information theory. In Proceedings of the 2018 Information Theory and Applications Workshop (ITA). doi:10.1109/ITA.2018.8503202.
  • Beck, J., and W. Schwanghart. 2010. Comparing measures of species diversity from incomplete inventories: An update. Methods Ecol. Evol. 1 (1):38–44. 00003.x. doi:10.1111/j.2041-210X.2009.
  • Beck, J., J. D. Holloway, and W. Schwanghart. 2013. Undersampling and the measurement of beta diversity. Methods Ecol. Evol. 4 (4):370–82. doi:10.1111/2041-210x.12023.
  • Beydoun, H., M. Polen, and R. C. Sullivan. 2017. A new multicomponent heterogeneous ice nucleation model and its application to snomax bacterial particles and a snomax–illite mineral particle mixture. Atmos. Chem. Phys. 17 (22):13545–57. doi:10.5194/acp-17-13545-2017.
  • Bondy, A. L., D. Bonanno, R. C. Moffet, B. Wang, A. Laskin, and A. P. Ault. 2018. The diverse chemical mixing state of aerosol particles in the southeastern United States. Atmos. Chem. Phys. 18 (16):12595–612. doi:10.5194/acp-18-12595-2018.
  • Brocklehurst, N., M. O. Day, and J. Fröbisch. 2018. Accounting for differences in species frequency distributions when calculating beta diversity in the fossil record. Methods Ecol. Evol. 9 (6):1409–20. doi:10.1111/2041-210X.13007.
  • Butturi-Gomes, D., M. Petrere, H. C. Giacomini, and S. S. Zocchi. 2017. Statistical performance of a multicomparison method for generalized species diversity indices under realistic empirical scenarios. Ecol. Indic. 72:545–52. doi:10.1016/j.ecolind.2016.08.054.
  • Chao, A., and L. Jost. 2015. Estimating diversity and entropy profiles via discovery rates of new species. Methods Ecol. Evol. 6 (8):873–82. doi:10.1111/2041-210X.12349.
  • Chao, A., Y. T. Wang, and L. Jost. 2013. Entropy and the species accumulation curve: A novel entropy estimator via discovery rates of new species. Methods Ecol. Evol. 4 (11):1091–100. doi:10.1111/2041-210X.12108.
  • Ching, J., J. Fast, M. West, and N. Riemer. 2017. Metrics to quantify the importance of mixing state for CCN activity. Atmos. Chem. Phys. 17 (12):7445–58. doi:10.5194/acp-17-7445-2017.
  • Ching, J., N. Riemer, and M. West. 2016. Black carbon mixing state impacts on cloud microphysical properties: Effects of aerosol plume and environmental conditions. J. Geophys. Res. Atmos. 121 (10):5990–6013. doi:10.1002/2016JD024851.
  • Daly, A. J., J. M. Baetens, and B. D. Baets. 2018. Ecological diversity: Measuring the unmeasurable. Mathematics 6 (7):119. doi:10.3390/math6070.
  • Dickau, M., J. Olfert, M. E. J. Stettler, A. Boies, A. Momenimovahed, K. Thomson, G. Smallwood, and M. Johnson. 2016. Methodology for quantifying the volatile mixing state of an aerosol. Aerosol. Sci. Technol. 50 (8):759–72. doi:10.1080/02786826.2016.1185509.
  • Fraund, M., D. Q. Pham, D. Bonanno, T. H. Harder, B. Wang, J. Brito, S. S. De Sá, S. Carbone, S. China, P. Artaxo, et al. 2017. Elemental mixing state of aerosol particles collected in Central Amazonia during GoAmazon2014/15. Atmosphere 8 (12):173. doi:10.3390/atmos8090173.
  • Gasparik, J. T., Q. Ye, J. H. Curtis, A. A. Presto, N. M. Donahue, R. C. Sullivan, M. West, and N. Riemer. 2020. Data from: Quantifying errors in the aerosol mixing-state index based on limited particle sample size [dataset]. University of Illinois at Urbana-Champaign. doi:10.13012/B2IDB-2774261_V1.
  • Haegeman, B., J. Hamelin, J. Moriarty, P. Neal, J. Dushoff, and J. S. Weitz. 2013. Robust estimation of microbial diversity in theory and in practice. ISME J. 7 (6):1092–101. doi:10.1038/ismej.2013.10.
  • Healy, R. M., N. Riemer, J. C. Wenger, M. Murphy, M. West, L. Poulain, A. Wiedensohler, I. P. O'Connor, E. McGillicuddy, J. R. Sodeau, et al. 2014. Single particle diversity and mixing state measurements. Atmos. Chem. Phys. 14 (12):6289–99., doi:10.5194/acp-14-6289-2014.
  • Healy, R., J. Sciare, L. Poulain, M. Crippa, A. Wiedensohler, A. Prévôt, U. Baltensperger, R. Sarda-Estève, M. McGuire, C.-H. Jeong, et al. 2013. Quantitative determination of carbonaceous particle mixing state in Paris using single-particle mass spectrometer and aerosol mass spectrometer measurements. Atmos. Chem. Phys. 13 (18):9479–96. doi:10.5194/acp-13-9479-2013.
  • Hughes, M., J. K. Kodros, J. R. Pierce, M. West, and N. Riemer. 2018. Machine learning to predict the global distribution of aerosol mixing state metrics. Atmosphere 9 (1):15. doi:10.3390/atmos9010015.
  • Knopf, D. A., and P. A. Alpert. 2013. A water activity based model of heterogeneous ice nucleation kinetics for freezing of water and aqueous solution droplets. Faraday Discuss. 165:513–34. doi:10.1039/c3fd00035d.
  • Knuth, D. E. 1998. The art of computer programming, volume 2: Seminumerical Algorithms. 3rd ed. Boston, MA: Addison Wesley.
  • Leinster, T., and C. A. Cobbold. 2012. Measuring diversity: The importance of species similarity. Ecology 93 (3):477–89. doi:10.1890/10-2402.1.
  • Lesins, G., P. Chylek, and U. Lohmann. 2002. A study of internal and external mixing scenarios and its effect on aerosol optical properties and direct radiative forcing. J. Geophys. Res. 107 (D10):AAC 5-1–4106. doi:10.1029/2001JD000973.
  • Marion, Z. H. 2016. On the quantification of complexity and diversity from phenotypes to ecosystems. Ph.D. thesis., University of Tennessee.
  • Marion, Z. H., J. A. Fordyce, and B. M. Fitzpatrick. 2018. A hierarchical Bayesian model to incorporate uncertainty into methods for diversity partitioning. Ecology 99 (4):947–56. doi:10.1002/ecy.2174.
  • Moffet, R. C., and K. A. Prather. 2009. In-situ measurements of the mixing state and optical properties of soot with implications for radiative forcing estimates. Proc. Natl. Acad. Sci. USA. 106 (29):11872–7. doi:10.1073/pnas.0900040106.
  • O'Brien, R. E., B. Wang, A. Laskin, N. Riemer, M. West, Q. Zhang, Y. Sun, X. ‐Y. Yu, P. Alpert, D. A. Knopf, et al. 2015. Chemical imaging of ambient aerosol particles: Observational constraints on mixing state parameterization. J. Geophys. Res. Atmos. 120 (18):9591–605. doi:10.1002/2015JD023480.
  • Prather, K. A., C. D. Hatch, and V. H. Grassian. 2008. Analysis of atmospheric aerosols. Annu Rev Anal Chem (Palo Alto Calif) 1:485–514. doi:10.1146/annurev.anchem.1.031207.113030.
  • Riemer, N., A. Ault, M. West, R. Craig, and J. Curtis. 2019. Aerosol mixing state: Measurements, modeling, and impacts. Rev. Geophys. 57 (2):187–249. doi:10.1029/2018RG000615.
  • Riemer, N., and M. West. 2013. Quantifying aerosol mixing state with entropy and diversity measurements. Atmos. Chem. Phys. 13 (22):11423–39. doi:10.5194/acp-13-11423-2013.
  • Riemer, N., M. West, R. Zaveri, and R. Easter. 2009. Simulating the evolution of soot mixing state with a particle-resolved aerosol model. J. Geophys. Res 114 (D9):D09202. doi:10.1029/2008JD011073.
  • Ryder, O. S., A. P. Ault, J. F. Cahill, T. L. Guasco, T. P. Riedel, L. A. Cuadra-Rodriguez, C. J. Gaston, E. Fitzgerald, C. Lee, K. A. Prather, et al. 2014. On the role of particle inorganic mixing state in the reactive uptake of N2O5 to ambient aerosol particles. Environ. Sci. Technol. 48 (3):1618–27. doi:10.1021/es4042622.
  • Scheiner, S. M., E. Kosman, S. J. Presley, and M. R. Willig. 2017. Decomposing functional diversity. Methods Ecol. Evol. 8 (7):809–20. doi:10.1111/2041-210X.12696.
  • Schell, B., I. J. Ackermann, H. Hass, F. S. Binkowski, and A. Ebel. 2001. Modeling the formation of secondary organic aerosol within a comprehensive air quality modeling system. J. Geophys. Res. 106 (D22):28275–93. doi:10.1029/2001JD000384.
  • Sherwin, W. B., and N. Prat I Fornells. 2019. The introduction of entropy and information methods to ecology by Ramon Margalef. Entropy 21 (8):794. doi:10.3390/e21080794.
  • Sullivan, R., M. Moore, M. Petters, S. Kreidenweis, G. Roberts, and K. Prather. 2009. Effect of chemical mixing state on the hygroscopicity and cloud nucleation properties of calcium mineral dust particles. Atmos. Chem. Phys. 9 (10):3303–16. doi:10.5194/acp-9-3303-2009.
  • Winkler, P. 1973. The growth of atmospheric aerosol particles as a function of the relative humidity-II. An improved concept of mixed nuclei. J. Aerosol Sci. 4 (5):373–87. doi:10.1016/0021-8502(73)90027-X.
  • Ye, Q., P. Gu, H. Z. Li, E. S. Robinson, E. Lipsky, C. Kaltsonoudis, A. K. Y. Lee, J. S. Apte, A. L. Robinson, R. C. Sullivan, et al. 2018. Spatial variability of sources and mixing state of atmospheric particles in a metropolitan area. Environ. Sci. Technol. 52 (12):6807–15., doi:10.1021/acs.est8b01011.
  • Zaveri, R., and L. Peters. 1999. A new lumped structure photochemical mechanism for large-scale applications. J. Geophys. Res. 104 (D23):30387–415. doi:10.1029/1999JD900876.
  • Zaveri, R., R. Easter, and A. Wexler. 2005b. A new method for multicomponent activity coefficients of electrolytes in aqueous atmospheric aerosols. J. Geophys. Res. 110 (D2), D02201. doi:10.1029/2004JD004681.
  • Zaveri, R., R. Easter, and L. Peters. 2005a. A computationally efficient multicomponent equilibrium solver for aerosols (MESA). J. Geophys. Res. 110 (D24), D24203. doi:10.1029/2004JD005618.
  • Zaveri, R., R. Easter, J. Fast, and L. Peters. 2008. Model for simulating aerosol interactions and chemistry (MOSAIC). J. Geophys. Res. 113 (D13), D13204. doi:10.1029/2007JD008782.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.