Search in:

Inverse Problems in Science and Engineering Volume 26, 2018 - Issue 2: Inverse Problems and Techniques with Applications in the Biological Sciences

Submit an article Journal homepage

Free access

378

Views

CrossRef citations to date

Altmetric

Listen

Articles

Estimating the rate of prion aggregate amplification in yeast with a generation and structured population model

H. T. Banksa Department of Mathematics, Center for Quantitative Sciences in Biomedicine, Center for Research in Scientific Computation, North Carolina State University , Raleigh, NC, USA.Correspondence[email protected]
View further author information

Kevin B. Floresa Department of Mathematics, Center for Quantitative Sciences in Biomedicine, Center for Research in Scientific Computation, North Carolina State University , Raleigh, NC, USA.View further author information

Christine R. Langloisb Max Planck Institute for Biochemistry , Martinsried, Germany.View further author information

Tricia R. Serioc Department of Molecular and Cellular Biology, The University of Arizona , Tucson, AZ, USA.View further author information

Suzanne S. Sindid Department of Applied Mathematics, School of Natural Sciences, University of California , Merced, CA, USA.View further author information

Pages 257-279 | Received 27 Mar 2017, Accepted 31 Mar 2017, Published online: 18 Apr 2017

Cite this article
https://doi.org/10.1080/17415977.2017.1316498
CrossMark

In this article

Abstract
1. Introduction
2. Model formulation
3. Inverse problem formulation and analysis
4. Parameter inference on experimental data
5. Discussion and concluding remarks
Acknowledgements
Additional information
Footnotes
References
Appendixes

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

Prions are a special class of proteins capable of adopting multiple (misfolded) conformations, some of which have been associated with fatal diseases in mammals such as bovine spongiform encephalopathy or Creutzfeldt–Jakob Disease. Prion diseases, like protein misfolding diseases in general, are caused by the formation and amplification of ordered aggregates of proteins called amyloids. While such diseases in mammals can take decades to form, yeast have a variety of prion phenotypes that occur over a few hours, making this system an ideal model for protein misfolding disease in general. Most experimental assays of colonies with yeast prions provide steady-state population observations which complicate the inference of biochemical parameters both by the inability to directly measure aggregate amplification and by obscuring heterogeneity between cells. We develop a mathematical and inverse problem formulation to determine the amplification rate with prion aggregates from single-cell measurements observed in propagon amplification experiments. We demonstrate the ability of our formulation to determine heterogeneous amplification rates on simulated and experimental data. Our results show that aggregate amplification rates for two prion variants are strongly bimodal, suggesting that the generational structure in the yeast population impacts the ability of prion aggregates to amplify.

Keywords:

Structured population model
aggregate data
inverse problem
PDEs
prion

AMS Subject Classifications:

General biology and biomathematics

1. Introduction

Prions are misfolded proteins associated with a variety of fatal, untreatable neurodegenerative disorders in mammals [Citation1–Citation3]. Many of these diseases are associated with aging and, as such, visible symptoms can take decades to manifest. Because of this, the single celled yeast S. cerevisiae has emerged as a model for prion onset because prion phenotypes appear within hours. Additionally, since prions in yeast are associated with harmless heritable phenotypes, rather than disorders, this system allows a unique opportunity to observe prions not possible for mammals. Finally, the experimental tractability of the yeast system facilitates the observations of prion dynamics and manipulations in vivo [Citation4,Citation5].

However, a significant complication introduced by studying prions in yeast is that – unlike the mammalian counterpart – prions in yeast propagate within a dividing yeast colony [Citation6–Citation8]. Most mathematical approaches [Citation9–Citation13] neglect population heterogeneity and simply model aggregate dynamics in the context of an exponentially growing population of cells [Citation14]. Such models therefore neglect biased protein segregation during cell division as well as asymmetries in the cell cycle – both of which have been shown to be important in the yeast prion system [Citation15,Citation16]. In addition, most comparisons relied on steady-state population level experiments, which further mask heterogeneity between cells. Although stochastic simulations have proven useful for studying heterogeneity, such models are not amenable to standard analytical tools. Further, stochastic approaches are not readily used for parameter inference, as it is computationally intensive to sample from parameter space. There is a need for robust mathematical approaches to estimate biochemical parameters at the level of individual cells through comparisons with single-cell experimental data.

We provide the first mathematical treatment of yeast prion aggregates in a generational and aggregate structured population model. We demonstrate that we are able to decompose our problem from a system of PDEs into a related system of ODEs plus a single PDE with an explicit analytical solution. We formulate an inverse problem for the estimation of prion aggregate growth rates based on comparisons with propagon amplification assays is formulated. For two prion variants, we identify bimodality in the distribution of amplification rates, suggesting heterogeneity in the ability of individual cells to create aggregates. This work demonstrates that parameter inference on noisy single-cell data is indeed feasible and suggests further lines of study.

2. Model formulation

2.1. Generation and structured population model

Our model combines intracellular aggregate dynamics with a model structured by the number of divisions a cell has undergone since the start of the experiment. Typically, prion aggregate dynamics have been modelled with the nucleated polymerization equations (NPM) introduced by Masel, Jensen and Nowak [Citation17]. In this model one explicitly considers the evolution of both the number and distribution of aggregate sizes through the processes of conversion of normal protein by existing prion aggregates and fragmentation of prion aggregates into smaller templates. Formally, the NPM consists of a system of infinitely many ODEs when discrete aggregate sizes are modeled, or a coupled ODE/PDE system when continuous aggregate sizes are considered [Citation18,Citation19]. The NPM model has been adapted to consider prion aggregates existing within a continually dividing colony of yeast cells where prion aggregates are transmitted between cells during division. In yeast the typical approach has been to attempt to quantify prion variants in terms of their steady-state balance between soluble and aggregated protein. However, this single observation does not uniquely specify all parameters [Citation11,Citation14,Citation15].

As mentioned in Section 1, our goal is to provide a quantitative model suitable for comparison with single-cell measurements. As will be described in Section 3.1, single-cell experiments are only capable of measuring the number of aggregates in a single cell and not their size. Rather than explicitly model the full aggregate size distribution, or its simplified moment closure (as in [Citation11,Citation14]), we consider only the number of aggregates present in a cell at a particular time. To capture the behaviour of the NPM, we consider the number of aggregates present in a cell at any time as evolving according to the logistic equation where $λ$ represents the maximum rate of aggregate amplification and $K > 0$ the steady-state number of aggregates in a cell as $t \to \infty .$ Let y(t) be the number of aggregates a cell has t minutes after dividing, and following Banks, Flores and Sindi [Citation20], we define this evolution by(1) $\begin{matrix} \frac{d y}{d t} = \frac{λ y (K - y)}{K} : = v (y) \end{matrix}$ (1)

where $y (0) = y_{0}$ . Let N(t, y) be the density of cells with y aggregates at time t. Then this density will evolve in time according to the PDE(2) $\begin{matrix} \frac{\partial}{\partial t} N (t, y) + \frac{\partial}{\partial y} (v (y) N (t, y)) = 0 . \end{matrix}$ (2)

To capture the full dynamics of the system, where aggregates exist in dividing cells, we consider $N_{i} (t, y)$ , the density of cells that have undergone i divisions since the start of the experiment. As such, the aggregate dynamics in the population of dividing cells is given by the following system of coupled PDEs:(3) $\begin{matrix} \begin{matrix} \frac{\partial N_{0}}{\partial t} + \frac{\partial (v (y) N_{0} (t, y))}{\partial y} & = - (α_{0} (t) + β_{0} (t)) N_{0} (t, y) \\ \frac{\partial N_{1}}{\partial t} + \frac{\partial (v (y) N_{1} (t, y))}{\partial y} & = - (α_{1} (t) + β_{1} (t)) N_{1} (t, y) + R_{1} (t, y) \\ ⋮ \\ \frac{\partial N_{M}}{\partial t} + \frac{\partial (v (y) N_{M} (t, y))}{\partial y} & = - (α_{M} (t) + β_{M} (t)) N_{M} (t, y) + R_{M} (t, y) . \end{matrix} \end{matrix}$ (3)

Here the right side of each PDE represents the rates of loss of cells in generation i from division ( $α_{i} (t)$ ) and death ( $β_{i} (t)$ ) as well as the influx of newly divided cells from generation $i - 1$ to generation i: $R_{i} (t, y) = 2 γ α_{i - 1} (t) N_{i - 1} (t, γ y) .$

The parameter value $γ = 2$ corresponds to unbiased division where aggregates are equally split between cells when a cell of generation $(i - 1)$ creates two cells in generation i [Citation21,Citation22]. We note that this use of generation refers to the number of divisions cells have undergone since the start of the experiment and not – as is common for yeast cells – the total number of divisions since birth. Although the biological system consists of infinitely many generations, in most experimental settings the maximum number of cell divisions can be bounded, and thus our system (Equation3(3) $\begin{matrix} \begin{matrix} \frac{\partial N_{0}}{\partial t} + \frac{\partial (v (y) N_{0} (t, y))}{\partial y} & = - (α_{0} (t) + β_{0} (t)) N_{0} (t, y) \\ \frac{\partial N_{1}}{\partial t} + \frac{\partial (v (y) N_{1} (t, y))}{\partial y} & = - (α_{1} (t) + β_{1} (t)) N_{1} (t, y) + R_{1} (t, y) \\ ⋮ \\ \frac{\partial N_{M}}{\partial t} + \frac{\partial (v (y) N_{M} (t, y))}{\partial y} & = - (α_{M} (t) + β_{M} (t)) N_{M} (t, y) + R_{M} (t, y) . \end{matrix} \end{matrix}$ (3) ) considers cells only up to a specified maximum generation M. The solution of (Equation3(3) $\begin{matrix} \begin{matrix} \frac{\partial N_{0}}{\partial t} + \frac{\partial (v (y) N_{0} (t, y))}{\partial y} & = - (α_{0} (t) + β_{0} (t)) N_{0} (t, y) \\ \frac{\partial N_{1}}{\partial t} + \frac{\partial (v (y) N_{1} (t, y))}{\partial y} & = - (α_{1} (t) + β_{1} (t)) N_{1} (t, y) + R_{1} (t, y) \\ ⋮ \\ \frac{\partial N_{M}}{\partial t} + \frac{\partial (v (y) N_{M} (t, y))}{\partial y} & = - (α_{M} (t) + β_{M} (t)) N_{M} (t, y) + R_{M} (t, y) . \end{matrix} \end{matrix}$ (3) ) requires specification of an initial condition for each i,(4) $\begin{matrix} N_{i} (0, y) = ϕ_{i} (y) . \end{matrix}$ (4)

We also need to impose no-flux boundary conditions at $y = 0$ to solve the PDE system(5) $\begin{matrix} v (y) N_{i} {(t, y) |}_{y = 0} = 0 \end{matrix}$ (5)

for all $0 \leq i \leq M$ .

2.2. Population decomposition and analytic solution

We first note that our structured model facilitates computation of a variety of quantities related to the yeast cell population. For example, the total number of cells having undergone i divisions is given by $\bar{N_{i}} (t) = \int_{0}^{\infty} N_{i} (t, y) d y$

and the normalized density of aggregates can be defined by $n_{i} (t, y) = \frac{N_{i} (t, y)}{\bar{N_{i}} (t)}, for \bar{N_{i}} (t) > 0$

and $n_{i} (t, y) = 0$ otherwise. We further note that these quantities facilitate determining an analytical solution to the system (Equation3(3) $\begin{matrix} \begin{matrix} \frac{\partial N_{0}}{\partial t} + \frac{\partial (v (y) N_{0} (t, y))}{\partial y} & = - (α_{0} (t) + β_{0} (t)) N_{0} (t, y) \\ \frac{\partial N_{1}}{\partial t} + \frac{\partial (v (y) N_{1} (t, y))}{\partial y} & = - (α_{1} (t) + β_{1} (t)) N_{1} (t, y) + R_{1} (t, y) \\ ⋮ \\ \frac{\partial N_{M}}{\partial t} + \frac{\partial (v (y) N_{M} (t, y))}{\partial y} & = - (α_{M} (t) + β_{M} (t)) N_{M} (t, y) + R_{M} (t, y) . \end{matrix} \end{matrix}$ (3) ). Following previous approaches, [Citation20,Citation23], we define the initial number of cells:(6) $\begin{matrix} {\bar{N}}_{0, 0} = \int_{0}^{\infty} N_{0, 0} (y) d y \end{matrix}$ (6)

and the initial normalized aggregate density:(7) $\begin{matrix} {\bar{n}}_{0, 0} (y) = \frac{N_{0, 0} (y)}{{\bar{N}}_{0, 0}} . \end{matrix}$ (7)

The solution to our system is given by the following Theorem (generalized [Citation20] from Hasenauer et al. [Citation23]):

Theorem 2.1:

The solution of the system defined by (Equation3(3) $\begin{matrix} \begin{matrix} \frac{\partial N_{0}}{\partial t} + \frac{\partial (v (y) N_{0} (t, y))}{\partial y} & = - (α_{0} (t) + β_{0} (t)) N_{0} (t, y) \\ \frac{\partial N_{1}}{\partial t} + \frac{\partial (v (y) N_{1} (t, y))}{\partial y} & = - (α_{1} (t) + β_{1} (t)) N_{1} (t, y) + R_{1} (t, y) \\ ⋮ \\ \frac{\partial N_{M}}{\partial t} + \frac{\partial (v (y) N_{M} (t, y))}{\partial y} & = - (α_{M} (t) + β_{M} (t)) N_{M} (t, y) + R_{M} (t, y) . \end{matrix} \end{matrix}$ (3) ) with initial initial conditions (Equation6(6) $\begin{matrix} {\bar{N}}_{0, 0} = \int_{0}^{\infty} N_{0, 0} (y) d y \end{matrix}$ (6) ) and (Equation7(7) $\begin{matrix} {\bar{n}}_{0, 0} (y) = \frac{N_{0, 0} (y)}{{\bar{N}}_{0, 0}} . \end{matrix}$ (7) ) is given by:(8) $\begin{matrix} N_{i} (t, y) = {\bar{N}}_{i} (t) n_{i} (t, y) for 0 \leq i \leq M, \end{matrix}$ (8)

in which:

(1)	${\bar{N}}_{i} (t)$ is the solution of the system of ODEs: $\begin{matrix} i = 0 & : & \frac{d {\bar{N}}_{0}}{d t} = - (α_{0} (t) + β_{0} (t)) {\bar{N}}_{0} \\ for 1 \leq i \leq M & : & \frac{d {\bar{N}}_{i}}{d t} = - (α_{i} (t) + β_{i} (t)) {\bar{N}}_{i} + 2 α_{i - 1} {\bar{N}}_{i - 1} \end{matrix}$ where ${\bar{N}}_{0} (0) = {\bar{N}}_{0, 0}$ and $for i \geq 1 : {\bar{N}}_{i} (0) = 0 .$
(2)	$n_{i} (t, y)$ is the solution of the PDE(9) $\begin{matrix} \frac{\partial n_{i} (t, y)}{\partial t} + \frac{\partial (f (y) n_{i} (t, y))}{\partial y} = 0 \end{matrix}$ (9) with initial conditions $for all i \geq 0 : n_{i} (0, y) : = γ^{i} n_{0, 0} (γ^{i} y)$ .

In Hausenauer et al. [Citation23], the authors derive a solution to $n_{i} (t, y)$ under a specific form of the flux term $f (y) = - k y$ . However, this method applies to more general flux terms as described in [Citation20]. In order to prove the theorem, the method of characteristics is first used to determine an explicit solution to Equation (Equation9(9) $\begin{matrix} \frac{\partial n_{i} (t, y)}{\partial t} + \frac{\partial (f (y) n_{i} (t, y))}{\partial y} = 0 \end{matrix}$ (9) ):(10) $\begin{matrix} n_{i} (t, y) = γ^{i} n_{0, 0} (γ^{i} g (t, y)) \frac{e^{- λ t}}{K^{2}} {(g (t, y) (e^{λ t} - 1) + K)}^{2}, \end{matrix}$ (10)

where $g (t, y) = K y {(K e^{λ t} - y e^{λ t} + y)}^{- 1}$ gives the initial conditions of the characteristics under the assumption of logistic aggregate growth. Direct substitution of $\bar{N_{i}} (t) n_{i} (t, y)$ into Equation (Equation3(3) $\begin{matrix} \begin{matrix} \frac{\partial N_{0}}{\partial t} + \frac{\partial (v (y) N_{0} (t, y))}{\partial y} & = - (α_{0} (t) + β_{0} (t)) N_{0} (t, y) \\ \frac{\partial N_{1}}{\partial t} + \frac{\partial (v (y) N_{1} (t, y))}{\partial y} & = - (α_{1} (t) + β_{1} (t)) N_{1} (t, y) + R_{1} (t, y) \\ ⋮ \\ \frac{\partial N_{M}}{\partial t} + \frac{\partial (v (y) N_{M} (t, y))}{\partial y} & = - (α_{M} (t) + β_{M} (t)) N_{M} (t, y) + R_{M} (t, y) . \end{matrix} \end{matrix}$ (3) ) completes the proof (see [Citation23] and [Citation20] for further details).

3. Inverse problem formulation and analysis

We first describe the experimental propagon amplification assay used to collect the single-cell aggregate measurements. We develop an inverse problem formulation and demonstrate that we can accurately infer kinetic parameters on simulated data sets.

3.1. Experimentally quantifying aggregates

For the Sup35/[ $P S I^{+}$ ] yeast prion system, propagon amplification (biologists refer to the set of transmissible aggregates as propagons [Citation4]) experiments provide single-cell measurements that are a valuable alternative to the standard steady-state population level experiments used to estimate the typical number of transmissible aggregates (propagons) in a single cell [Citation7,Citation8,Citation24,Citation25]. The key feature exploited in these experiments is that when [ $P S I^{+}$ ] yeast are treated with guanidine hydrochloride (GdnHCl), the process of aggregate fragmentation is stopped, but other cellular processes such as cell division continue as normal. Since under GdnHCl treatment the total number of aggregates cannot increase (or decrease) and all transmissible aggregates will be diluted through cell division and, neglecting cell death, eventually there will be at most one per cell [Citation8,Citation26] (see Figure ). After the removal of GdnHCl, aggregates will again continue to increase in number through the conversion and fragmentation process.

Figure 1. Propagon Amplificaiton Assay. A two-step process is used to count the number of transmissible prion aggregates (propagons) in a single cell. (Left) A single target cell is isolated, and aggregate fragmentation is stopped through exposure to GdnHCl. Since aggregates (pinwheels) cannot increase in number, they are diluted through cell division. (Right) After sufficient dilution, the colony is replated onto solid medium, in the absence of GdnHCl, where each single cell serves as a founder of a distinct yeast colony. Colonies whose founding cell had at least one aggregate will have the [ $P S I^{+}$ ] prion phenotype (white) while those founded by a cell with no aggregate will have the [ $p s i^{-}$ ] non-prion phenotype (red). The number of aggregates in the target cell corresponds to the number of white colonies in the plate.

Propagon amplification experiments involve the application of GdnHCl in tandem to monitor the rate of aggregate recovery by measuring the number of propagons in a single cell [Citation14,Citation27]. First, cells are treated with GdnHCl in order to ensure all cells have at most one aggregate. Second, cells from this GdnHCl treatment are removed and allowed to grow under normal conditions for specified periods of time during which the total number of aggregates will increase and cells may divide. Finally, propagon counting assays (see Figure) are performed on cells resulting from the second step. The result is measurements of the number of aggregates in a randomly chosen cell at time $t > 0$ from a population founded by an initial cell having a single aggregate.

Figure 2. Experimental Propagon Amplification Data. (Left) Propagon counts for the WT variant, (Right) Propagon counts for the R15 variant.

In this work, we consider recent propagon amplification experiments performed on two Sup35 variants which differ in the number of oligopeptide repeats in the prion domain. The variant ‘WT’ represents the wild-type (i.e. standard) version of Sup35 with 5.5 oligopeptide repeats and ‘R15’ which has only the first 5 repeats (i.e. the partial oligopeptide repeat is missing). Both Sup35 variants are capable of forming aggregates and conferring the [ $P S I^{+}$ ] prion phenotype. Figure shows the result of experiments on both prion variants. The question we pursue in this analysis is: Are there differences in the rate of aggregate amplification, $λ$ , between these two variants?

3.2. Growth Rate Distribution Models and Inverse Problem Formulation

In this formulation we make several simplifying assumptions on both the yeast cells and aggregates themselves. While in practice yeast cell division and death rates are time varying, and have been well studied [Citation25,Citation28,Citation29], for simplicity we assume that division rates $α_{i} (t)$ are the same for all cells and generations i ( $α_{i} (t) = 0.0077$ 1/min) and that cell death is negligible, $β_{i} (t) = 0$ . Further, we assume the initial distribution of aggregates to be given by a truncated normal distribution with $μ = 1$ and $σ = . 1$ . The normal distribution is chosen for convenience but is consistent with the initial experimental step where cells are driven to have only a single aggregate (see Figure ). Finally, while there are two parameters K and $λ$ in the logistic model of prion dynamics (see Equation (Equation1(1) $\begin{matrix} \frac{d y}{d t} = \frac{λ y (K - y)}{K} : = v (y) \end{matrix}$ (1) )), because prion curing assays have been used to estimate K, the steady-state number of aggregates in a single cell [Citation7,Citation24,Citation25], we take this value as known ( $K = 200$ ) and focus effort on only estimating the growth rate $λ$ .

Suppose our experimental observations consist of numbers of aggregates ${y_{i}}_{i = 1}^{n}$ observed at times ${t_{i}}_{i = 1}^{n}$ . We assume that these experimentally observed counts reflect the expected number of aggregates in a cell chosen at that time as given below in Equation (Equation11(11) $\begin{matrix} E [y (t; λ)] = \frac{\sum_{i = 0}^{\infty} \int_{0}^{\infty} ξ N_{i} (t, ξ; λ) d ξ}{N (t)}, \end{matrix}$ (11) )(11) $\begin{matrix} E [y (t; λ)] = \frac{\sum_{i = 0}^{\infty} \int_{0}^{\infty} ξ N_{i} (t, ξ; λ) d ξ}{N (t)}, \end{matrix}$ (11)

where $N_{i} (t, ξ; λ)$ is the solution to the aggregate density problem for a given growth rate $λ$ , number of propagons $ξ$ , and generation i since the start of the experiment (see Equation (Equation9(9) $\begin{matrix} \frac{\partial n_{i} (t, y)}{\partial t} + \frac{\partial (f (y) n_{i} (t, y))}{\partial y} = 0 \end{matrix}$ (9) )), and $N (t) = \sum_{i = 0}^{\infty} {\bar{N}}_{i} (t)$ . Rather than go to infinity, the upper limit of the sums is chosen to be the maximum number of generations M chosen based on the length of the experiments. We are interested in estimating the growth rate $λ$ in the formulation equation (Equation11(11) $\begin{matrix} E [y (t; λ)] = \frac{\sum_{i = 0}^{\infty} \int_{0}^{\infty} ξ N_{i} (t, ξ; λ) d ξ}{N (t)}, \end{matrix}$ (11) ). Given a set of observations ${y_{i}}_{i = 1}^{n}$ for $E [y (t_{i}; λ)], i = 1, \dots, n$ , we are led naturally to an optimization problem(12) $\begin{matrix} min_{λ} Σ_{i = 1}^{n} {| E [y (t_{i}; λ)] - y_{i} |}^{2} . \end{matrix}$ (12)

However, we do not expect a single growth rate to suffice for the aggregate type data in which we are interested. To allow for heterogeneity in estimates of the aggregate growth rates, we follow an approach previously used in [Citation30,Citation31] relying on the Porohov Metric (which is equivalent to weak star convergence of measures [Citation32–Citation34]). To this end we assume a family $P (Λ)$ of distributions P of growth rates for our population over a continuum of values $Λ$ . We then use the data in a reformulation of the estimation problem(13) $\begin{matrix} P_{0} = arg min_{P \in P (Λ)} J_{O L S}^{n} (P) = arg min_{P \in P (Λ)} Σ_{i = 1}^{n} {| E [y (t_{i}; P)] - Y_{i} |}^{2}, \end{matrix}$ (13)

where(14) $\begin{matrix} E [y (t_{i}; P)] = \int_{Λ} E [y (t_{i}; λ)] d P (λ) . \end{matrix}$ (14)

Here our formulation assumes that the data ${y_{i}}$ are the realizations for random variable observations ${Y_{i}}$ in an i.i.d. error process given by(15) $\begin{matrix} Y_{i} = E [y (t_{i}; P)] + E_{i} \end{matrix}$ (15)

where $E [E_{i}] = 0$ , and $cov [E_{i}, E_{i}] = σ_{0}^{2}$ .

In our application here, one can use a continuum of growth rates $Λ = [0, λ_{max}]$ . Since $Λ$ with the Euclidean metric is a compact metric space, then the growth family $P (Λ)$ is a compact metric space when taken with the Prohorov metric [Citation34, p. 173]. Hence, since the least squares cost functional $J_{O L S}^{n}$ is continuous in P, we are guaranteed the existence of a minimizer in (Equation13(13) $\begin{matrix} P_{0} = arg min_{P \in P (Λ)} J_{O L S}^{n} (P) = arg min_{P \in P (Λ)} Σ_{i = 1}^{n} {| E [y (t_{i}; P)] - Y_{i} |}^{2}, \end{matrix}$ (13) ) for the given ${Y_{i}}$ .

The above problem is an infinite dimensional optimization problem which lends itself to a family of approximation problems. For a given finite family $Λ^{L} = {λ_{1}, λ_{2}, \dots, λ_{L}}$ of growth rates, we consider approximate families $P^{L} (Λ^{L}) = {P^{L} = Σ_{k = 1}^{L} p_{k} Δ_{λ_{k}} | p_{k} \geq 0 and \sum_{k = 1}^{L} p_{k} = 1}$ where $Δ_{λ_{k}}$ is the Dirac delta distribution with mass at $λ_{k}$ . We identify the probability distribution $P^{L}$ with a finite dimensional vector which we denote by the vector ${\vec{p}}^{L} = (p_{1}, \dots, p_{L})$ . The corresponding approximate problems are given by(16) $\begin{matrix} P_{0}^{L} = arg min_{P^{L} \in P^{L} (Λ^{L})} J_{O L S}^{n} (P^{L}) = arg min_{P^{L} \in P^{L} (Λ^{L})} Σ_{i = 1}^{n} {| E [y (t_{i}; Λ^{L}, {\vec{p}}^{L})] - Y_{i} |}^{2} . \end{matrix}$ (16)

Again, properties of the Prohorov metric [Citation34, p.173] can be used to guarantee the existence of a minimizer $P_{0}^{L}$ in (Equation16(16) $\begin{matrix} P_{0}^{L} = arg min_{P^{L} \in P^{L} (Λ^{L})} J_{O L S}^{n} (P^{L}) = arg min_{P^{L} \in P^{L} (Λ^{L})} Σ_{i = 1}^{n} {| E [y (t_{i}; Λ^{L}, {\vec{p}}^{L})] - Y_{i} |}^{2} . \end{matrix}$ (16) ) for the given data process ${Y_{i}}$ .

During the last decade, we have developed a full theoretical framework related to convergence and consistency in problems for the Prohorov metric such as those being discussed here. This theoretical framework, called the Prohorov Metric Framework, or PMF for short, is summarized and discussed in the context of fairly general problems in [Citation34, Chapter 5]. We summarize here portions of the theory that are relevant to our current problems. We first discuss issues related to convergence as $L \to \infty$ of the approximations $P_{0}^{L}$ as defined in equation (Equation16(16) $\begin{matrix} P_{0}^{L} = arg min_{P^{L} \in P^{L} (Λ^{L})} J_{O L S}^{n} (P^{L}) = arg min_{P^{L} \in P^{L} (Λ^{L})} Σ_{i = 1}^{n} {| E [y (t_{i}; Λ^{L}, {\vec{p}}^{L})] - Y_{i} |}^{2} . \end{matrix}$ (16) ) above. These results were first developed in [Citation35] and subsequently used in other problems related to aggregate data.

We let $Λ = [0, λ_{max}]$ as discussed above and $Λ_{D} = {λ_{k}^{D}}_{k = 1}^{\infty}$ be an enumeration of a countable dense subset $Λ_{D}$ of $Λ$ . We define $P_{D} (Λ_{D})$ as the collection of all finite convex combinations $Σ_{k = 1}^{L} p_{k} Δ_{λ_{k}}$ of Dirac measures on $Λ$ with atoms $λ_{k}^{D} \in Λ_{D}$ and weights ${\vec{p}}^{L}$ . Then $P_{D} (Λ_{D})$ is dense in $P (Λ)$ . A proof of this result can be found in [Citation35].

For each integer L, we define(17) $\begin{matrix} P_{D}^{L} (Λ_{D}) = \{P^{L} \in P_{D} (Λ_{D}) | P^{L} = \sum_{k = 1}^{L} p_{k} Δ_{λ_{k}^{D}}\} . \end{matrix}$ (17)

That is, for each L, $P_{D}^{L} (Λ_{D})$ is the set of all atomic probability measures with nodes placed at the first L elements in the enumeration of the countable dense subset $Λ_{D}$ . Note that $P_{D}^{L} (Λ_{D}) \subset P^{L} (Λ) \subset P (Λ)$ for all L. The following theorem provides the desired convergence result.

Theorem 3.1:

There exists a sequence of minimizers ${P_{n}^{L}}$ of $J_{O L S}^{n} (P)$ over $P^{L} (Λ)$ . Moreover, this sequence has at least one convergent subsequence. The limit $P^{n *}$ (as $L \to \infty$ ) of such a subsequence minimizes $J_{O L S}^{n}$ over $P (Λ)$ .

This theorem follows immediately from the theoretical framework and convergence theorems of [Citation36] and the results above. The power of this theorem is that it transforms an infinite dimensional minimization problem (over $P (Λ)$ ) into an L dimensional optimization problem. In practice, L nodes ${λ_{k}^{D}}_{k = 1}^{L}$ from the dense, countable subset $Λ_{D} = {λ_{k}^{D}}_{k = 1}^{\infty}$ are fixed in advance and then the L weights $p_{k}^{L}$ are estimated in a minimization problem.

We turn next to the question of consistency [Citation34, Section 5.5] which relates to the question of what happens to the minimizer $P^{n *}$ of $J_{O L S}^{n} (P)$ over $P (Λ)$ , as $n \to \infty$ , (i.e. as an increasing amount of data becomes available). These results, in the context of probability measures, assure us that the measures $P^{n *}$ converge in some sense to an underlying ‘true’ probability measure $P_{0}$ . Under appropriate conditions (primarily that the data is collected in a manner that the ‘sampling points fill up the space’ $Λ$ and an additional identifiability assumption), one has (see [Citation34, Theorem 5.5.2]) that the $P^{n *}$ , as $n \to \infty$ , converge with probability 1 to a ‘true’ measure $P_{0}$ . That is, $Prob \{ω | P^{n *} (ω) \to P_{0} (ω)\} = 1 .$

In the above examples, the formulation of the objectives assumes that the observations ${Y_{i}}$ follow an absolute error model. That is, in an absolute error model we assume that the data, propagon counts ${y_{i}}$ , are realizations of observations of the form(18) $\begin{matrix} Y_{i} = E [y (t_{i}; Λ^{L}, {\vec{p}}^{L})] + E_{i} \end{matrix}$ (18)

where the $E_{i}$ represent an i.i.d. error process with $E [E_{i}] = 0$ , and $cov [E_{i}, E_{i}] = σ_{0}^{2}$ .

However, because the variance of our data set appeared to increase over time and multiple data points were available at each time (see Figure ), we considered a proportional error model where the error in the experimental observations is proportional to the expected outcome. In this case, the data ${y_{i}}$ are assumed to be realizations of observations of the form(19) $\begin{matrix} Y_{i} = E [y (t_{i}; Λ^{L}, {\vec{p}}^{L})] + E {[y (t_{i}; Λ^{L}, {\vec{p}}^{L})]}^{δ} E_{i}, where δ > 0 . \end{matrix}$ (19)

As discussed in Appendix 1, we found our fit was best described by a proportional error model with exponent $δ$ between $0.4 - 0.7$ . For simplicity we chose $δ = 0.4$ in our analysis below, but we note the results were not overly sensitive to this value. With the choice of our proportional error model, we adjust our objective from the OLS to the generalized least squares (GLS) formulation:(20) $\begin{matrix} P_{0}^{L} = arg min_{P^{L} \in P^{L} (Λ^{L})} J (P^{L}) = arg min \sum_{i = 1}^{n} {(\frac{E [y (t_{i}; Λ^{L}, {\vec{p}}^{L})] - Y_{i}}{E {[y (t_{i}; Λ^{L}, {\vec{p}}^{L})]}^{δ}})}^{2} . \end{matrix}$ (20)

While the original OLS problem can be readily solved with quadratic programming [Citation37][Citation34], the GLS formulation cannot. In the analysis below we solve our inverse problem by using the Matlab fmincon function and constraining our parameters such that ${\vec{p}}^{L}$ satisfies $p_{k} \geq 0$ and $\sum_{k = 1}^{L} p_{k} = 1$ for different choices for our possible growth rates.

Before proceeding, we comment further on our choice of statistical model described above. Our present effort represents, to the best of our knowledge, the first attempt to precisely model and quantify propagon counting assays. We believe there are two sources of error in these experiments. First, there is randomness associated with sampling from a population. Second, there is error in the measurements from the final counting which are performed by a human. Our work has demonstrated that there is considerably quantitative power using these assays.

3.3. Inference on simulated data

Before moving to experimental data, we investigated if we could successfully estimate parameters from simulated data-sets. We generated simulated data with the same proportional variance model ( $δ = 0.4$ ) that we chose to use with the experimental data-sets and analysed our ability to correctly predict growth rates under different conditions. In order to maximize consistency between simulations and experiments, we chose the same time points as our experiments and simulated 10 observations at each. We first considered a simulated data-set with a unimodal growth rate distribution where we approximated the normal distribution ( $μ = 1.5, σ = 0.25$ ) by equally spaced growth rates in the range $Λ = [0.5, 2.5]$ . We explored our inverse problem formulation varying the number L of nodes in our estimation. The convergence obtained is depicted in Figure and is consistent with that discussed above and illustrated earlier in [Citation37].

Figure 3. Simulated Data with Unimodal Growth Distribution. Each row demonstrates our ability to determine the best fit data to a set of simulated propagon counts with increasing numbers of equally spaced nodes in our discrete model: (Row 1) 5 Nodes, (Row 2) 10 Nodes, (Row 3) 25 Nodes, (Row 4) 50 Nodes and (Row 5) 100 Nodes. In each row we depict: (Left) the simulated propagon counts (black circles) and best fit model output (blue); (Centre) the true probability density (black) compared to estimated probability density (blue) and (Right) the true cumulative distribution (black) compared to estimated cumulative distribution function. We see that with increasing nodes we converge to the cumulative distribution.

Figure 4. Simulated Data with Bimodal Growth Distribution. Each row demonstrates our ability to determine the best fit data to a set of simulated propagon counts with increasing numbers of equally spaced nodes in our discrete model: (Row 1) 5 Nodes, (Row 2) 10 Nodes, (Row 3) 25 Nodes, (Row 4) 50 Nodes and (Row 5) 100 Nodes. In each row we depict: (Left) the simulated propagon counts (black circles) and best fit model output (blue); (Centre) the true probability density (black) compared to estimated probability density (blue) and (Right) the true cumulative distribution (black) compared to estimated cumulative distribution function. We see that with increasing nodes we converge to the cumulative distribution.

We also generated simulated data with only two nodes: $λ_{} low \approx 0.90$ , (30% of population), and $λ_{high} \approx 2.1$ , ( $70 %$ of population) and solved inverse problems on the same node sets consisting of $L = 5, 10, 25, 50 and 100$ equally spaced nodes in [0.5, 2.5] with results be summarized in Figure . In all cases the distributions identified in solving the inverse problems were consistent with the true bimodal distribution, although the location of $λ_{low}$ was not always correctly predicted. Based on these results, we conclude that the underlying characteristics of the resulting approximations – such as support and modality – in general, are likely to be in agreement with the true bimodal distribution.

4. Parameter inference on experimental data

4.1. Estimating growth rates

Motivated by our results on simulated data-sets, we formulated and solved inverse problems for our two prion variants WT and R15 (see Figure ). Using the proportional error model, we identified the optimal discrete probability distribution $\vec{p}$ for different numbers of equally spaced nodes between 0.5 and 2.5 (see Figures – solid lines). As the number of nodes increased from 4, we observed bimodality in the probability distribution functions (solid lines) for both WT (black) and R15 (red) variants. Indeed, it seemed that the delta function model was converging for each variant to a bimodal distribution. Based on the results from our simulated data, these estimated node weights appeared consistent with two distinct growth rates rather than a broad unimodal distribution of growth rates.

To determine bimodality more directly, we explored an alternative method of parameter estimation by allowing the location of $λ$ to vary continuously throughout the interval [0.5, 2.5] and identifying the optimal two node distribution of $λ$ (see Figure ). For reference, we also show the continuously varying values of $λ$ and their associated probabilities in dashed lines in Figures - (dashed lines). Together, the consistent node placement between the continuous and discrete node models suggest that there are two distinct prion aggregate growth rates for WT and R15 variants. Intriguingly, the bimodal behaviour was distinct between variants; assuming cells from the lower and higher $λ$ subpopulations are consistent between variants, this suggests an overall higher rate of aggregate amplification for cells with the WT variant compared to the R15 variant (see Figure ).

Figure 5. 4 Fixed Nodes. Fit to 4 nodes equally spaced in $λ = [0.5, 2.5] .$

Figure 6. 9 Fixed Nodes. Fit to 9 nodes equally spaced in $λ = [0.5, 2.5] .$

Figure 7. 17 Fixed Nodes. Fit to 17 nodes equally spaced in $λ = [0.5, 2.5] .$

Figure 8. 33 Fixed Nodes. Fit to 33 nodes equally spaced in $λ = [0.5, 2.5] .$

Figure 9. 65 Fixed Nodes. Fit to 65 nodes equally spaced in $λ = [0.5, 2.5] .$

Figure 10. 129 Fixed Nodes. Fit to 129 nodes equally spaced in $λ = [0.5, 2.5] .$

Figure 11. Continuously Varying Nodes. Based on the observations for fixed node models, it appeared that the underlying data had a strong preference for two specific nodes.

4.2. Model selection test to assess bimodality

In order to more robustly test the hypothesis of bimodality in the data-sets, we performed a model selection test where we compared our model of two continuously varying values for $λ$ with a model where we chose only a single value of $λ$ . In Figure we compare the location of the best choice for a single node (solid line) with the two node model (dashed lines) for both WT and R15. We note that the optimal single node chosen appears to be close to the weighted average for the two node model. To decide between the one or two growth rate model we performed a model selection test on the residual sum of squares (RSS) for our proportional error model (Equation Equation20(20) $\begin{matrix} P_{0}^{L} = arg min_{P^{L} \in P^{L} (Λ^{L})} J (P^{L}) = arg min \sum_{i = 1}^{n} {(\frac{E [y (t_{i}; Λ^{L}, {\vec{p}}^{L})] - Y_{i}}{E {[y (t_{i}; Λ^{L}, {\vec{p}}^{L})]}^{δ}})}^{2} . \end{matrix}$ (20) ) as described in [Citation34,Citation38,Citation39]. Our statisticcompares the difference between RSS for the one growth rate model ( $R S S_{1}$ , one parameter $λ$ ) and the RSS for the two growth rate model ( $R S S_{2}$ , three parameters $λ_{1}, λ_{2}, p_{1}$ ): $\frac{R S S_{1} - R S S_{2}}{R S S_{2}} \sim χ^{2} (2) .$

According to the $χ^{2}$ distribution, there was a significantly better fit for the two node model than the one node model for the WT variant ( $p = 0.00335$ ) but not the R15 variant $(p = . 1554218)$ . However, we note that the R15 data still seem to support the bimodality, otherwise the parameter estimated for the two node model would have converged to the signal node distribution.

Figure 12. 1 vs. 2 Nodes. In order to decide if the underlying data-sets were bimodal, we compare the best bimodal nodes (dashed lines) with the best single node model for each case (WT $=$ black, R15 $=$ red).

4.3. Sensitivity analysis

Having strong evidence in support of the two node model for the WT variant and weaker but still suggestive evidence for R15, we analysed the sensitivity of our results to our parameter estimates. Following [Citation39] we used the system of differential equations for the local sensitivity of observed propagon counts to changes from optimal parameters. The sensitivities were computed using the complex-step method [Citation40]. In the analysis below we have chosen to represent the bimodal model with the following three parameters: $p, λ_{1}, λ_{2}$ where $λ_{1} < λ_{2}$ and the probability of growth rate $λ_{1}$ is p and $λ_{2}$ is $(1 - p)$ .

In Figure we plot the normalized sensitivities in time. We first note that, as expected, the model with R15 parameters has lower predicted aggregate counts than the WT parameters (see Figure (Top)). We note differences in the local sensitivity behaviour of the variants to p and $λ_{1}$ . The behaviour of WT is increasingly sensitive to p until around 7 h where its sensitivity then begins to decrease in contrast to R15 where sensitivity appears to be slowly increasing then levelling out. Further in comparison to WT, R15 has minimal sensitivity to $λ_{1}$ . Both WT and R15 exhibit similar behaviour to sensitivities in $λ_{2}$ where the sensitivity increases and then decreases after the maximum occurs at around 7 h. We expect strongly decreasing sensitivity to $λ$ values as each of the respective subpopulations approach the same aggregate carrying capacity, which likely explains the $λ_{2}$ behaviour for both variants.

Figure 13. Time Varying Local Sensitivities. (Top) Model output x for optimal parameter choices. Note that despite the difference in the optimal parameters, the model behaviour for WT (black) and R15 (red) are quite similar. (Second - Bottom) Time varying sensitivities for WT (black) and R15 (red).

4.4. Confidence intervals

We next used our sensitivity matrices (of the previous Section) to estimate confidence intervals for our parameters. We followed the asymptotic theory approach given in [Citation34] for estimating confidence intervals for generalized least squares problems. We first describe F, the sensitivity matrix, which is in this case $12 \times 3$ matrix for our 12 time points ${t_{1}, t_{2}, \dots, t_{12}}$ and three parameters. Recall, the output of our model is the expected number of aggregates observed at particular time t. For ease in notation, we redefine our model output for the bimodal case (originally defined in Equation (Equation14(14) $\begin{matrix} E [y (t_{i}; P)] = \int_{Λ} E [y (t_{i}; λ)] d P (λ) . \end{matrix}$ (14) )) as follows:(21) $\begin{matrix} E [t; λ_{1}, λ_{2}, p] = p E [y (t; λ_{1})] + (1 - p) E [y (t; λ_{2})] . \end{matrix}$ (21)

The sensitivity matrix is given by(22) $\begin{matrix} F [λ_{1}, λ_{2}, p] = [\begin{matrix} \frac{\partial E [t_{1}; λ_{1}, λ_{2}, p]}{\partial p} & \frac{\partial E [t_{1}; λ_{1}, λ_{2}, p]}{\partial λ_{1}} & \frac{\partial E [t_{1}; λ_{1}, λ_{2}, p]}{\partial λ_{2}} \\ \dots & \dots & \dots \\ \frac{\partial E [t_{12}; λ_{1}, λ_{2}, p]}{\partial p} & \frac{\partial E [t_{12}; λ_{1}, λ_{2}, p]}{\partial λ_{1}} & \frac{\partial E [t_{12}; λ_{1}, λ_{2}, p]}{\partial λ_{2}} \end{matrix}] \end{matrix}$ (22)

Our matrix of weights is given by a proportional variance model, where we chose $δ = 0.4$ :(23) $\begin{matrix} W = diag (E {[t_{1}; λ_{1}, λ_{2}, p]}^{2 δ}, E {[t_{2}; λ_{1}, λ_{2}, p]}^{2 δ} \dots, E {[t_{12}; λ_{1}, λ_{2}, p]}^{2 δ}) . \end{matrix}$ (23)

The standard errors of our parameter estimates are given by the square root of the diagonal elements of the covariance matrix $Σ$ , where:(24) $\begin{matrix} Σ = σ^{2} {(F^{T} [λ_{1}, λ_{2}, p] W F [λ_{1}, λ_{2}, p])}^{- 1} . \end{matrix}$ (24) (If the precise value of $σ^{2}$ , the variance of the noise terms in the proportional error model, were known this would be used. Otherwise, as we do in this case, the value $σ^{2}$ is estimated from the data.)

Using the formulation above, we computed the standard error and $95 %$ confidence intervals for the WT and R15 parameters (see Tables and , respectively). We found greater confidence in our estimated parameters for the WT variant data than those for the R15 data. This is evident in the smaller standard error (SE) and normalized standard error (SE/estimate). A bimodal distribution in growth rates is strongly suggested for the WT variant data because the $95 %$ confidence intervals for $λ_{1}$ and $λ_{2}$ were disjoint. As we will describe in more detail, the most likely explanation for the two subpopulations in the yeast data is that mother and daughter cells have lower and higher rates of aggregate growth, respectively. Since roughly $40 %$ of the population at steady-state consists of mother cells, the estimate of $p = 0.45$ is especially compelling.

For the R15 data, the confidence in our model parameters was lower than that for WT. Of particular concern is the $95 %$ confidence interval for $λ_{1}$ not only contains the entire $95 %$ confidence interval for $λ_{2}$ but allows for negative growth rates. While based on our inverse problem results, there is support for bimodal growth rates with the R15 variant, our data does not allow us to conclude this with statistical certainty.

Table 1. WT variant – standard error and 95% confidence intervals.

Display Table

Table 2. R15 variant – standard error and 95% confidence intervals.

Display Table

5. Discussion and concluding remarks

Our mathematical model and inverse problem formulation for prion amplification in a yeast colony indicate the possible presence of bimodality in the prion aggregate growth rate distribution. We believe this bimodality is consistent with the underlying asymmetry in the budding yeast reproductive process. When yeast cells divide, the resulting cells are not identical. A daughter grows as a bud on a mature mother cell. When they separate, the daughter cell is smaller and does not inherit the replicative age of the mother. Since previous work has demonstrated a difference in prion aggregate composition between yeast cells in their first generation (daughters) and mature yeast cells (mothers) [Citation15], it is attractive to speculate that these are the same two populations we have identified here. We note that, according to the nucleated polymerization model [Citation11,Citation17], the amount of soluble (healthy) protein directly impacts the aggregate growth rate, so it is possible that differences in the amount of soluble Sup35 in mother and daughter cells could explain this difference.

We note that while our model provides statistical support for bimodality in the aggregate growth rates for the WT variant but not the R15 variant, it is possible that further experimental data might change this conclusion. In addition, we note that there were several simplifying assumptions we made in the analysis of our model that could be revisited. First, our model of cell division considered only exponential cell growth, a more realistic model of yeast cell growth would employ time varying rates of division. Second, our inverse problem formulation considered only heterogeneity in the aggregate growth rates, but we could consider simultaneously estimating growth rates and the aggregate carrying capacity. In future studies we intend to investigate these, and other, extensions.

Acknowledgements

The authors gratefully acknowledge several reviewers whose comments and questions substantially improved the presentation in this manuscript.

Correction Statement

This article has been republished with minor changes. These changes do not impact the academic content of the article.

Additional information

Funding

This research was supported in part by the Air Force Office of Scientific Research [grant number AFOSR FA9550-15-1-0298]; in part by the National Science Foundation under NSF [grant number DMS-0946431], [grant number DMS-1514929], [grant number INSPIRE BCS-13-44279]; in part by the National Institutes of Health [grant number NIGMS R35 GM118042].

Notes

No potential conflict of interest was reported by the authors.

Related Research Data

Cell division is essential for elimination of the yeast [PSI+] prion by guanidine hydrochloride

Source: Proceedings of the National Academy of Sciences

Estimation of growth rate distributions in size structured population models

Source: American Mathematical Society (AMS)

A study in nucleated polymerization models of protein aggregation

Source: Elsevier BV

Statistical methods for model comparison in parameter estimation problems for distributed systems

Source: Springer Science and Business Media LLC

Kinetic models of guanidine hydrochloride-induced curing of the yeast [PSI+] prion

Source: Elsevier BV

A mathematical analysis of the dynamics of prion proliferation

Source: Elsevier BV

Mammalian Prion Biology: One Century of Evolving Concepts

Source: Elsevier BV

The physical basis of how prion conformations determine strain phenotypes

Source: Springer Science and Business Media LLC

On analytical and numerical approaches to division and label structured population models

Source: Elsevier BV

Modeling and estimating uncertainty in parameter estimation

Source: IOP Publishing

Estimating the number of prions in yeast cells

Source: Oxford University Press (OUP)

Estimation of Cell Proliferation Dynamics Using CFSE Data

Source: Springer Science and Business Media LLC

Analysis of a model for the dynamics of prions II

Source: Elsevier BV

The amyloid state and its association with protein misfolding diseases

Source: Springer Science and Business Media LLC

Prion dynamics with size dependency-strain phenomena

Source: Informa UK Limited

Prion dynamics and the quest for the genetic determinant in protein-only inheritance.

Source: Elsevier BV

Spatial quality control bypasses cell-based limitations on proteostasis to promote prion curing

Source: eLife Sciences Publications, Ltd

Estimation of probability distributions for individual parameters using aggregate population observations

Source: Birkhäuser Boston

A Functional Analysis Framework for Modeling, Estimation and Control in Science and Engineering

Source: Chapman and Hall/CRC

A comparison of approximation methods for the estimation of probability distributions on parameters

Source: Elsevier BV

Modeling and Inverse Problems in the Presence of Uncertainty

Source: Chapman and Hall/CRC

A mathematical model of the dynamics of prion aggregates with chaperone-mediated fragmentation

Source: Springer Science and Business Media LLC

A Crump-Mode-Jagers branching process model of prion loss in yeast

Source: Cambridge University Press (CUP)

Estimation Techniques for Distributed Parameter Systems

Source: Birkhäuser Boston

A Size Threshold Limits Prion Transmission and Establishes Phenotypic Diversity

Source: American Association for the Advancement of Science (AAAS)

Estimation Techniques for Distributed Parameter Systems

Source: Birkhäuser Boston

Quantifying the kinetic parameters of prion replication.

Source: Elsevier BV

Prion Diseases of Humans and Animals: Their Causes and Molecular Basis

Source: Annual Reviews

Linking provided by

References

Knowles TPJ , Vendruscolo M , Dobson CM . The amyloid state and its association with protein misfolding diseases. Nat Rev Mol Cell Biol. 2014;15(6):384–396.
PubMed Web of Science ®Google Scholar
Aguzzi A , Polymenidou M . Mammalian prion biology: one century of evolving concepts. Cell. 2004;116(2):313–327.
PubMed Web of Science ®Google Scholar
Collinge J . Prion diseases of humans and animals: their causes and molecular basis. Ann Rev Neurosci. 2001;24(1):519–550.
PubMedGoogle Scholar
Sindi SS , Serio TR . Prion dynamics and the quest for the genetic determinant in protein-only inheritance. Curr Opin Microbiol. 2009;12(6):623–630.
PubMed Web of Science ®Google Scholar
Tuite MF , Serio TR . The prion hypothesis: from biological anomaly to basic regulatory mechanism. Nat Rev Mol Cell Biol. 2010;11(12):823–833.
PubMed Web of Science ®Google Scholar
Palmer KJ , Ridout MS , Morgan BJT . Kinetic models of guanidine hydrochloride-induced curing of the yeast [psi+] prion. J Theor Biol. 2011;274(1):1–11.
PubMed Web of Science ®Google Scholar
Olofsson P , Sindi SS . A crump-mode-jagers branching process model of prion loss in yeast. J Appl Probab. 2014;51(2):453–465.
Web of Science ®Google Scholar
Ridout M , Giagos V , Morgan B , et al. Modelling prion dynamics in yeast. International Statistical Institute Proceedings of the 58th World Statistical Congress, Dublin (Session IPS020). 2011.
Google Scholar
Pöschel T , Brilliantov NV , Frömmel C . Kinetics of prion growth. Biophys J. 2003;85(6):3460–3474.
PubMed Web of Science ®Google Scholar
Greer ML , Pujo-Menjouet L , Webb GF . A mathematical analysis of the dynamics of prion proliferation. J Theor Biol. 2006;242(3):598–606.
PubMed Web of Science ®Google Scholar
Davis JK , Sindi SS . A mathematical model of the dynamics of prion aggregates with chaperone-mediated fragmentation. J Math Biol. 2016;72(6):1555–1578.
PubMed Web of Science ®Google Scholar
Prüss J , Pujo-Menjouet L , Webb G , et al . Analysis of a model for the dynamics of prions. Discrete Contin Dyn Syst Ser B. 2006;6(1):225–235.
Web of Science ®Google Scholar
Engler H , Prüss J , Webb GF . Analysis of a model for the dynamics of prions ii. J Math Anal Appl. 2006;324(1):98–117.
Web of Science ®Google Scholar
Tanaka M , Collins SR , Toyama BH , et al . The physical basis of how prion conformations determine strain phenotypes. Nature. 2006;442(7102):585–589.
PubMed Web of Science ®Google Scholar
Derdowski A , Sindi SS , Klaips CL , et al . A size threshold limits prion transmission and establishes phenotypic diversity. Science. 2010;330(6004):680–683.
PubMed Web of Science ®Google Scholar
Klaips CL , Hochstrasser ML , Langlois CR , et al. Spatial quality control bypasses cell-based limitations on proteostasis to promote prion curing. eLife. 2014;3:e04288
Web of Science ®Google Scholar
Masel J , Jansen VAA , Nowak MA . Quantifying the kinetic parameters of prion replication. Biophys Chem. 1999;77(2):139–152.
PubMed Web of Science ®Google Scholar
Davis JK , Sindi SS . A study in nucleated polymerization models of protein aggregation. Appl Math Lett. 2015;40:97–101.
PubMed Web of Science ®Google Scholar
Calvez V , Lenuzza N , Doumic M , et al . Prion dynamics with size dependency-strain phenomena. J Biol Dyn. 2010;4(1):28–42.
PubMedGoogle Scholar
Banks HT , Flores KB , Sindi SS . On analytical and numerical approaches to division and label structured population models. Appl Math Lett. 2016;60:81–88.
Web of Science ®Google Scholar
Banks HT , Sutton KL , Thompson WC , et al . Estimation of cell proliferation dynamics using cfse data. Bull Math Biol. 2011;73(1):116–150.
PubMed Web of Science ®Google Scholar
Thompson WC . Partial differential equation modeling of flow cytometry data from CFSE-based proliferation assays. Raleigh (NC): North Carolina State University; 2011.
Google Scholar
Hasenauer J , Schittler D , Allgöwer F . Analysis and simulation of division-and label-structured population models. Bull Math Biol. 2012;74(11):2692–2732.
PubMed Web of Science ®Google Scholar
Cole DJ , Morgan BJT , Ridout MS , et al . Estimating the number of prions in yeast cells. Math Med Biol. 2004;21(4):369–395.
PubMed Web of Science ®Google Scholar
Byrne LJ , Cole DJ , Cox BS , et al . The number and transmission of [psi+] prion seeds (propagons) in the yeast saccharomyces cerevisiae. PLoS One. 2009;4(3):e4670–e4670.
PubMed Web of Science ®Google Scholar
Byrne LJ , Cox BS , Cole DJ , et al . Cell division is essential for elimination of the yeast [psi+] prion by guanidine hydrochloride. Proc Nat Acad Sci. 2007;104(28):11688–11693.
PubMed Web of Science ®Google Scholar
Langlois CR , Pei F , Sindi SS , et al . Distinct prion domain sequences ensure efficient amyloid propagation by promoting chaperone binding or processing In vivo. In Review.
Google Scholar
Cipollina C , Vai M , Porro D , et al . Towards understanding of the complex structure of growing yeast populations. J Biotechnol. 2007;128(2):393–402.
PubMed Web of Science ®Google Scholar
Gyllenberg M . The size and scar distributions of the yeast saccharomyces cerevisiae. J Math Biol. 1986;24(1):81–101.
Web of Science ®Google Scholar
Banks HT , Fitzpatrick BG . Estimation of growth rate distributions in size structured population models. Q Appl Math. 1991;49:215–235.
Web of Science ®Google Scholar
Banks HT , Fitzpatrick BG , Potter Y , et al . Estimation of probability distributions for individual parameters using aggregate population data. Stochastic analysis, control, optimization and applications. Springer; 1999. p. 353–371.
Google Scholar
Billingsley P . Convergence of probability measures. Wiley; 2013.
Google Scholar
Banks HT . A functional analysis framework for modeling, estimation and control in science and engineering. CRC Press; 2012.
Google Scholar
Banks HT , Hu S , Thompson WC . Modeling and inverse problems in the presence of uncertainty. CRC Press; 2014.
Google Scholar
Banks HT , Bihari KL . Modeling and estimating uncertainty in parameter estimation. Inverse Prob. 2001;17:95–111.
Web of Science ®Google Scholar
Banks HT , Kunisch Karl . Estimation techniques for distributed parameter systems. Boston (MA): Birkhausen; 1989.
Google Scholar
Banks HT , Davis JL . A comparison of approximation methods for the estimation of probability distributions on parameters. Appl Numer Math. 2007;57(5):753–777.
Web of Science ®Google Scholar
Banks HT , Fitzpatrick BG . Statistical methods for model comparison in parameter estimation problems for distributed systems. J Math Biol. 1990;28(5):501–527.
Web of Science ®Google Scholar
Banks HT , Tran HT . Mathematical and experimental modeling of physical and biological processes. CRC Press; 2009.
Google Scholar
Martins JRRA , Kroo IM , Alonso JJ . An automated method for sensitivity analysis using complex variables. AIAA paper 2000–0689, 38th Aerospace Sciences Meeting. Reno (NC); 2000. p. 2000–0689.
Google Scholar

Appendix 1

Fitting proportional error model

Figure A1. Modified Residuals vs. Time for WT data.

Figure A2. Modified Residuals vs. Time for R15 data.

Since our experimental data had multiple observations at each time point, we considered fitting a proportional error model. In Figures and we determined modified residuals by subtractingthe mean of the observed values and dividing by the mean to the appropriate power. That is, if we compute

{\bar{y}}_{i}

is the average of all observed counts at the same time, then the modified residuals are:

(A1)

\begin{matrix} M (y_{i}) = \frac{y_{i} - {\bar{y}}_{i}}{{\bar{y}}_{i}^{δ}} . \end{matrix}

(A1)

In a proportional error model, we would expect the residuals to be uniformly dispersed at $δ = 0$ . However, we see that the data are more consistent with $δ > 0$ as dispersal increases with nonzero $δ$ . In the analysis of our data we chose the value $δ = 0.4$ but the results were consistent for $δ$ values between 0.4 and 0.6.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Estimating the rate of prion aggregate amplification in yeast with a generation and structured population model

Abstract

1. Introduction

2. Model formulation

2.1. Generation and structured population model

2.2. Population decomposition and analytic solution

3. Inverse problem formulation and analysis

3.1. Experimentally quantifying aggregates