2,923
Views
1
CrossRef citations to date
0
Altmetric
Research Paper

Technical versus biological variability in a synthetic human gut community

, , , , & ORCID Icon
Article: 2155019 | Received 06 Jul 2022, Accepted 30 Nov 2022, Published online: 29 Dec 2022

ABSTRACT

Synthetic communities grown in well-controlled conditions are an important tool to decipher the mechanisms driving community dynamics. However, replicate time series of synthetic human gut communities in chemostats are rare, and it is thus still an open question to what extent stochasticity impacts gut community dynamics. Here, we address this question with a synthetic human gut bacterial community using an automated fermentation system that allows for a larger number of biological replicates. We collected six biological replicates for a community initially consisting of five common gut bacterial species that fill different metabolic niches. After an initial 12 hours in batch mode, we switched to chemostat mode and observed the community to stabilize after 2–3 days. Community profiling with 16S rRNA resulted in high variability across replicate vessels and high technical variability, while the variability across replicates was significantly lower for flow cytometric data. Both techniques agree on the decrease in the abundance of Bacteroides thetaiotaomicron, accompanied by an initial increase in Blautia hydrogenotrophica. These changes occurred together with reproducible metabolic shifts, namely a fast depletion of glucose and trehalose concentration in batch followed by a decrease in formic acid and pyruvic acid concentrations within the first 12 hours after the switch to chemostat mode. In conclusion, the observed variability in the synthetic bacterial human gut community, as assessed with 16S rRNA gene sequencing, is largely due to technical variability. The low variability seen in HPLC and flow cytometry data suggests a highly deterministic system.

Introduction

The human gut microbiome is a complex and dynamic ecosystem, containing numerous microbes that perform different functions. They produce essential nutrients such as vitamins and short-chain fatty acids (SCFA).Citation1,Citation2 In addition, the human gut microbiome plays an important role in the regulation of the host immune system, which is essential for health.Citation3

Longitudinal cohort studies show that the composition of the human gut microbiome as assessed with fecal samples does not change substantially when unperturbed,Citation4,Citation5 but that day-to-day variation is nevertheless visible. This poses the question whether short-term variation is due to small changes in the environment (e.g. diet or circadian rhythms of the host),Citation6,Citation7 the intrinsic stochasticity of the system (e.g. variable growth and death rates), or technical variability.

Here, our goal is to investigate the reproducibility of the dynamics of a defined human gut bacterial community to quantify the variability in gut microbial abundances in well-controlled conditions. To do this, we need to work with a single source of inoculum and control all external sources that influence microbial dynamics. The latter is only possible in a bioreactor where atmosphere, temperature, and pH are kept constant, and the process can be operated in chemostat conditions in which a steady state can be reached. This requires a continuous culture to regularly add fresh nutrients and remove waste products. In such well-controlled conditions, the remaining variation is either due to community dynamics (i.e. stochasticity, but also nearby alternative stable states or chaosCitation8), short-term evolution (not explored here), or technical variability of measurements. Although variability of microbial gut community dynamics can also be assessed with fecal slurries,Citation9,Citation10 working with a defined community makes it easier to understand the mechanisms driving the dynamics as it is less complex than fecal slurries, lacking archaea, fungi, and viruses.

Bioreactor systems range in complexity from coupled vessels mimicking the human intestinal system to well plates,Citation11,Citation12,Citation33 representing a trade-off in realism and control versus the number of replicates that can be achieved. Although well plates combined with serial transfer are claimed to emulate chemostat conditions, they instead represent a sequential batch process (i.e., a series of (short) batch experiments). Traditional chemostats in bench-top bioreactors adopt a continuous inflow of fresh medium and outflow of broth keeping the volume constant. A series of such chemostats (with different specific process parameters) have been coupled to mimic the intestinal tract.Citation11 Since such bench-top chemostat bioreactors are cost-, time- and labor-intensive, replicate time series of defined human gut microbial communities in such chemostats are rare. The few that have been collected show that gut communities reach a stable state within a few days and that the dynamics are reproducible, but that there is some variability across biological replicates.Citation14–16 However, the number of replicates in these experiments is small and cell densities are usually not measured, such that conclusions rely on relative instead of absolute abundances.

Here, we monitored microbial community composition in six parallel bioreactors in chemostat mode to investigate the variability of the dynamics of a defined human gut community and obtained total cell counts via flow cytometry. The community was composed of gut bacterial species with different metabolic roles, including the butyrate producer Roseburia intestinalis DSM 14610 (RI),Citation17 the acetogen Blautia hydrogenotrophica DSM 10507 (BH),Citation18,Citation19 the lactate producer Collinsella aerofaciens RCC 1377 (CA),Citation20 and the succinate producers Bacteroides thetaiotaomicron DSM 2079 (BT)Citation21 and Prevotella copri DSM18205 (PC).Citation22 The two latter species are also representatives of two different enterotypes.Citation23,Citation24 We calculated the total number of cells in each sample using flow cytometry and systematically assessed the technical variability of 16S rRNA gene sequencing. As an alternative to the latter, we also applied supervised classification to flow cytometry data to classify events by species. This method has already been validated in the context of gut bacteria, where a synthetic community comprised four bacteria (RI, BH, BT, and FP) was evaluated.Citation25 In both cases, we observed a reproducible change in composition before the community stabilized.

Results

Transient community dynamics is characterized by reproducible metabolic changes

Monocultures and communities of five gut bacterial species were inoculated in Wilkins-Chalgren medium, and community composition was followed for 67 hours in six biological replicates. All microbioreactors were inoculated from the same pool of cells for each experiment, respectively, with the same number of cells/ml for each species as determined by flow cytometry. We started the chemostat mode after 12 hours of batch incubation, to allow the slower-growing species to reach sufficiently high densities to avoid wash-out (e.g. BH and PC, see Supplementary Figure 1). Since the system has no automatic outflow of medium, we set up a pipetting scheme where excess liquid was pipetted out every 2 hours during the day and every 4 hours during the night, thus creating an emulated chemostat. An overview of the experimental setup is given in .

Figure 1. Experimental setup. a) Twelve microbioreactor vessels are inoculated with five monocultures (Roseburia intestinalis, Blautia hydrogenotrophica, Bacteroides thetaiotaomicron, Collinsella aerofaciens and Prevotella copri) and six replicates for the synthetic community. However, Prevotella was not consistently detected in the communities after the batch phase. b) After an initial twelve hours in batch where the pH was kept constant at pH 6.5, the system was switched to chemostat mode. Samples were taken at specific timepoints and subsequently processed for cell count, 16S rRNA gene sequencing and HPLC.

Figure 1. Experimental setup. a) Twelve microbioreactor vessels are inoculated with five monocultures (Roseburia intestinalis, Blautia hydrogenotrophica, Bacteroides thetaiotaomicron, Collinsella aerofaciens and Prevotella copri) and six replicates for the synthetic community. However, Prevotella was not consistently detected in the communities after the batch phase. b) After an initial twelve hours in batch where the pH was kept constant at pH 6.5, the system was switched to chemostat mode. Samples were taken at specific timepoints and subsequently processed for cell count, 16S rRNA gene sequencing and HPLC.

We combined 16S rRNA gene sequencing with flow cytometry to measure absolute species abundances at specific timepoints ( a and b). PC declined quickly and was on the border of the detection limit, but the abundances of the other four species stabilized at the end of the experiment (i.e., 2–3 residence times). We also performed a validation experiment with a modified community composition (Faecalibacterium prausnitzii instead of CA), which also stabilized (Supplementary Figure 2). The abundance of BT decreased over time, while BH increased initially. The dominance of BT in the first 12 hours is expected since it grows faster than the other strains in the chosen medium (Supplementary Figure 1).

Figure 2. Microbial abundances assessed with 16S rRNA gene sequencing (corrected for copy number) and metabolite concentrations over time for the synthetic gut community. a) The mean and standard deviation of species abundance across six biological replicates is shown in logarithmic scale. For timepoint ‘0’, flow cytometry data of the monocultures for the inoculum was used. b) The same as a, without the logarithmic scale. a,) Only species consistently present across technical replicates and time points are shown. See Supplementary Figure 3 for time series obtained with the second and third technical replicate. c) Mean concentrations for metabolites with decreasing concentrations across the six replicates. d) Mean concentrations for metabolites with increasing concentrations across the six replicates. For the metabolite data, timepoint 0 represents the blank medium. For all graphs, error bars represent standard deviation across the biological replicates.

Figure 2. Microbial abundances assessed with 16S rRNA gene sequencing (corrected for copy number) and metabolite concentrations over time for the synthetic gut community. a) The mean and standard deviation of species abundance across six biological replicates is shown in logarithmic scale. For timepoint ‘0’, flow cytometry data of the monocultures for the inoculum was used. b) The same as a, without the logarithmic scale. a,) Only species consistently present across technical replicates and time points are shown. See Supplementary Figure 3 for time series obtained with the second and third technical replicate. c) Mean concentrations for metabolites with decreasing concentrations across the six replicates. d) Mean concentrations for metabolites with increasing concentrations across the six replicates. For the metabolite data, timepoint 0 represents the blank medium. For all graphs, error bars represent standard deviation across the biological replicates.

Next, we looked at the metabolite concentrations ( c and d). Over time, acetic, butyric, propionic, and iso-valeric acid concentrations increased, while trehalose, glucose, pyruvic, succinic, lactic, and formic acid decreased. The transient dynamics is characterized by two phases. First, we saw a net increase in formic, succinic, and lactic acid, a depletion in glucose and pyruvic acid, and a decrease in trehalose in the first 12–25 hours. Subsequently, succinic, lactic, and formic acid decreased before metabolites stabilized. This decrease could be either due to consumption or to reduced production. This dynamic is reproduced in the six biological replicates for which metabolites were measured. Metabolite measurements of monocultures after 12 hours of incubation show that BH did not consume glucose, while BT and to a lesser extent RI and CA consumed it (, Supplementary Figure 4) suggesting that BT was the main driver of its depletion at 12 hours in the community. In addition, no organism consumed lactate in the monoculture during the batch phase, but it decreased in the transient phase of the chemostat. Succinic acid was first produced and decreased after 25 hours. It has no consumer according to the monocultures in batch but is significantly correlated with Bacteroides, its only producer in monoculture (Supplementary Figure 5). Butyric acid, acetic acid, and iso-valeric acid, which are produced in the monocultures, increase in the community until reaching steady state.

Figure 3. a) Schematic representation of produced (green) and consumed (red) metabolites at T = 12 hours after inoculation in batch fermentation in monoculture. The thickness of the arrow indicates the relative amount of production/consumption compared to the blank (see b). b) HPLC results for monocultures after 12 h in batch mode. Prevotella copri was excluded due to contamination (see Supplementary Figure 7). The arrow thickness was scaled by taking ½ of the square root.

Figure 3. a) Schematic representation of produced (green) and consumed (red) metabolites at T = 12 hours after inoculation in batch fermentation in monoculture. The thickness of the arrow indicates the relative amount of production/consumption compared to the blank (see b). b) HPLC results for monocultures after 12 h in batch mode. Prevotella copri was excluded due to contamination (see Supplementary Figure 7). The arrow thickness was scaled by taking ½ of the square root.

Technical variability of 16S rRNA gene sequencing exceeds biological variability

To explore the sources of variability in the data, we assessed the technical variability of 16S rRNA gene sequencing. For this, we extracted, amplified, and sequenced DNA from all biological replicates for each time point three times. The two replicates that were processed together were more similar to each other than either was to the third replicate (Supplementary Figure 6). The three technical replicates reproduced the main trend (decrease of BT and increase of BH, Supplementary Figure 3). However, the technical variability of 16S rRNA gene sequencing is larger than the variability between the replicate vessels (, Supplementary Figures 8, 9). We calculated the coefficient of variation (CV) for the results of HPLC, flow cytometry (CellScanner, see below), and 16S rRNA gene sequencing () and found that the CV of 16S rRNA relative abundances are significantly higher than for those obtained with CellScanner (, paired Wilcoxon signed rank test p-value < 0.001).

Figure 4. PCoA on relative abundances from 16S rRNA gene sequencing (corrected for copy number) and flow cytometry data processed with CellScanner. Metabolites significantly correlated to principal components were added using envfit. a) First technical replicate of 16S rRNA gene sequencing for six biological replicates and six timepoints. b) 75% confidence areas calculated on three technical replicates of 16S rRNA gene sequencing for six biological replicates and six time points. Dot size encodes time. c) CellScanner results for six biological replicates and six timepoints. a-c) Color encodes biological replicate. d) Distributions of CV (coefficient of variation) values of relative abundances per species and time point across vessels. The CV values of the first technical replicate are significantly higher than for CellScanner (Wilcoxon signed-rank test).

Figure 4. PCoA on relative abundances from 16S rRNA gene sequencing (corrected for copy number) and flow cytometry data processed with CellScanner. Metabolites significantly correlated to principal components were added using envfit. a) First technical replicate of 16S rRNA gene sequencing for six biological replicates and six timepoints. b) 75% confidence areas calculated on three technical replicates of 16S rRNA gene sequencing for six biological replicates and six time points. Dot size encodes time. c) CellScanner results for six biological replicates and six timepoints. a-c) Color encodes biological replicate. d) Distributions of CV (coefficient of variation) values of relative abundances per species and time point across vessels. The CV values of the first technical replicate are significantly higher than for CellScanner (Wilcoxon signed-rank test).

Table 1. Coefficient of variation across vessels for metabolite data

Table 2. Coefficient of variation across vessels for absolute and relative strain abundances obtained with CellScanner

Table 3. Coefficient of variation across vessels for the first technical replicate of 16S rRNA gene sequencing. Pseudomonas is a contaminant detected in a sub-set of the samples in low amounts.

Table 4. Coefficient of variation across vessels and strains.

Abundances obtained through flow cytometry data analysis reproduce the main trend

Given the high technical variability of 16S rRNA gene sequencing, we used an alternative method to detect and quantify bacterial species over time. CellScanner trains classifiers (random forests) on monoculture flow cytometry data, which then determines the species of an event in a community flow cytometry file. CellScanner also performs automated gating on monocultures and blanks using the same principle. To assess how well different supervised classification methods in CellScanner can distinguish the species, we evaluated them on communities assembled in silico from mono-culture data (Supplementary Figure 10A and 10B). CellScanner agrees with 16S rRNA gene sequencing on the same trend: the abundance of BT decreased and that of BH increased, while RI and CA remain relatively constant (). More precisely, for BH and BT, CellScanner and sequencing abundances are significantly correlated. For CA, they are weakly correlated, and for RI, the techniques disagree (Supplementary Table S1). In agreement with the lower CV values seen for CellScanner, PCoA plots show that the trajectories of individual vessels are closer to each other when assessed with flow cytometry data as opposed to sequencing data ( a and c).

Figure 5. Comparison of 16S rRNA gene sequencing results (corrected for copy number) and CellScanner. a) Mean relative abundance of gut bacterial species by 16S rRNA gene sequencing across six biological replicates for the first technical replicate. b) Mean relative abundance of gut bacterial species by CellScanner across six biological replicates (see Supplementary Figure 10C for the CellScanner proportions without unknowns). Events on which the classifiers did not agree are classified as unknown. Error bars represent standard deviation.

Figure 5. Comparison of 16S rRNA gene sequencing results (corrected for copy number) and CellScanner. a) Mean relative abundance of gut bacterial species by 16S rRNA gene sequencing across six biological replicates for the first technical replicate. b) Mean relative abundance of gut bacterial species by CellScanner across six biological replicates (see Supplementary Figure 10C for the CellScanner proportions without unknowns). Events on which the classifiers did not agree are classified as unknown. Error bars represent standard deviation.

Dynamics of total cell counts and heterogeneity

There was a reproducible decline in total cell counts between the end of the batch phase at 12 h and 25 h (, Wilcoxon rank-sum test p-value 0.0022). In the validation experiment, where the initial batch phase was reduced to 4 hours, we see a decline in cell count after 36 hours. In both experiments, the decline in total cell count is accompanied by a decrease in the abundance of BT ( a and b, Supplementary Figure 2A-B), the most abundant species at both time points. Taken together, this suggests that the decline in the BT population is responsible for the strong initial decline in cell numbers. Next, we quantified population heterogeneity as the mean of the range of flow cytometry channel values (e.g. the values of each of the 23 flow cytometry parameters, see Material and methods) and observed a large change within the first 24 hours, perhaps due to dead cells that are washed out in chemostat mode. Intriguingly, the heterogeneity then increases again over time (), though the difference between 25 and 67 hours is not significant (Wilcoxon rank-sum test p-value 0.09).

Figure 6. a) The cell density decreases significantly over time for the first data set (exp1, Pearson’s r: −0.76). The error bar shows the standard deviation across six biological replicates for the first experiment and three biological replicates for the validation experiment (exp2). Two different gating techniques were applied to distinguish cells from debris in flow cytometry data, namely manual gating and gating through supervised classification as implemented in CellScanner. The full graph of the validation experiment is shown in Supplementary Figure 11. b) Heterogeneity of flow cytometry data shown across the six biological replicates for the first data set.

Figure 6. a) The cell density decreases significantly over time for the first data set (exp1, Pearson’s r: −0.76). The error bar shows the standard deviation across six biological replicates for the first experiment and three biological replicates for the validation experiment (exp2). Two different gating techniques were applied to distinguish cells from debris in flow cytometry data, namely manual gating and gating through supervised classification as implemented in CellScanner. The full graph of the validation experiment is shown in Supplementary Figure 11. b) Heterogeneity of flow cytometry data shown across the six biological replicates for the first data set.

Discussion

Here, we investigated the dynamics of a defined community of human gut bacteria in continuous culture in a comparatively large number of replicates, while maintaining a high degree of environmental control. To our knowledge, this is the first time that the dynamics of a synthetic human gut bacterial community is investigated in chemostat mode using an automated multi-bioreactor system.

A limitation of the current study is the frequent contamination in monocultures despite their location inside a HEPA-filtered anaerobic chamber. We adopted a stricter cleaning routine for the validation experiment, which reduced contamination in the community below the detection level. Another limitation is that the growth stage of the inoculated strains (18 h pre-cultures) may affect the dynamics of the community, which was not further explored here.

We found that species abundances and metabolite concentrations in the community were reproducible across replicates, in agreement with previous results.Citation14,Citation16 However, we also observed that the technical variability of 16S rRNA gene sequencing exceeded the variability across vessels. Since we repeated the entire sequencing protocol three times, we do not know whether variation comes from DNA extraction, PCR, or the sequencing runs. We positioned blanks randomly and differently across replicate well plates, to avoid bias due to well-to-well contamination. Previous studies using mock communities have shown that most of the technical variation is due to extraction and amplification and not from the sequencing step itself.Citation26,Citation27

The high technical variability of 16S rRNA gene sequencing limits how much biological variability we can see. Therefore, we also applied an alternative method for the analysis of community composition based on flow cytometry data, implemented in CellScanner. Here, we found that trajectories across biological replicates were more similar to each other than they were for 16S rRNA sequencing data. Together with the low variation of metabolite data, this suggests that the technical variability of 16S rRNA sequencing inflates the variability observed with sequencing data and that the true biological variability is lower. However, CellScanner results are also potentially biased, since only the first monoculture time point was usable for BT and CA because of subsequent contamination. Changes in cell structure due to physiological changes from batch to continuous mode or species interactions could not be considered when training the classifiers. Thus, neither 16S rRNA gene sequencing nor CellScanner may have captured the exact composition. Alternative techniques such as whole-genome shotgun and full-length 16S rRNA gene sequencing are worth mentioning in this context, but they are work-intensive and it is questionable whether they reduce the technical variability compared to 16S rRNA gene sequencing. Quantitative PCR with species-specific primers is another alternative that delivers absolute abundances, which were observed previously to agree in sum with flow cytometry counts.Citation33 The advantage of supervised classification of flow cytometry data is that it gives absolute abundances without DNA extraction. However, like species-resolved quantitative PCR, it does not scale to complex communities.Citation25 Obtaining reliable species-specific cell counts in a high-throughput manner remains a major challenge when exploring community dynamics.

Both CellScanner and 16S rRNA gene sequencing agree on the main trend, namely a decrease in BT and an increase in other species, notably BH. BT grows faster than BH in batch. In chemostat, BT cannot sustain its high growth rate given the glucose feed and its abundance therefore declines. Other species are less dependent on glucose and exploit alternative carbon sources such as pyruvate. This is especially true for BH, which did not consume glucose in the first 12 h but lowered the pyruvate and trehalose concentrations instead. Interestingly, in an unintended invasion experiment of BH versus BT (initially, a BT monoculture), BH dominated at 67 h (Supplementary Figure 7A). However, BT won over BH in a two-species co-culture where a chemostat was emulated through serial transfer.Citation13 This may be due to the fact that the medium in the serial-transfer experiment contained starch in addition to sugar, which may have given BT, a known carbohydrate specialist,Citation28,Citation29 an edge. In our experiments, BT displays the behavior of a pioneer species that quickly exploits an easily accessible carbon source but is then replaced, to an extent, by slower growing but metabolically more versatile species. This also fits observations in cohort studies, which show that the Bacteroides 2 enterotype dominates infant microbiota after weaningCitation30 but is less frequent in adults. PC disappeared quickly from the community; after 25 hours the abundance was already bordering the detection limit of 16S rRNA gene sequencing. We do not know whether it was surviving in the community with a cell density below the detection limit, whether it was washed out, or whether it was outcompeted by other species in the community. A serial transfer experiment in a similar medium previously showed that BT outcompetes PC.Citation13

Transient dynamics in which a species loses its dominant position was also observed in two other continuous cultures with defined human gut communities. In both cases, the community was first dominated by Escherichia coli, which then declined in favor of Bacteroides thetaiotaomicron.Citation14,Citation15 The observations described here are to our knowledge the first instance of a defined human gut community where a Bacteroides species decreases while other gut species increase and as such may contribute to understanding the enterotype shifts observed in fecal time-series data.

The metabolite consumption and production patterns agree with previous observations and with predictions based on metabolic reconstructions from the AGORA database (Supplementary Table S2).Citation31 For instance, we saw a decrease in formate in the community. It has been shown before that formate produced in co-culture can be converted to acetate by BH using the Wood-Ljungdahl pathway.Citation32–34 It is a challenge to disentangle the contributions of different species to metabolite concentrations in a community. Although meta-omics can provide clues of pathway activation in a community context in different conditions,Citation35 it does not measure metabolite uptake and production rates. We saw that some species-metabolite correlations computed from co-culture data agree with monoculture data (e.g. BH consumes pyruvate, trehalose, and formate and produces acetate in monoculture and is negatively correlated with the three former and positively with the latter for CellScanner data) but there are also several correlations that do not reflect consumption/production (such as the negative correlation of BH with succinate, which is an indirect relationship due to the decrease of BT, the only succinate producer). Species may alter their metabolism when encountering low levels of their preferred nutrient and/or in the presence of interaction partners,Citation32 and thus determining who produces and consumes what in this community is a task for the future.

Our data suggest that the community reached a steady state. We also note that all replicates followed similar trajectories i.e. only one state was reached. It is possible that this community does not have alternative stable states (i.e. it is not multi-stable). Alternatively, the different stable states for the species combination could be too far apart for small variations to trigger a change in community state. A different community state might also be too rare to see using six replicates. Further research is necessary to test whether initial differences in species abundances or perturbations would alter the final state reached.

In conclusion, we have shown that the defined human gut community investigated here is a highly deterministic system where transient dynamics (succession) is reproducible, and the observed variation is due in large part to the technical variability of 16S rRNA gene sequencing data. This further encourages the development of mathematical models for gut microbial communities.

Material and methods

Strains and inoculum preparation

The following gut bacterial species were used in this study: Roseburia intestinalis (DSM 14610), Blautia hydrogenotrophica (DSM 10507), Bacteroides thetaiotaomicron (DSM 2079), Faecalibacterium prausnitzii (DSM 17677), Prevotella copri (DSM 18205), and Collinsella aerofaciens (Raes Culture Collection 1366).

All bacteria were precultured twice at 37°C without agitation under anaerobic conditions in a Don Whitley A135 or A35 Anaerobic Workstation with HEPA filter (10% H2, 10% CO2, 80% N2, 55% humidity). First, strains were precultured for 48 hours in a modified Gifu Anaerobic Medium broth (mGAM,Citation36 HyServe), except for F. prausnitzii DSM17677, which was grown in Reinforced Clostridial Medium broth (RCM, Oxoid). The strains were subsequently cultured for 18 hours in Wilkins-Chalgren anaerobe broth (WC, Oxoid) before inoculation in the Ambr bioreactors.

The microbioreactors were inoculated to a concentration of roughly 1*106 – 8*106 cells per ml. The number of cells for this inoculation was determined by flow cytometry, after which the cells were diluted in a WC medium and in the case of the community cultures mixed together in even ratios. 1 ml of each monoculture or community inoculum was used to inoculate the bioreactors to a total volume of 10 ml. Continuous feeding and sampling started 12 hours after inoculation to prevent washout of slower-growing strains. Emulated chemostat fermentations were performed using an Ambr® 15 Fermentation (Ambr 15 f) system (Sartorius Stedim Biotech, Royston, UK) located inside a Don Whitley A155 Anaerobic Workstation with a HEPA filter (10% H2, 10% CO2, 80% N2, 55% humidity). We used one culture station containing 12 single-use microbioreactors () with a working volume of 8–12 ml. The vessels were kept at 37°C (± 0.3°C) by individual vessel heaters, while each vessel was stirred at 300 rpm by a single Rushton-like impeller. Each vessel contains optical sensors for pH and dissolved oxygen (DO) monitoring (12 s per cycle).

The Ambr 15 f has a liquid handler for automated pipetting and two pumped liquid lines to each individual vessel for feed and base addition that deliver liquid in 5 μl shots. The feed (WC anaerobe broth) was pumped in continuously at 6.9 μl/min, resulting in a complete changeover of the medium in the vessels in 24 hours. Samples (250 μl) were pipetted into a cooled plate (4°C). To prevent the vessels from overflowing due to the continuous feed, and to create an emulated chemostat, excess liquid was pipetted out at regular intervals into a waste plate to maintain an approximate volume of 10 ml per bioreactor. During daytime, 840 μl was pipetted out every 2 hours. During the night, 1680 μl was pipetted out every 4 hours.

The pH was measured online using fluorescent sensor patches, and an additional base (0.5 mM NaOH) was automatically added with a target pH of 6.5 and a trigger pH of 6.3 if needed. The pH was measured offline using an analysis module, which automatically re-calibrates the online sensors if necessary. A minimum of 0.3 ml (blank) up to 1.6 ml (BT monoculture) of base was added over a period of 67 hours. Dissolved oxygen tension (DOT) was measured online using fluorescent sensor patches, but as we keep the Ambr 15 f in a controlled, oxygen-free environment, it was measured below 0.25% during all experiments. While this experimental design enabled parallel fermentation and automatization of sample collection, it did not allow quantifying the concentrations of gases.

One hundred microliters of the samples taken were used for cell count by flow cytometry. The rest was split in half in 96-well plates (75 μl each), and the cells were pelleted by centrifugation at 3220 RCF for 10 minutes. Both pellet- and supernatant plates were stored at −80°C for further use. Samples taken at 12, 25, 37, 49, 61 and 67 hours after inoculation were used for16S rRNA gene sequencing (pellets) and HPLC (supernatant).

Validation experiment

The inoculum for the validation experiment was based on OD600, where we inoculated RI, BH, and FP with a final OD of 0.004 and BT with a final OD of 0.002 in triplicate. The pH was monitored but not adjusted. We started the feed earlier at 4 hours after the inoculation. Excess medium was taken out every 3 hours after starting the feed, including samples every 6 hours. In addition, we ran this experiment for 258 hours.

Cell counting with flow cytometry

For the inoculation, monocultures were serially diluted 1000x in PBS to an approximate cell density of 106 cells per ml. The samples taken during the experiment were serially diluted 100x in PBS. All the samples were stained with 1 μl/ml SYBR green I (1:100 dilution in dimethyl sulfoxide; 20 min incubation at 37°C; 10.000 concentrate, Thermo Fisher Scientific) following the protocols previously described.Citation25,Citation32,Citation37–39

A CytoFLEX S flow cytometer (Beckman Coulter) was used in the experiments. This resulted in a multiparametric description of each sample consisting of 23 parameters (FSC-A, FSC-H, SSC-A, SSC-H, FL1-A, FL1-H, FL2-orange-A, FL2-orange-H, FL3-red-A, FL3-red-H, FL4-A, FL4-H, APC-A750-A, APC-A750H, VSSC-A, VSSC-H, KO525-A, KO252-H, mCherry-A, mCherry-H, PI-A, PI-H, and FSC-Width). These parameters were used as classifiers by CellScanner (see below). During the study period, the instrument was calibrated daily with CytoFLEX Daily QC Fluorospheres. A blank control containing only medium compared to SYBR positive communities was used to gate for total cell counts in the FL1-A channel, resulting in a threshold on FL1-A of ± 3,5*103.

All the events were quantified using a volumetric method (events/μl). The same cell counts were used for calculating the absolute abundances of the three 16S rRNA gene sequencing replicates. We measured the technical replicates of two dilutions (100x and 1000x) of different species for two different timepoints, ± 20 minutes apart (Supplementary Table S3).

Metabolite measurements

Frozen supernatant was thawed from −80°C (30 minutes, room temperature). Concentrations of trehalose, glucose, pyruvic acid, succinic acid, lactic acid, formic acid, acetic acid, propionic acid, iso-butyric acid, butyric acid, and iso-valeric acid in the supernatant were measured on an Agilent 1200 HPLC (Diegem, Belgium) system equipped with an Aminex HPX-87 H column (Bio-rad, Temse, Belgium). The column was kept at 40°C and eluted with 5 mM H2SO4 at a rate of 0.60 mL/min. Organic- and fatty acids were detected with UV (Agilent DAD, at 210 nm) and refractive index changes (Agilent RID, at 40°C). Trehalose and glucose were detected with RID only.Citation40

16S rRNA gene sequencing and data processing

Extraction of DNA was carried out following an adapted protocol described by Falony et al. using the MagAttract PowerMicrobiome DNA/RNA Kit KF (Qiagen) according to the manufacturer’s instructions with the addition of a heating step (10 minutes at 90°C) after bead beating to increase DNA yield and removal of β-mercaptoethanol.Citation41 The DNA was extracted and purified by the freedom evo TECAN extraction platform (Tecan, Switzerland) by means of several washing steps using ClearMag magnetic particles. Subsequently, a primer pair 515 F (5’ GTGYCAGCMGCCGCGGTAA 3’)/806 R (5’ GGACTACNVGGGTWTCTAAT 3’) was used to amplify the V4 region of the 16S rRNA gene by adding adapters and barcodes for sequencing. Sequencing was performed using the Illumina MiSeq platform to generate paired-end reads of 250 base pairs.

After demultiplexing with the bioinformatics tool sdm (simple demultiplexer) as part of the LotuS pipeline without allowing for mismatches, fastq sequences were preprocessed using DADA2 pipeline v1.14.1.Citation42,Citation43 The forward and reverse fastq files were submitted to ENA with the accession number PRJEB51873.

The taxonomy was assigned initially using the RDP classifier v2.13,Citation44 but for taxa that were not correctly identified (e.g. only identified up to UC_g or UC_f, see Supplementary Table S4) the sequence variants were aligned using the BLAST Sequence Analysis Tool with refseq_rna to ensure accurate assignment of the species.Citation45 Since our community is defined, we excluded species that were not community members assuming that they were contaminants introduced by the 16S rRNA gene sequencing pipeline, unless they were identified in more than one sample (Pseudomonas in the first experiment, see Supplementary Figure 3).

Taxon counts were corrected with 16S rRNA gene copy numbers of the exact strains retrieved from the rRNA operon copy number database rrnDB, and the National Center for Biotechnology Information (NCBI, Bethesda (MD): National Library of Medicine (US)) by looking at the whole-genome sequence summary (e.g. https://www.ncbi.nlm.nih.gov/nuccore/223987233). Sequencing data were converted to relative abundances and multiplied by the total cell count of the sample obtained by flow cytometry to obtain absolute abundance.Citation46

Three technical replicates of 16S rRNA gene sequencing were performed in this study following the pipeline described above. The first technical replicate was prepared and sequenced separately from the other technical replicates. The second and third technical replicates were later processed together on the same plate during extraction, PCR and 16S rRNA sequencing runs. Due to the different placement of the six blank wells in the extraction- and PCR plates, the samples did not have the same neighboring samples as their technical replicates. The samples were handled by the same persons for all three technical replicates to minimize protocol bias.

Event classification in mixed cultures with CellScanner

CellScanner is a standalone tool (manuscript submitted), which relies on supervised classification to train classifiers on flow cytometry data of monocultures to assign events in mixed cultures to species. The same principle is also used to automatically gate raw flow cytometry data. This gating strategy was used for all CellScanner data presented in this paper.

Machine gating was performed by training six random forest classifiers on flow cytometry data of monocultures and the blank vessel, respectively. These classifiers then recognize events as either cells or debris. Next, 10 random forest classifiers were trained and tested on 5000 random events for each gated monoculture dataset (4286 events for training and 714 events for testing). Next, each classifier predicts the species to which an event belongs in the community. When at least seven classifiers agree on a species, the event is classified as that species, else it is labeled as unknown. The monoculture of PC was already contaminated at the first measured timepoint, and since in addition PC was not consistently found in the 16S rRNA sequencing data, it was left out of the CellScanner analysis. Since we only had non-contaminated monocultures available at 67 h for RI and BH, we used monoculture data from 12 h for BT and CA and monoculture data from matching time points for RI and BH. In fact, by selecting monoculture datasets from different time points for the training, this grouping/combination resulted in the highest accuracy (assessed in silico) with fewer events assigned as unknown (Supplementary Figure 12).

Statistical analyses

The PCoA plots were computed with R package vegan 2.5.7 using Bray Curtis dissimilarity and vegan’s envfit function to include metabolite data. R packages ggplot2 (3.3.6) and ggpubr (0.4.0) were used to create violin plots.

Heterogeneity in flow cytometry data was computed as the sum of the mean range for each channel across all events after gating.

Data sharing statement

The 16S rRNA gene sequencing data have been submitted to ENA with accession number PRJEB51873 and will be made available upon acceptance of this work. The processed data and R code for the figures is available at: http://msysbiology.com/supplements.html#variability CellScanner is available at: http://msysbiology.com/cellscanner.html.

Supplemental material

Supplemental Material

Download Zip (2.6 MB)

Acknowledgments

We thank Daniel Rios Garza for helpful discussions, Anna Krzynowek for helping with the SILVA database, Raul Yhossef Tito Tadeo for processing raw 16S reads and Leen Rymenans for technical assistance during 16S library preparation. This project was supported by funding from the Research Foundation—Flanders (grant no. G0I0918N) and from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program under grant agreement no. 801747.

Disclosure statement

The authors report no conflict of interest.

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/19490976.2022.2155019

Additional information

Funding

This work was supported by the H2020 European Research Council (ERC) [801747]; Research Foundation Flanders [G0I0918N].

References

  • Blumberg R, Powrie F. Microbiota, disease, and back to health: a metastable journey. Sci Transl Med. 2012;4(137):137rv7. doi:10.1126/scitranslmed.3004184.
  • Bäckhed F, Ley RE, Sonnenburg JL, Peterson DA, Gordont JI, Backhed F. Host-bacterial mutualism in the human intestine. Science. 2016;307(5717):1915–15. doi:10.1126/science.1104816.
  • Lloyd-Price J, Abu-Ali G, Huttenhower C. The healthy human microbiome. Genome Med. 2016;8(1). doi:10.1186/s13073-016-0307-y.
  • Faith JJ, Guruge JL, Charbonneau M, Subramanian S, Seedorf H, Goodman AL, Clemente JC, Knight R, Heath AC, Leibel RL, et al. The long-term stability of the human gut microbiota. Science. 2013;341(6141):1237439. doi:10.1126/science.1237439.
  • Caporaso JG, Lauber CL, Costello EK, Berg-Lyons D, Gonzalez A, Stombaugh J, Knights D, Gajer P, Ravel J, Fierer N, et al. Moving pictures of the human microbiome. Genome Biol. 2011;12(5):R50. doi:10.1186/gb-2011-12-5-r50.
  • Leeming ER, Johnson AJ, Spector TD, Roy CIL. Effect of diet on the gut microbiota: rethinking intervention duration. Nutrients. 2019;11(12):2862. doi:10.3390/NU11122862.
  • Frazier K, Chang EB. Intersection of the gut microbiome and circadian rhythms in metabolism. Trends Endocrinol Metab. 2020;31(1):25. doi:10.1016/J.TEM.2019.08.013.
  • Becks L, Hilker FM, Malchow H, Jürgens K, Arndt H. Experimental demonstration of chaos in a microbial food web. Nature. 2005;435(7046):1226–1229. doi:10.1038/nature03627.
  • Silverman JD, Durand HK, Bloom RJ, Mukherjee S, David LA. Dynamic linear models guide design and analysis of microbiota studies within artificial human guts. Microbiome. 2018;6(1):202. doi:10.1186/s40168-018-0584-3.
  • Possemiers S, Verthé K, Uyttendaele S, Verstraete W. PCR-DGGE-based quantification of stability of the microbial community in a simulator of the human intestinal microbial ecosystem. FEMS Microbiol Ecol. 2004;49(3):495–507. doi:10.1016/J.FEMSEC.2004.05.002.
  • van de Wiele T, van den Abbeele P, Ossieur W, Possemiers S, Marzorati M. The simulator of the human intestinal microbial ecosystem (SHIME ®). Impact Food Bioact Heal Vitr Ex Vivo Model. 2015;305–317. doi:10.1007/978-3-319-16104-4_27.
  • Minekus M. The TNO gastro-intestinal model (TIM). Impact Food Bioact Heal Vitr Ex Vivo Model. 2015. doi:10.1007/978-3-319-16104-4_5.
  • Venturelli OS, Carr AC, Fisher G, Hsu RH, Lau R, Bowen BP, Hromada S, Northen T, Arkin AP. Deciphering microbial interactions in synthetic human gut microbiome communities. Mol Syst Biol. 2018;14(6):e8157. doi:10.15252/msb.20178157.
  • Krause JL, Schaepe SS, Fritz-Wallace K, Engelmann B, Rolle-Kampczyk U, Kleinsteuber S, Schattenberg F, Liu Z, Mueller S, Jehmlich N, et al. Following the community development of SIHUMIx–a new intestinal in vitro model for bioreactor use. Gut Microbes. 2020;11(4):1116–1129. doi:10.1080/19490976.2019.1702431.
  • Trosvik P, Rudi K, Strætkvern KO, Jakobsen KS, Næs T, Stenseth NC. Web of ecological interactions in an experimental gut microbiota. Environ Microbiol. 2010;12(10):2677–2687. doi:10.1111/j.1462-2920.2010.02236.x.
  • Oliphant K, Parreira VR, Cochrane K, Allen-Vercoe E. Drivers of human gut microbial community assembly: coadaptation, determinism and stochasticity. ISME J. 2019;13(12):3080–3092. doi:10.1038/s41396-019-0498-5.
  • Duncan SH, Hold GL, Barcenilla A, Stewart CS, Flint HJ. Roseburia intestinalis sp. nov., a novel saccharolytic, butyrate-producing bacterium from human faeces. Int J Syst Evol Microbiol. 2002;52(5):1615–1620. doi:10.1099/00207713-52-5-1615.
  • Bernalier A, Willems A, Leclerc M, Rochet V, Collins MD. Ruminococcus hydrogenotrophicus sp. nov., a new H2/CO2-utilizing acetogenic bacterium isolated from human feces. Arch Microbiol. 1996;166(3):176–183. doi:10.1007/s002030050373.
  • Liu C, Finegold SM, Song Y, Lawson PA. Reclassification of Clostridium coccoides, Ruminococcus hansenii, Ruminococcus hydrogenotrophicus, Ruminococcus luti, Ruminococcus productus and Ruminococcus schinkii as Blautia coccoides gen. nov., comb. nov., Blautia hansenii comb. nov., Blautia hydroge. Int J Syst Evol Microbiol. 2008;58(8):1896–1902. doi:10.1099/ijs.0.65208-0.
  • Kageyama A, Benno Y, Nakase T. Phylogenetic and phenotypic evidence for the transfer of Eubacterium aerofaciens to the genus Collinsella as Collinsella aerofaciens gen. nov., comb. nov. Int J Syst Bacteriol. 1999;49(2):557–565. doi:10.1099/00207713-49-2-557.
  • Eggerth AH, Gagnon BH. The bacteroides of human feces. J Bacteriol. 1933;25(4):389–413. doi:10.1128/jb.25.4.389-413.1933.
  • Hayashi H, Shibata K, Sakamoto M, Tomita S, Benno Y. Prevotella copri sp. nov. and Prevotella stercorea sp. nov., isolated from human faeces. Int J Syst Evol Microbiol. 2007;57(5):941–946. doi:10.1099/ijs.0.64778-0.
  • Arumugam M, Raes J, Pelletier E, Le Paslier D, Yamada T, Mende DR, Fernandes GR, Tap J, Bruls T, Batto JM, et al. Enterotypes of the human gut microbiome. Nature. 2011;473(7346):174–180. doi:10.1038/nature09944.
  • Costea PI, Hildebrand F, Manimozhiyan A, Bäckhed F, Blaser MJ, Bushman FD, De Vos WM, Ehrlich SD, Fraser CM, Hattori M, et al. Enterotypes in the landscape of gut microbial community composition. Nat Microbiol. 2017;3(1):8–16. doi:10.1038/s41564-017-0072-8.
  • van de Velde CC, Joseph C, Biclot A, Huys GRB, Pinheiro VB, Bernaerts K, Raes J, Faust K. Fast quantification of gut bacterial species in cocultures using flow cytometry and supervised classification. ISME Commun. 2022;2(1):40. doi:10.1038/s43705-022-00123-6.
  • Brooks JP, Edwards DJ, Harwich MD, Rivera MC, Fettweis JM, Serrano MG, Reris RA, Sheth NU, Huang B, Girerd P, et al. The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies Ecological and evolutionary microbiology. BMC Microbiol. 2015;15(1):66. doi:10.1186/S12866-015-0351-6.
  • Teng F, Darveekaran Nair SS, Zhu P, Li S, Huang S, Li X, Xu J, Yang F. Impact of DNA extraction method and targeted 16S-rRNA hypervariable region on oral microbiota profiling. Sci Rep. 2018;8(1):16321. doi:10.1038/s41598-018-34294-x.
  • Koropatkin NM, Cameron EA, Martens EC. How glycan metabolism shapes the human gut microbiota. Nat Rev Microbiol. 2012;10(5):323–335. doi:10.1038/nrmicro2746.
  • Tuson HH, Foley MH, Koropatkin NM, Biteen JS. The starch utilization system assembles around stationary starch-binding proteins. Biophys J. 2018;114(2):242–250. doi:10.1016/J.BPJ.2017.12.015.
  • Beller L, Deboutte W, Falony G, Vieira-Silva S, Tito RY, Valles-Colomer M, Rymenans L, Jansen D, Van Espen L, Papadaki MI, et al. Successional stages in infant gut microbiota maturation. MBio. 2021;12(6):e1857–21. doi:10.1128/MBIO.01857-21.
  • Magnúsdóttir S, Heinken A, Kutt L, Ravcheev DA, Bauer E, Noronha A, Greenhalgh K, Jäger C, Baginska J, Wilmes P, et al. Generation of genome-scale metabolic reconstructions for 773 members of the human gut microbiota. Nat Biotechnol. 2016;35(1):81–89. doi:10.1038/nbt.3703.
  • D’hoe K, Vet S, Faust K, Moens F, Falony G, Gonze D, Llorens-Rico V, Gelens L, Danckaert J, De Vuyst L, et al. Integrated culturing, modeling and transcriptomics uncovers complex interactions and emergent behavior in a three-species synthetic gut community. Elife. 2018;7:e37090. doi:10.7554/eLife.37090.
  • Laverde Gomez JA, Mukhopadhya I, Duncan SH, Louis P, Shaw S, Collie-Duguid E, Crost E, Juge N, Flint HJ. Formate cross-feeding and cooperative metabolic interactions revealed by transcriptomics in co-cultures of acetogenic and amylolytic human colonic bacteria. Environ Microbiol. 2019;21(1):259–271. doi:10.1111/1462-2920.14454.
  • Rey FE, Faith JJ, Bain J, Muehlbauer MJ, Stevens RD, Newgard CB, Gordon JI. Dissecting the in vivo metabolic potential of two human gut acetogens. J Biol Chem. 2010;285(29):22082–22090. doi:10.1074/JBC.M110.117713.
  • Shetty SA, Kostopoulos I, Geerlings SY, Smidt H, de Vos WM, Belzer C. Dynamic metabolic interactions and trophic roles of human gut microbes identified using a minimal microbiome exhibiting ecological properties. ISME J. 2022;16(9):2144–2159. doi:10.1038/s41396-022-01255-2.
  • Rettedal EA, Gumpert H, Sommer MOA. Cultivation-based multiplex phenotyping of human gut microbiota allows targeted recovery of previously uncultured bacteria. Nat Commun. 2014;5(1):4714. doi:10.1038/ncomms5714.
  • Vandeputte D, Kathagen G, D’hoe K, Vieira-Silva S, Valles-Colomer M, Sabino J, Wang J, Tito RY, De Commer L, Darzi Y, et al. Quantitative microbiome profiling links gut community variation to microbial load. Nature. 2017;551(7681):507–511. doi:10.1038/nature24460.
  • Özel Duygan BD, Hadadi N, Babu AF, Seyfried M, van der Meer JR. Rapid detection of microbiota cell type diversity using machine-learned classification of flow cytometry data. Commun Biol. 2020;3(1):379. doi:10.1038/s42003-020-1106-y.
  • Rubbens P, Props R, Boon N, Waegeman W. Flow cytometric single-cell identification of populations in synthetic bacterial communities. PLoS One. 2017;12(1):e0169754. doi:10.1371/journal.pone.0169754.
  • Bio Rad Laboratories. 1994. Guide to Aminex® HPLC columns: for food and beverage, biotechnology, and bio-organic analysis. http://www.hplc.sk/pdf/Biorad/Guide_to_Aminex_HPLC_columns.pdf
  • Falony G, Joossens M, Vieira-Silva S, Wang J, Darzi Y, Faust K, Kurilshikov A, Bonder MJ, Valles-Colomer M, and Vandeputte D, et al. Population-level analysis of gut microbiome variation. Science (80-). 2016;352(6285):560–564. doi:10.1126/science.aad3503.
  • Hildebrand F, Tadeo R, Voigt AY, Bork P, Raes J. LotuS: an efficient and user-friendly OTU processing pipeline. Microbiome. 2014;2(1):30. doi:10.1186/2049-2618-2-30.
  • Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. DADA2: high-resolution sample inference from Illumina amplicon data. Nat Methods. 2016;13(7):581–583. doi:10.1038/nmeth.3869.
  • Wang Q, Garrity GM, Tiedje JM, Cole JR. Naïve Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl Environ Microbiol. 2007;73(16):5261–5267. doi:10.1128/AEM.00062-07.
  • Madden T. The BLAST Sequence Analysis Tool. In: McEntyre J, Ostell J, editors. Bethesda (MD): NCBI Handb; 2002.
  • Stoddard SF, Smith BJ, Hein R, Roller BRK, Schmidt TM. rrnDB: improved tools for interpreting rRNA gene abundance in bacteria and archaea and a new foundation for future development. Nucleic Acids Res. 2015;43(D1):593–598. doi:10.1093/nar/gku1201.