1,236
Views
0
CrossRef citations to date
0
Altmetric
Research Paper

Comparative genomics and phenotypic studies to determine site-specificity of Escherichia coli in the lower gastrointestinal tract of humans

ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Article: 2223332 | Received 26 Apr 2023, Accepted 06 Jun 2023, Published online: 20 Jun 2023

ABSTRACT

Escherichia coli (E. coli) is an important commensal in the human gut; however, it is unknown whether strains show site-specificity in the lower gut. To investigate this, we assessed genotypic and phenotypic differences in 37 clone pairs (two strains with very similar multiple locus variable-number-tandem-repeat analysis [MLVA] profiles) of E. coli isolated from mucosal biopsies of two different gut locations (terminal ileum and rectum). The clone pairs varied at the genomic level; single nucleotide polymorphisms (SNPs) were common, multiple nucleotide polymorphisms (MNPs) were observed but less common, and few indels (insertions and deletions) were detected. The variation was higher in clone pairs that are associated with non-human-associated sequence types (ST) compared to human-associated STs, such as ST95, ST131, and ST73. No gene(s) with non-synonymous mutations were found to be commonly associated with either the terminal ileum or the rectal strains. At the phenotypic level, we identified the metabolic signatures for some STs. Rectum strains of some STs showed consistently higher metabolic activity with particular carbon sources. Clone pairs belonging to specific STs showed distinct growth patterns under different pH conditions. Overall, this study showed that E. coli may exhibit genomic and phenotypic variability at different locations in the gut. Although genomics did not reveal significant information suggesting the site-specificity of strains, some phenotypic studies have suggested that strains may display site-specificity in the lower gut. These results provide insights into the nature and adaptation of E. coli in the lower gut of humans. To the best of our knowledge, no study has investigated or demonstrated the site-specificity of commensal E. coli in the human gut.

Introduction

The site-specificity of commensal Escherichia coli (E. coli) in the human gut is poorly understood. Although several studies have examined the distribution of E. coli in the human gut, only one study has analyzed and compared E. coli from multiple locations in the lower gut.Citation1–5 This study found that, in some individuals, E. coli strains were detected in only one gut location, for example, in the ileum, but not in the rectum. Transient passage of strains is an unlikely explanation for the detection of strains in multiple gut locations, given that E. coli were isolated from mucosal biopsies, and the loose adherent mucus was washed away before excision. The authors suggested that there may be E. coli site-specificity in the lower gut. Studying site-specificity in the gut may provide a greater understanding of the genotypic and phenotypic changes that occur as strains adapt to different niches.

The presence of E. coli in the human gut is remarkably consistent, even though it comprises just 0.01–1% of total gut anaerobes.Citation6,Citation7 E. coli can be divided into nine phylogroups: the major phylogroups (A, B1, B2 and D) and minor phylogroups (C, E, F, G and H).Citation6 Strains belonging to B2 phylogroup are frequently detected in people living in developed regions, and phylogroup distribution is thought to be associated with food habits.Citation8,Citation9 Common human-associated B2 phylogroup strains which belong to sequence types (STs) ST73, ST95, and ST131, may cause extraintestinal infections, although these lineages have been recovered from asymptomatic humans.Citation10–13

Based on the time spent in the host, E. coli can be transient or persistent. While transient strains can disappear from the host gut within days or weeks via fecal discharge, persistent strains can persist for months to years. Strains of the B2 phylogroup have been widely reported as persistent colonizers of the human gut.Citation3,Citation14

Evolution-driven changes in E. coli residing in the gut may occur randomly or through selection. Site-specific selective pressure on E. coli in the human gut may occur due to differences in motility, nutrient concentration, pH, and epithelial cell types in different gut regions, which may favor strains with certain properties.Citation15–17 The evolution of E. coli genomes during residence within a host may also confer a survival advantage at a particular location.Citation18–20 Mutations in E. coli chromosome(s) can accumulate in subsequent generationsCitation20,Citation21 due to the influence of environmental stressesCitation22,Citation23 or the immune systemCitation24, resulting in genomic evolution. E. coli has been shown to persist in a host for up to six years, providing enough time for genomic evolution to occur.Citation25–27 In humans, the most and second most abundant E. coli are likely members of the same phylogroup.Citation5 New variants may appear due to a new colonization event or within-host evolution of a strain. Dixit et alCitation18 estimated that new colonization events and within-host evolution contributed equally to the diversity of E. coli strains found in the gut. In that study, within-host evolution was defined as two or more strains of the same ST, isolated from the same individual and varying between 2 and 50 single nucleotide polymorphisms (SNPs). Given these findings, it can be hypothesized that E. coli undergoes genomic and phenotypic changes to adapt to different gut locations.

Several studies have reported tissue tropism of E. coli in the intestine, but most have focused on animal origins (e.g., bovine, porcine, and rodent) and pathogenic E. coli, such as Enterohemorrhagic E. coli (EHEC), Enterotoxigenic E. coli (ETEC), and Enteropathogenic E. coli (EPEC).Citation28–31 In porcine studies, the recovery of E. coli phylogroups and genes for metabolite secretion was found to vary according to the gut regions.Citation32,Citation33 Some bacteriocin genes, such as colicin Ia/Ib, colicin B, and colicin M, were more frequently associated with strains isolated from the ileum than the duodenum, colon, or feces of pigs.Citation33 Therefore, it is plausible that commensal E. coli site-specificity may also exist in the human gut. To the best of our knowledge, no study has determined whether commensal E. coli shows site-specificity in the human gut.

To do so, we used two similar E. coli strains (clone pairs) isolated from mucosal biopsies taken from two different locations in the lower gut (terminal ileum and rectum) of the same individual. We sought to identify differences in the genetic and phenotypic characteristics of the clone pairs that may account for site-specificity and within-host adaptation.

Results

Similarity of the clone pairs

Molecular profiling revealed that the strains belonging to each clone pair were highly similar. A comparison of the terminal ileum and rectum strains showed that approximately 97% of clone pairs (36/37) were similar according to sequence types (STs), O-, H-, and fimH types (). One pair (P−37) revealed different O-types, but identical ST, H-, and fimH types. The clone pairs were found to carry genes conferring resistance to several groups of antimicrobial drugs. Among the 37 clone pairs, 36 (97%) carried similar antimicrobial resistance genes and one pair (P−13) showed variation, where genes aadA5_1 (aminoglycoside group), dfrA17_1 (trimethoprim group), and mph(A)_2 (MLS group: macrolide, lincosamide, and streptogramin) were present only in the terminal ileum strain (Figure S1).

Table 1. Basic molecular typing of Escherichia coli clone pairs according to ST, O-type, H-type and fimH type.

Plasmid analysis showed that 95% of the clone pairs carried similar plasmids; clone pairs P−7 and P−13 showed variation (Figure S2). Both the terminal ileum and rectum strains of P−13 carried the same number of plasmids (N = 8), where seven of the plasmids were the same and only one differed per strain. Col156_1 was present only in the terminal ileum and Col440II_1 was present only in the rectal strain. For virulence factor profiling, we considered only those genes frequently reported for the B2 phylogroupCitation34–38, which included afaA, agn43, aslA, cdtB, chuA, clbB, clbN, cnf1, csgA, cvac, draD, draP, fimH, focG, fyuA, hlyA, hlyD, ibeA, ireA, iroN, iss2, iutA, kpsM, malX, ompA, ompt, paa, papA, papC, pic, sat, sfaA, sfaS, trat, usp and vat. The terminal ileum and rectum strains of all the clone pairs had identical virulence factor genes ().

Figure 1. Profiling of virulence factor genes in Escherichia coli clone pairs. All clone pairs had identical virulence factor profiles. Blank and colored boxes indicate the absence and presence of the virulence factor genes, respectively. Ti, Terminal ileum; R, Rectum.

Figure 1. Profiling of virulence factor genes in Escherichia coli clone pairs. All clone pairs had identical virulence factor profiles. Blank and colored boxes indicate the absence and presence of the virulence factor genes, respectively. Ti, Terminal ileum; R, Rectum.

Variation in mutations in the clone pairs

The Pathosystems Resource Integration Center (PATRIC, which operates under the Bacterial and Viral Bioinformatics Resource Center [BV-BRC; https://www.bv-brc.org/]) Variation Analysis provides genetic variation outputs in the form of single and multiple nucleotide polymorphisms (SNPs and MNPs), and insertions and deletions (Indels). Confidence scores are applied that correspond to calls such as synonymous mutations, non-synonymous mutations, and frameshifts. On average, each clone pair varied by 78 mutations (range: 4–200, Figure S3). SNPs were the most common type of genomic variation, followed by MNPs, and few Indels were observed. Analysis of means (ANOM) for STs represented by two or more clone pairs showed that the mean number of mutations between clone pairs did not vary within the human-associated STs, ST73, ST95, and ST131 (p > 0.05). However, the mean number of mutations for these human-associated STs was significantly lower (<50) than that for clone pairs belonging to non-human-associated STs, ST80, ST372, and ST569 (>130; p ≤ 0.05 [). We also determined whether the distribution of mutational variation between clone pairs differed according to the demographic characteristics of the individuals. The distribution of mutational variation did not vary according to the disease status (), sex, or age of the individuals (p > 0.05).

Figure 2. Analysis of means (ANOM) report for mutational differences between clone pairs according to ST. Red tip indicates either significantly lower or higher mutational variation between clone pairs for each ST. UDL, upper decision limit; LDL, lower decision limit. Different upper and lower decision limits are due to variations in sample size of clone pairs for each ST. Mutational differences between clone pairs of ST73, ST95 and ST131 are not significantly different; mutational differences in ST80 are significantly different from all other STs; and mutational differences in ST372 and ST569 are similar and significantly different from other STs. The statistical test was based on pair-wise comparisons of mutational differences for each ST using student’s t-test and ordered difference report. STs containing only a single clone pair (ST14, ST91, ST491, ST550, ST636, ST1193, and ST11174) were not included in the analysis.

Figure 2. Analysis of means (ANOM) report for mutational differences between clone pairs according to ST. Red tip indicates either significantly lower or higher mutational variation between clone pairs for each ST. UDL, upper decision limit; LDL, lower decision limit. Different upper and lower decision limits are due to variations in sample size of clone pairs for each ST. Mutational differences between clone pairs of ST73, ST95 and ST131 are not significantly different; mutational differences in ST80 are significantly different from all other STs; and mutational differences in ST372 and ST569 are similar and significantly different from other STs. The statistical test was based on pair-wise comparisons of mutational differences for each ST using student’s t-test and ordered difference report. STs containing only a single clone pair (ST14, ST91, ST491, ST550, ST636, ST1193, and ST11174) were not included in the analysis.

Figure 3. Distribution of mutational variation of clone pairs between patients with and without inflammatory bowel disease (IBD). IBD, individuals with Crohn’s disease (CD, eight clone pairs from seven individuals) and ulcerative colitis (UC, three clone pairs from three individuals); N, no disease (individuals without IBD or diarrheal conditions, 26 clone pairs from 24 individuals).

Figure 3. Distribution of mutational variation of clone pairs between patients with and without inflammatory bowel disease (IBD). IBD, individuals with Crohn’s disease (CD, eight clone pairs from seven individuals) and ulcerative colitis (UC, three clone pairs from three individuals); N, no disease (individuals without IBD or diarrheal conditions, 26 clone pairs from 24 individuals).

Analysis of mutational type, snpEFF-type, and snpEFF-impact

PATRIC was used to provide additional information, such as mutational type, snpEFF-type, and snpEFF-impact of the mutations. Of the total mutations identified, 76% were synonymous, 23% were non-synonymous, and the remaining were indels (). The terminal ileum and rectum strains of all STs did not vary significantly in carriage of synonymous or non-synonymous mutations. Synonymous mutations were excluded from the snpEFF-type analysis. The remaining mutations were missense variants, mutations in intergenic regions, frameshift variants, stop-gain variants, and stop-lost variants. Missense variants and mutations in intergenic regions were comparatively more common than those in other types. Although frameshift variants appeared in some pairs, there were very few stop-lost or stop-gain variants. However, neither the terminal ileum nor the rectum strains of any ST varied in terms of carrying missense variants, frameshift variants, or mutations in intergenic regions. Most of the non-synonymous mutational impacts were low (69%), followed by moderate (20%), modifier (10%), and high (1%) [data not shown]. Within an ST, neither the terminal ileum nor the rectum strains varied with respect to carrying high, moderate, or modifier impacts.

Figure 4. Frequency of types of mutations in Escherichia coli clone pairs across all STs. Y-axis at the left-hand side indicates the frequency of synonymous mutations, the right-hand side indicates the frequency of non-synonymous mutations and insertions and deletions. Statistical tests were performed for STs represented by two or more clone pairs only, and included: ST73, ST80, ST95, ST131, ST372 and ST569. Ti, Terminal ileum; R, Rectum.

Figure 4. Frequency of types of mutations in Escherichia coli clone pairs across all STs. Y-axis at the left-hand side indicates the frequency of synonymous mutations, the right-hand side indicates the frequency of non-synonymous mutations and insertions and deletions. Statistical tests were performed for STs represented by two or more clone pairs only, and included: ST73, ST80, ST95, ST131, ST372 and ST569. Ti, Terminal ileum; R, Rectum.

Mutations in particular genes were not common to either terminal ileum or rectum strains

We sought to identify mutated gene(s) specific to the terminal ileum or rectum strains. The frequency with which a gene was associated with non-synonymous mutations was compared between the terminal ileum and rectum strains. Genes with the same frequency between the terminal ileum and rectum strains were excluded because they did not provide any distinguishing information. If mutations were associated with a gene exclusively related to either the terminal ileum or rectum strain, they were included in the analysis. From the gene lists, we excluded genes that were not likely to be core genes, such as genes encoding phage-associated proteins, plasmid conjugative transfer proteins (e.g., pilus assembly), transposases, mobile element proteins, insertion elements, iron uptake systems, and antimicrobial drug-associated proteins.

Comparing the terminal ileum and rectum strains overall and by ST, we did not find any gene(s) that were commonly associated with either the terminal ileum or rectum strains (Table S1a-S1m). We found numerous non-synonymous mutations associated with hypothetical proteins (HP). To distinguish hypothetical proteins, we recorded their upstream features (UF) and downstream features (DF) whenever possible. Although no signature gene(s) were identified, we observed that some genes appeared more commonly in the terminal ileum and rectum strains. Trehalose−6-phosphate hydrolase or genes related to trehalose appeared comparatively more commonly in the terminal ileum strains (Tables S1a, S1h, S1i, and S1l) and the uncharacterized protein YeeL in the rectum strains (Tables S1a, and S1k). In ST131, possible periplasmic thioredoxin (a specific protein name was not detected) was slightly more common in terminal ileum strains (Table S1b). In ST80, a hypothetical protein (no upstream or downstream features), in ST73 another hypothetical protein (downstream feature uncharacterized protein YfjI), and ST372 catalase KatE-intracellular protease were slightly more common in the terminal ileum strains (Tables S1c, S1d, and S1f). D-mannonate oxidoreductase and UPF0192 protein YfaS were found to be slightly more common in the rectal strains of ST569 (Table S1e).

Metabolism differences between terminal ileum and rectum strains

Biolog GENIII MicroPlates (CellBioscience) were used to biochemically profile the total collection of the terminal ileum and rectum strains. Clone pairs were included in the analysis if the terminal ileum and rectum strains showed evidence of a metabolic response in the presence of a specific carbon source but varied in their optical densities (ODs) [marked black color in Table S2]. As we did not have multiple assay replicates for each strain, we restricted our analysis to a comparison of the total collection of the terminal ileum and rectum strains and within ST comparisons of those that carried multiple clone pairs. The overall comparison showed that the rectum strains had higher ODs in D-fucose (p = 0.0317), L-arginine (p = 0.0428), guanidine HCl (p = 0.0303), and aztreonam (p = 0.0462) [], and the terminal ileum strains had higher ODs in glucuronamide (p = 0.0286) [].

Figure 5. Comparison of OD values for the total collection terminal ileum (Ti) and rectum (R) clone pair strains of E. coli grown in various carbon sources. (a) Plots showing carbon sources where rectum strains showed higher metabolic activity than the ileum strains. (b) Plot showing that ileum strains had higher metabolic activity than rectum strains in glucuronamide. Data shown only for carbon sources where the ileum and rectum strains significantly varied in their metabolic activity. Paired t-tests were performed for this analysis. Each dot indicates the mean (x-axis) and difference (y-axis) for each clone pair. The red line or the intersected point of red lines indicates the mean difference (y-axis), the dashed red lines indicate the upper and lower confidence intervals (95%). MB, metabolism; >, indicates which strains (rectum or ileum) showed higher metabolic activity.

Figure 5. Comparison of OD values for the total collection terminal ileum (Ti) and rectum (R) clone pair strains of E. coli grown in various carbon sources. (a) Plots showing carbon sources where rectum strains showed higher metabolic activity than the ileum strains. (b) Plot showing that ileum strains had higher metabolic activity than rectum strains in glucuronamide. Data shown only for carbon sources where the ileum and rectum strains significantly varied in their metabolic activity. Paired t-tests were performed for this analysis. Each dot indicates the mean (x-axis) and difference (y-axis) for each clone pair. The red line or the intersected point of red lines indicates the mean difference (y-axis), the dashed red lines indicate the upper and lower confidence intervals (95%). MB, metabolism; >, indicates which strains (rectum or ileum) showed higher metabolic activity.

In more than 75% of the observed variations, the rectal strains showed higher metabolic activity than the ileal strains. ST-wise comparisons revealed further variation between the terminal ileum and rectum strains with respect to the use of carbon sources and chemicals (p ≤ 0.05 or 0.01). In all ST131 and ST569 isolates, the rectal strains consistently showed higher metabolism than the ileal strains (). The rectal strains of multiple STs showed higher metabolic activity than the ileal strain in the presence of D-fructose−6-phosphate (ST131 in ; ST73 and ST569 in ), glycyl-L-proline (ST131 in and ST569 in ), and formic acid (ST131 in and ST372 in Figure S4). In contrast, the terminal ileum strains of multiple STs showed higher metabolic activity than the rectal strains in the presence of glucuronamide (). The terminal ileum and rectum strains of ST131 did not show variation at pH 6.0, but did at pH 5.0; the rectum strains had higher ODs at pH 5.0 (). The terminal ileum and rectum strains of ST80 varied when grown in 4% NaCl but not in 1% or 8% NaCl; rectum strains had higher ODs when grown in 4% NaCl (Figure S4).

Figure 6. Comparison of OD values for the terminal ileum (Ti) and rectum (R) strains of E. coli ST131 clone pairs (eight clone pairs) grown in various carbon sources. Data shown only for carbon sources where the ileum and rectum strains significantly varied in their metabolic activity (D-Gluconic Acid [p = 0.0338], Formic Acid [p = 0.0424], Glycyl-L-Proline [p = 0.0143], L-Alanine [p = 0.0495], Bromo-Succinic Acid [p = 0.0276], Dextrin [p = 0.0257], D-Maltose [p = 0.0153], β-Methyl-D-Glucoside [p = 0.0083], L-Arginine [p = 0.0146], D-fructose −6-phosphate [p = 0.0469], pH 5 [p = 0.0361], Niraproof 4 [p = 0.0438]). Paired t-tests were performed for this analysis. Each dot indicates the mean (x-axis) and difference (y-axis) for each clone pair. The red line or the intersected point of red lines indicates the mean difference (y-axis), the dashed red lines indicate the upper and lower confidence intervals (95%). MB, metabolism; >, indicates which strains (rectum or ileum) showed higher metabolic activity.

Figure 6. Comparison of OD values for the terminal ileum (Ti) and rectum (R) strains of E. coli ST131 clone pairs (eight clone pairs) grown in various carbon sources. Data shown only for carbon sources where the ileum and rectum strains significantly varied in their metabolic activity (D-Gluconic Acid [p = 0.0338], Formic Acid [p = 0.0424], Glycyl-L-Proline [p = 0.0143], L-Alanine [p = 0.0495], Bromo-Succinic Acid [p = 0.0276], Dextrin [p = 0.0257], D-Maltose [p = 0.0153], β-Methyl-D-Glucoside [p = 0.0083], L-Arginine [p = 0.0146], D-fructose −6-phosphate [p = 0.0469], pH 5 [p = 0.0361], Niraproof 4 [p = 0.0438]). Paired t-tests were performed for this analysis. Each dot indicates the mean (x-axis) and difference (y-axis) for each clone pair. The red line or the intersected point of red lines indicates the mean difference (y-axis), the dashed red lines indicate the upper and lower confidence intervals (95%). MB, metabolism; >, indicates which strains (rectum or ileum) showed higher metabolic activity.

Figure 7. Comparison of OD values for the terminal ileum (Ti) and rectum (R) strains of E. coli ST569 clone pairs (three clone pairs) grown in various carbon sources. Data shown only for carbon sources where the ileum and rectum strains significantly varied in their metabolic activity (Glycyl-L-Proline [p = 0.0043], L-Galactonic Acid Lactone [p = 0.0291], Mucic Acid [p = 0.0289], and Rifamycin SV [p = 0.0072]). Paired t-tests were performed for this analysis. Each dot indicates the mean (x-axis) and difference (y-axis) for each clone pair. The red line indicates the mean difference (y-axis), the dashed red lines indicate the upper and lower confidence intervals (95%). MB, metabolism; >, indicates which strains (rectum or ileum) showed higher metabolic activity.

Figure 7. Comparison of OD values for the terminal ileum (Ti) and rectum (R) strains of E. coli ST569 clone pairs (three clone pairs) grown in various carbon sources. Data shown only for carbon sources where the ileum and rectum strains significantly varied in their metabolic activity (Glycyl-L-Proline [p = 0.0043], L-Galactonic Acid Lactone [p = 0.0291], Mucic Acid [p = 0.0289], and Rifamycin SV [p = 0.0072]). Paired t-tests were performed for this analysis. Each dot indicates the mean (x-axis) and difference (y-axis) for each clone pair. The red line indicates the mean difference (y-axis), the dashed red lines indicate the upper and lower confidence intervals (95%). MB, metabolism; >, indicates which strains (rectum or ileum) showed higher metabolic activity.

Figure 8. Comparison of OD values for the terminal ileum (Ti) and rectum (R) strains of E. coli of multiple STs (ST73: three clone pairs; ST569: three clone pairs, and ST95: ten clone pairs), when grown in the same carbon sources. (a) Plots showing that the rectum strains of multiple STs had higher metabolic activity than the ileum strains in D-fructose −6-phosphate (ST73 [p = 0.0494], and ST569 [p = 0.0239]). (b) Plots showing that the ileum strains of multiple STs had higher metabolic activity than the rectum strains in glucuronamide (ST73 [p = 0.0395], and ST95 [p = 0.0397]). Data shown only for carbon sources where the ileum and rectum strains significantly varied in their metabolic activity. Paired t-tests were performed for this analysis. Each dot indicates the mean (x-axis) and difference (y-axis) for each clone pair. The red line indicates the mean difference (y-axis), the dashed red lines indicate the upper and lower confidence intervals (95%). MB, metabolism; >, indicates which strains (rectum or ileum) showed higher metabolic activity.

Figure 8. Comparison of OD values for the terminal ileum (Ti) and rectum (R) strains of E. coli of multiple STs (ST73: three clone pairs; ST569: three clone pairs, and ST95: ten clone pairs), when grown in the same carbon sources. (a) Plots showing that the rectum strains of multiple STs had higher metabolic activity than the ileum strains in D-fructose −6-phosphate (ST73 [p = 0.0494], and ST569 [p = 0.0239]). (b) Plots showing that the ileum strains of multiple STs had higher metabolic activity than the rectum strains in glucuronamide (ST73 [p = 0.0395], and ST95 [p = 0.0397]). Data shown only for carbon sources where the ileum and rectum strains significantly varied in their metabolic activity. Paired t-tests were performed for this analysis. Each dot indicates the mean (x-axis) and difference (y-axis) for each clone pair. The red line indicates the mean difference (y-axis), the dashed red lines indicate the upper and lower confidence intervals (95%). MB, metabolism; >, indicates which strains (rectum or ileum) showed higher metabolic activity.

Growth rate variability between terminal ileum and rectum strains in different pH conditions

The overall comparison showed that the maximum growth rates of the terminal ileum and rectum strains varied significantly at pH 4.10; the terminal ileum strains had higher maximum growth rates than the rectum strains (). The clone pairs did not vary significantly under other pH conditions. For some STs, the terminal ileum and rectum strains showed variations across multiple pH ranges (). The maximum growth rates of the terminal ileum and rectum strains of ST131 showed significant variation at pH 4.10 and 4.40; the terminal ileum strains had higher growth rates under both pH conditions. Differences in growth rate were also observed for the terminal ileum and rectum strains belonging to ST80 and ST95 at different pH values. At pH 5.80, three STs showed significant variation between the terminal ileum and rectum strains. In all cases, the rectum strains showed significantly higher growth than the terminal ileum strains. At neutral pH (pH 7.20 and pH 7.60), two STs (ST80 and ST95) showed variation, with the terminal ileum strains showing higher growth rates. Each clone pair analysis showed that of the 37 clone pairs, a few clone pairs varied significantly according to the pH of their culture conditions (Figure S5). Considering only the pH conditions at which the maximum numbers of clone pairs showed variation, the terminal ileum strains showed a higher growth rate at pH 4.80, whereas the rectal strains showed higher growth rates at pH 5.80.

Figure 9. Overall comparison of maximum growth rates for the terminal ileum and rectum strains of E. coli clone pairs grown under different pH conditions. The analysis includes three technical replicates for each strain. Each error bar was produced using one standard deviation (±SD) from the mean. Paired t-tests were performed to determine the mean difference between the terminal ileum and rectum strains. **, p ≤ 0.01 otherwise p > 0.05.

Figure 9. Overall comparison of maximum growth rates for the terminal ileum and rectum strains of E. coli clone pairs grown under different pH conditions. The analysis includes three technical replicates for each strain. Each error bar was produced using one standard deviation (±SD) from the mean. Paired t-tests were performed to determine the mean difference between the terminal ileum and rectum strains. **, p ≤ 0.01 otherwise p > 0.05.

Figure 10. Maximum growth rate comparison between the terminal ileum and rectum strains of E. coli grown under different pH conditions across the STs. STs that had only one clone pair were not included in the analysis. Each error bar was produced using one standard deviation (±SD) from the mean. Paired t-tests were performed to determine the mean difference between the terminal ileum and rectum strains within an ST. *, p ≤ 0.05; **, p ≤ 0.01.

Figure 10. Maximum growth rate comparison between the terminal ileum and rectum strains of E. coli grown under different pH conditions across the STs. STs that had only one clone pair were not included in the analysis. Each error bar was produced using one standard deviation (±SD) from the mean. Paired t-tests were performed to determine the mean difference between the terminal ileum and rectum strains within an ST. *, p ≤ 0.05; **, p ≤ 0.01.

Discussion

E. coli strains with identical MLVA profiles (“clone pairs”), were isolated from the terminal ileum and rectum and showed genomic variation (mainly SNPs), suggesting that clone pairs may not be truly identical at the genomic level. These variations were likely due to random mutations with a neutral effect. This study did not track genomic changes in strains over time, but identified genomic differences between strains isolated from the ileum and those isolated from the rectum. As members of the B2 phylogroup, it is likely that the clone pairs resided in the two gut locations for a longer period of time, which provides an opportunity for the clone pairs to undergo genomic differences.

Mutations and horizontal gene transfer events are two modes of bacterial evolution.Citation39–41 A real-time evolutionary experiment carried out in mice showed that bacteriophage-mediated horizontal gene transfer exceeds mutations for rapid colonization of E. coli in the gut.Citation42 As we did not find that the clone pairs varied in plasmid content, virulence factors, or antimicrobial resistance profiles, it is unlikely that the clone pairs were significantly affected by horizontal gene transfer events, and they were more likely to acquire mutations while residing in the human gut. In one study, multiple clones of E. coli ED1a were isolated from fecal samples from a human at three time points within one year; there was little evidence of phage integration or gain or loss of plasmids, but there was mutational variation between clones.Citation19

The within-host evolution of E. coli has been reported previously.Citation18,Citation26,Citation27 The genomic variation in clone pairs varies between human and non-human-associated STs. Clone pairs of STs that are more likely to be human-associated, such as, ST73, ST95, and ST131, showed fewer variations, suggesting that they are better adapted to the human gut. Clone pairs belonging to ST80, ST372, ST569, and ST636, which are less likely to be reported in human hostsCitation43–45, showed greater variation. This suggests that these STs may have greater selective pressure to acquire more mutations to adapt to the human gut.Citation46,Citation47 However, it is difficult to ascertain a specific cause of this variation, as the human gut is a complex system and many mechanisms are simultaneously at play.Citation48 It should be acknowledged that we did not compare a single pair isolated from the same site. Such a comparison could further strengthen our evidence of the genomic variation of clone pairs.

We analyzed non-synonymous gene mutations, especially the fimbrial and non-fimbrial factors, to identify the genetic signatures shared by the clone pairs. We did not detect any genes common to either the terminal ileum or the rectum isolates. This suggests that most non-synonymous mutations were accumulated neutrally. In a one-year in vivo study, mice fed a western diet and a chow diet showed a convergence effect on the lactose operon of E. coli, but the authors did not detect any molecular signature specific to either of the diets.Citation49 The absence of a molecular signature was also observed in a longitudinal evolutionary study of an E. coli isolate that was dominant in the host for over one year.Citation19 Tissue tropism is a feature of several E. coli, namely EPEC, EHEC and ETEC.Citation50–54 ETEC causes diarrheal illnesses in infants, adults, and travelers, mainly in developing countries, and specifically colonizes and infects the small intestine.Citation31,Citation55 ETEC-associated adhesion or colonization factors include fibrillar proteins (CFA/1, CS1-CS7, CS14, CS17, CS21 and others)Citation31, and other non-fimbrial adhesins (outer membrane protein Tia, autotransporter protein TibA, glycoprotein EtpA, pore-forming transport protein EtpB, and putative glycosyl transferase EtpC).Citation56 We also found non-synonymous mutations associated with some fimbrial and membrane protein genes; however, they were not common to either the terminal ileum or rectum strains. The lack of a molecular marker that discriminates between the terminal ileum and rectum isolates may be due to the use of two genotypically similar B2 isolates that are resident colonizers of the gut. Alternatively, virulence factor genes alone may account for tissue tropism in pathogens, and the lack of virulence properties in commensal E. coli clone pairs meant that we did not observe a site-specific gene signature.Citation57,Citation58

Phenotypic characterization revealed the key observations that similar terminal ileum and rectum strains had different metabolic profiles. In more than 75% of the comparisons, the rectal strains showed higher metabolic activity than the terminal ileum strains. The double mucus layers that create a favorable bacterial niche in the colon may contribute to the rectal strains being better than the terminal ileum strains when using various carbon sources, such as d-fucose, and l-arginine.Citation59–62 The terminal ileum strains showed higher metabolic activity than the rectum strains for one carbon source, glucuronamide, which is a type of hexose related to glucuronic acid. Glucuronic acid is involved in the formation of glucuronides in the liver, which are then transferred to the intestinal tract via biliary secretion.Citation63,Citation64 Among the bacterial communities, the genus Escherichia was comparatively more common than the other members of Enterobacteriaceae in duodenal fluid and bile samples.Citation65 Enterohepatic circulation results in the uptake of bile salts in the distal small intestine, so that the colon is not exposed to bile salts. This may have resulted in the terminal ileum strains adapting to the utilization of glucuronamide better than the rectal strains.

The metabolic profile also showed that at least some STs (e.g., ST131 and ST569) may be better adapted to the rectum. For example, the clone pairs of ST131 showed metabolic variation in ten carbon sources and two chemicals; in all cases, the rectum strains showed higher metabolism than the terminal ileum strains. However, no metabolic signature was observed at the genomic level. This may be because the growth of clone pairs was more sensitive when grown in specific carbonaceous media (the observed variation in Biolog components), rather than simple lysogeny broth (LB) media, in which the strains were grown before DNA extraction for genome sequencing. Strains exhibiting greater metabolic potential may be important for strain colonization and adaptation in the intestine.Citation66 Strains belonging to ST131 may have greater metabolic potential than non-ST131 strainsCitation67, and a superior capacity to colonize the mouse gut.Citation68 However, no distinct metabolic properties of ST131 over non-ST131 have been reported in other studies.Citation69,Citation70 Nicolas-Chanoine et alCitation11 suggested that the higher metabolic potential of ST131 may make it a more successful colonizers of the gut, which is required prior to its uropathogenicity. In our study, we found that clone pairs of ST131 showed the most phenotypic variation. The consistently higher metabolism of the rectum strains compared to the terminal ileum strains of ST131 suggests that strains of this ST may have better colonization and survival properties in the distal part of the gut, such as the rectum.

A possible limitation of this study is the use of draft genomes for the clone pairs. For the best output, we could have used the complete genome sequences of our strains and compared the clone pairs directly against each other. However, we compared each draft genome with a reference genome for the variation analysis. This approach allowed us to avoid any possible influence on mutational variation that may appear due to the use of draft genomes for clone pairs. Although, there were fewer examples of molecular trait variations between clone pairs, we confirmed any discrepancies by checking their coverage in the applicable contig. Some of the genomic variations found in the terminal ileum and rectum strains may have occurred before colonization in the gut or may have evolved in the laboratory before genomic DNA extraction. Studies have suggested the evolution of E. coli especially under long-term in vitro experimental conditions.Citation71,Citation72 We did not have evidence to determine the pre-colonization mutational effects of clone pairs. However, there were only two sub-culturing steps before DNA isolation, so it is unlikely that the clone pairs would have been significantly affected by in vitro evolution.

In conclusion, this study suggests that similar E. coli isolates from different gut locations are not truly similar at the genomic or phenotypic level. We found that clone pairs belonging to STs more frequently isolated from the human gut had less genomic variation, suggesting that their evolutionary history makes them better adapted to the gut. The lack of specific genetic signatures between the terminal ileum and rectum strains suggests that the observed variations are not due to site-specificity but rather because of the neutral evolution of strains. While the use of two genetically similar strains may have prevented us from identifying site-specific gene signatures of clone pairs, this study provides evidence that clone pairs isolated from different gut locations are genetically divergent. The site-specificity of E. coli is better observed from phenotypic studies that showed differences between clone pairs isolated from different gut locations, including in the metabolism of different carbon sources and growth at different pH, suggesting that strains adapt to different gut regions. Overall, this study showed that a strain’s genomic and phenotypic properties can be influenced by gut location, suggesting E. coli may be site-specific or show a level of plasticity when colonizing different gut regions.

Materials and methods

Ethics approval

This study was approved by the ACT (Australian Capital Territory) Health Research Ethics Committee (ETH.5.07.464) and the ANU (Australian National University) Human Research Ethics Committee (2012/596), ACT, Australia.

Defining Escherichia coli clone pairs

In this study, a clone pair represents two E. coli isolates with the same MLVA gel electrophoresis profile belonging to the B2 phylogroup, where one isolate was obtained from the terminal ileum and the other from the rectum of the same individual (Figure S6). Given the dominance and colonization persistence of B2 phylogroup strains in the human gut, strain selection was restricted to B2 phylogroup strains.

Collection of clone pairs

A total of 37 clone pairs isolated from 34 individuals were selected for this study (Table S3). Individual gut mucosal biopsies were collected by the designated doctors at the Canberra Hospital, ACT, Australia. Individuals provided consent prior to specimen collection for this study. The individuals included both females (N = 17), and males (N = 17), ranging in age from 21 to 78 years old. Of the 37 clone pairs, 30 were collected in 2019 for this study, and seven were collected from a previous studyCitation5, in which strain collection was performed between 2010 and 2011.

Whole genome sequencing

DNA was isolated using an Isolate II Genomic DNA Kit (Bioline, Cat. No. BIO− 52067). DNA libraries were prepared using the Illumina Nextera DNA Flex Library Prep (96 samples: Illumina, Cat. No. 20018705) and IDT® for Illumina Nextera DNA Unique Dual Indexes Set A (96 indices, 96 samples: Illumina, Cat. No. 20027213), according to the manufacturer’s protocol. The library was sequenced using the MiSeq system (Illumina) along with the MiSeq Reagent Kit V3 (600 cycles, 300 bp paired end). The Biomolecular Resource Facility (BRF), Australian National University, prepared the library and performed the sequencing.

Microbial genome assembly and phylogroup confirmation

SPAdes in EnteroBase was used to assemble the sequenceCitation73 and assembled genomes were aligned to a reference genome (S88) in Mauve.Citation74 We confirmed the phylogroups of all strains using ClermonTyping.Citation75

Basic molecular profiling of clone pairs

The ST of each strain was determined using the MLST scheme of the EnteroBase (http://enterobase.warwick.ac.uk/) database.Citation76 For serotyping (O-, H- type) and fimbriae typing (fimH type), SeroTypeFinderCitation77 and FimTyperCitation78 on the Center for Genomic Epidemiology (CGE) were used (http://www.genomicepidemiology.org/), respectively. CGE’s ResFinder 4.1 toolCitation79 was used to detect the antimicrobial resistance genes, PlasmidFinder 2.1Citation80 to detect plasmids, and VirulenceFinder 2.0Citation81 for virulence factor profiling.

Genome annotation and reference strain selection

We annotated the assembled draft genomes in PATRIC (now BV-BRC) using RASTtk, available at https://www.patricbrc.org/app/Annotation.Citation82 We determined the ST of all strains using the MLST EnteroBase database. We identified B2 reference strains (complete genomes) specific to each ST in the National Center for Biotechnology Information (NCBI) database and downloaded their genomes, including plasmids, whenever applicable. For plasmid(s)-bearing genomes, we added the nucleotide sequences of plasmid(s) to the chromosomal nucleotide sequences. We then annotated all the downloaded reference strains in PATRIC. Phylogenetic analysis was conducted by comparing all the strains with the reference genomes. When we did not find a reference strain for an ST, we used the reference strain of a closely related ST. A list of the reference genomes for each ST is provided in Table S4.

Identification of mutations

We used the Variation Analysis tool in PATRIC to analyze mutational variation between the clone pairs.Citation83 We separately analyzed the terminal ileum and rectum strains of each clone pair against the reference strain of the same ST. The read files of a desired bacterial strain were uploaded in the “paired read library” and the genome of a reference strain was uploaded in the “target genome”. We selected “BWA-mem-strict” as the aligner and “Freebayes” as the SNP caller. We retrieved the “all.var.tsv” files, which contained variation information for each strain (terminal ileum/rectum strains) against the reference strain. For determining mutations between similar strains of each clone pair, the “all.var.tsv” files of the terminal ileum and rectum strains were joined together in JMP version 15 software. Then, the terminal ileum strain was compared with the rectum strain, and the contig and position for matched and non-matched mutations with the reference strain were obtained. Using the conditional edit formula, the matched mutations between terminal ileum and rectum strains against the reference strain were denoted as ‘0’ and non-matched were denoted as ‘1.’ We selected only the non-matched mutations as the expected variation between the clone pairs (similar terminal ileum and rectum strains). We then checked the quality scores of all respective mutations. We performed quality control of the sequence variations at greater scores. We removed the mutations that were below the 90% quantile and marked the remaining mutations between clone pairs. As PATRIC also provides annotation of the mutations, we determined the mutational types, snpEFF_type, snpEFF_impact, and non-synonymous associated gene functions.

Metabolic profiling of clone pairs

Biolog GENIII MicroPlates (CellBioscience, Cat. No. B 1030) were used to assess the ability of each strain to use 71 different carbon sources and their performance in 23 chemical-sensitivity reactions (Table S2). Freezer stocks were grown overnight on LB agar plates at 35°C. A single colony was picked and thoroughly mixed with the GENIII inoculation fluid (Cat. No. B72401). Then, 100 µL of the inoculated GENIII fluid was transferred to each well of a Biolog GENIII MicroPlate and incubated at 35°C for 24 h under static conditions. To measure the endpoint growth, turbidity and color intensity of the wells, the OD value was measured at 640 nm using a microplate reader (PowerWaveX 340, BIO-TEK Instruments Inc.). The assay was performed once for each strain.

Determination of maximum growth rate in different pH conditions

The strains were grown overnight in LB broth at 35°C with shaking at 150 rpm. Then, 5 µL of overnight culture was inoculated in 190 µL Davis Minimal Media at various pH levels (pH 4.10, 4.40, 4.80, 5.14, 5.50, 5.80, 7.20, and 7.60) in a polystyrene round-bottom 96-well plate (Greiner Bio-one). After inoculation, we inserted the plate into a microplate reader (PowerWaveX 340, BIO-TEK Instruments Inc.) and set the OD value at 600 nm. To maintain the exponential phase, we incubated the cells for 18 h at 35°C and set the reader in kinetic mode, so that the plate was vibrated (mixed) for 5-second before every 10-minute OD reading.

Statistical analysis

Statistical tests were performed using the software package JMP version 15 (SAS Institute Inc., 2019). The growth rate and maximum growth rate were calculated using Excel. Student’s t-test was used to compare the mean mutations between the two different STs. We used paired t-tests for determining mutational variation between clone pairs within a ST. Paired t-tests were used to compare means between the terminal ileum and rectum strains in phenotypic assays. Statistical significance was set at a 95% confidence interval (p ≤ 0.05).

Author contribution

RB collected specimens, designed, and conducted the experiments, performed formal analysis, interpreted the results, and wrote the original draft; PP conceptualized the study, acquired funding, and reviewed the manuscript; DMG conceptualized and designed the study, interpreted the results, administered the project, and reviewed the manuscript; COB conceptualized and designed the study, interpreted the results, acquired funding, administered the project, and reviewed the manuscript.

Supplemental material

Supplemental Material

Download MS Word (2.4 MB)

Acknowledgments

We thank the patients who provided specimens for this study. We are grateful to the doctors, nurses, and staff of the Gastroenterology and Hepatology Unit of Canberra Hospital, ACT, Australia.

BioRender tool (https://app.biorender.com/) was used for merging and/or indication of images.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The genome sequences of Escherichia coli strains used in this study were deposited and are available in EnteroBase (https://enterobase.warwick.ac.uk/).

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/19490976.2023.2223332

Correction Statement

This article has been corrected with minor changes. These changes do not impact the academic content of the article.

Additional information

Funding

Canberra Hospital Private Practice Trust Fund.

References

  • Blyton MDJ, Cornall SJ, Kennedy K, Colligon P, Gordon DM. Sex-dependent competitive dominance of phylogenetic group B2 Escherichia coli strains within human hosts. Environ Microbiol Rep. 2014;6(6):605–19. doi:10.1111/1758-2229.12168.
  • Smati M, Clermont O, Le Gal F, Schichmanoff O, Jauréguy F, Eddi A, Denamur E, Picard B. Real-time PCR for quantitative analysis of human commensal Escherichia coli populations reveals a high frequency of subdominant phylogroups. Appl Environ Microbiol. 2013;79(16):5005–5012. doi:10.1128/AEM.01423-13.
  • Martinson JNV, Pinkham NV, Peters GW, Cho H, Heng J, Rauch M, Broadaway SC, Walk ST. Rethinking gut microbiome residency and the Enterobacteriaceae in healthy human adults. Isme J. 2019;13(9):2306–2318. doi:10.1038/s41396-019-0435-7.
  • Lescat M, Clermont O, Woerther PL, Glodt J, Dion S, Skurnik D, Djossou F, Dupont C, Perroz G, Picard B, et al. Commensal Escherichia coli strains in Guiana reveal a high genetic diversity with host-dependant population structure. Environ Microbiol Rep. 2013;5(1):49–57. doi:10.1111/j.1758-2229.2012.00374.x.
  • Gordon DM, O’Brien CL, Pavli P. Escherichia coli diversity in the lower intestinal tract of humans. Environ Microbiol Rep. 2015;7(4):642–648. doi:10.1111/1758-2229.12300.
  • Denamur E, Clermont O, Bonacorsi S, Gordon D. The population genetics of pathogenic Escherichia coli. Nat Rev Microbiol. 2020;19(1):37–54. doi:10.1038/s41579-020-0416-x.
  • Tenaillon O, Skurnik D, Picard B, Denamur E. The population genetics of commensal Escherichia coli. Nat Rev Microbiol. 2010;8(3):207–217. doi:10.1038/nrmicro2298.
  • Skurnik D, Bonnet D, Bernède-Bauduin C, Michel R, Guette C, Becker J-M, Balaire C, Chau F, Mohler J, Jarlier V, et al. Characteristics of human intestinal Escherichia coli with changing environments. Environ Microbiol. 2008;10(8):2132–2137. doi:10.1111/j.1462-2920.2008.01636.x.
  • Escobar-Páramo P, Le Menac’h A, Le Gall T, Amorin C, Gouriou S, Picard B, Skurnik D, Denamur E. Identification of forces shaping the commensal Escherichia coli genetic structure by comparing animal and human isolates. Environ Microbiol. 2006;8(11):1975–1984. doi:10.1111/j.1462-2920.2006.01077.x.
  • Kallonen T, Brodrick HJ, Harris SR, Corander J, Brown NM, Martin V, Peacock SJ, Parkhill J. Systematic longitudinal survey of invasive Escherichia coli in England demonstrates a stable population structure only transiently disturbed by the emergence of ST131. Genome Res. 2017;27(8):1437–1449. doi:10.1101/gr.216606.116.
  • Nicolas-Chanoine M-H, Bertrand X, Madec J-Y. Escherichia coli ST131, an intriguing clonal group. Clin Microbiol Rev. 2014;27(3):543–574. doi:10.1128/CMR.00125-13.
  • Matsui Y, Hu Y, Rubin J, de Assis R.S, Suh J, Riley LW, Assis RS. Multilocus sequence typing of Escherichia coli isolates from urinary tract infection patients and from fecal samples of healthy subjects in a college community. MicrobiologyOpen. 2020;9(6):1225–1233. doi:10.1002/mbo3.1032.
  • Manges AR, Harel J, Masson L, Edens TJ, Portt A, Reid-Smith RJ, Zhanel GG, Kropinski AM, Boerlin P. Multilocus sequence typing and virulence gene profiles associated with Escherichia coli from human and animal sources. Foodborne Pathog Dis. 2015;12(4):302–310. doi:10.1089/fpd.2014.1860.
  • Nowrouzian FL, Adlerberth I, Wold AE. Enhanced persistence in the colonic microbiota of Escherichia coli strains belonging to phylogenetic group B2: role of virulence factors and adherence to colonic cells. Microbes Infect. 2006;8(3):834–840. doi:10.1016/j.micinf.2005.10.011.
  • Hillman ET, Lu H, Yao T, Nakatsu CH. Microbial ecology along the gastrointestinal tract. Microbes Environ. 2017;32(4):300–313. doi:10.1264/jsme2.ME17017.
  • Hounnou G, Destrieux C, Desmé J, Bertrand P, Velut S. Anatomical study of the length of the human intestine. Surg Radiol Anat. 2002;24(5):290–294. doi:10.1007/s00276-002-0057-y.
  • Vertzoni M, Augustijns P, Grimm M, Koziolek M, Lemmens G, Parrott N, Pentafragka C, Reppas C, Rubbens J, Van Den Αbeele J, et al. Impact of regional differences along the gastrointestinal tract of healthy adults on oral drug absorption: an UNGAP review. Eur J Pharm Sci. 2019;134:153–175. doi:10.1016/j.ejps.2019.04.013.
  • Dixit OVA, O’Brien CL, Pavli P, Gordon DM. Within-host evolution versus immigration as a determinant of Escherichia coli diversity in the human gastrointestinal tract. Environ Microbiol. 2018;20(3):993–1001. doi:10.1111/1462-2920.14028.
  • Ghalayini M, Launay A, Bridier-Nahmias A, Clermont O, Denamur E, Lescat M, Tenaillon O. Evolution of a dominant natural isolate of Escherichia coli in the human gut over the course of a year suggests a neutral evolution with reduced effective population size. Appl Environ Microb. 2018;84:e02377–17. doi:10.1128/AEM.02377-17.
  • Lescat M, Launay A, Ghalayini M, Magnan M, Glodt J, Pintard C, Dion S, Denamur E, Tenaillon O. Using long-term experimental evolution to uncover the patterns and determinants of molecular evolution of an Escherichia coli natural isolate in the streptomycin-treated mouse gut. Mol Ecol. 2017;26(7):1802–1817. doi:10.1111/mec.13851.
  • Lenski RE. Experimental evolution and the dynamics of adaptation and genome evolution in microbial populations. Isme J. 2017;11(10):2181–2194. doi:10.1038/ismej.2017.69.
  • Maharjan RP, Ferenci T, Gore J. A shifting mutational landscape in 6 nutritional states: stress-induced mutagenesis as a series of distinct stress input–mutation output relationships. PLoS Biol. 2017;15(6):e2001477. doi:10.1371/journal.pbio.2001477.
  • Bridier-Nahmias A, Launay A, Bleibtreu A, Magnan M, Walewski V, Chatel J, Dion S, Robbe-Saule V, Clermont O, Norel F, et al. Escherichia coli genomic diversity within extraintestinal acute infections argues for adaptive evolution at play. mSphere. 2021;6(1):e01176–20. doi:10.1128/mSphere.01176-20.
  • Barroso-Batista J, Demengeot J, Gordo I. Adaptive immunity increases the pace and predictability of evolutionary change in commensal gut bacteria. Nat Commun. 2015;6(1):8945. doi:10.1038/ncomms9945.
  • Clermont O, Lescat M, O’Brien CL, Gordon DM, Tenaillon O, Denamur E. Evidence for a human-specific Escherichia coli clone. Environ Microbiol. 2008;10(4):1000–1006. doi:10.1111/j.1462-2920.2007.01520.x.
  • Johnson JR, Davis G, Clabots C, Johnston BD, Porter S, DebRoy C, Pomputius W, Ender PT, Cooperstock M, Slater BS, et al. Household Clustering of Escherichia coli sequence type 131 clinical and fecal isolates according to whole genome sequence analysis. Open Forum Infect Dis. 2016;3(3):ofw129. doi:10.1093/ofid/ofw129.
  • Lee SM, Wyse A, Lesher A, Everett ML, Lou L, Holzknecht ZE, Whitesides JF, Spears PA, Bowles DE, Lin SS, et al. Adaptation in a mouse colony monoassociated with Escherichia coli K-12 for more than 1,000 days. Appl Environ Microbiol. 2010;76(14):4655–4663. doi:10.1128/AEM.00358-10.
  • Rice DH, Sheng HQ, Wynia SA, Hovde CJ. Rectoanal mucosal swab culture is more sensitive than fecal culture and distinguishes Escherichia coli O157: h7-colonized cattle and those transiently shedding the same organism. J Clin Microbiol. 2003;41(11):4924–4929. doi:10.1128/JCM.41.11.4924-4929.2003.
  • Evans DJ Jr., Evans DG. Escherichia Coli in Diarrheal disease. In: Baron S, editor Medical Microbiology. Galveston (TX): University of Texas Medical Branch at Galveston, Copyright © 1996, The University of Texas Medical Branch at Galveston. 1996. p. 251.
  • Naylor SW, Low JC, Besser TE, Mahajan A, Gunn GJ, Pearce MC, McKendrick IJ, Smith DGE, Gally DL. Lymphoid follicle-dense mucosa at the terminal rectum is the principal site of colonization of enterohemorrhagic Escherichia coli O157: h7 in the bovine host. Infect Immun. 2003;71(3):1505–1512. doi:10.1128/IAI.71.3.1505-1512.2003.
  • Qadri F, Svennerholm A-M, Faruque ASG, Sack RB. Enterotoxigenic Escherichia coli in developing countries: epidemiology, microbiology, clinical features, treatment, and prevention. Clin Microbiol Rev. 2005;18(3):465–483. doi:10.1128/CMR.18.3.465-483.2005.
  • Dixit SM, Gordon DM, Wu XY, Chapman T, Kailasapathy K, Chin JJ. Diversity analysis of commensal porcine Escherichia coli – associations between genotypes and habitat in the porcine gastrointestinal tract. Microbiol. 2004;150(6):1735–1740. doi:10.1099/mic.0.26733-0.
  • Abraham S, Gordon DM, Chin J, Brouwers HJ, Njuguna P, Groves MD, Zhang R, Chapman TA. Molecular characterization of commensal Escherichia coli adapted to different compartments of the porcine gastrointestinal tract. Appl Environ Microbiol. 2012;78(19):6799–6803. doi:10.1128/AEM.01688-12.
  • Johnson JR, Murray AC, Gajewski A, Sullivan M, Snippes P, Kuskowski MA, Smith KE. Isolation and molecular characterization of nalidixic acid-resistant extraintestinal pathogenic Escherichia coli from retail chicken products. Antimicrob Agents Chemother. 2003;47(7):2161–2168. doi:10.1128/AAC.47.7.2161-2168.2003.
  • Johnson JR, Johnston BD, Porter S, Thuras P, Aziz M, Price LB. Accessory traits and phylogenetic background predict Escherichia coli extraintestinal virulence better than does ecological source. J Infect Dis. 2018;219:121–132. doi:10.1093/infdis/jiy459.
  • Kaper JB, Nataro JP, Mobley HLT. Pathogenic Escherichia coli. Nat Rev Microbiol. 2004;2(2):123–140. doi:10.1038/nrmicro818.
  • Sarowska J, Futoma-Koloch B, Jama-Kmiecik A, Frej-Madrzak M, Ksiazczyk M, Bugla-Ploskonska G, Choroszy-Krol I. Virulence factors, prevalence and potential transmission of extraintestinal pathogenic Escherichia coli isolated from different sources: recent reports. Gut Pathog. 2019;11(1):10. doi:10.1186/s13099-019-0290-0.
  • Spurbeck RR, Dinh PC Jr., Walk ST, Stapleton AE, Hooton TM, Nolan LK, Kim KS, Johnson JR, Mobley HLT. Escherichia coli isolates that carry vat, fyuA, chuA, and yfcV efficiently colonize the urinary tract. Infect Immun. 2012;80(12):4115–4122. doi:10.1128/IAI.00752-12.
  • Hershberg R. Mutation—the engine of evolution: studying mutation and its role in the evolution of bacteria: figure 1. Cold Spring Harb Perspect Biol. 2015;7(9):a018077–a. doi:10.1101/cshperspect.a018077.
  • Brito IL. Examining horizontal gene transfer in microbial communities. Nat Rev Microbiol. 2021;19(7):442–453. doi:10.1038/s41579-021-00534-7.
  • Culyba MJ, Van Tyne D, Hiller NL. Bacterial evolution during human infection: adapt and live or adapt and die. PLoS Pathog. 2021;17(9):e1009872. doi:10.1371/journal.ppat.1009872.
  • Frazão N, Sousa A, Lässig M, Gordo I Horizontal gene transfer overrides mutation in Escherichia coli colonizing the mammalian gut. Proceedings of the National Academy of Sciences 2019;116:17906–17915.
  • Kidsley AK, O’Dea M, Saputra S, Jordan D, Johnson JR, Gordon DM, Turni C, Djordjevic SP, Abraham S, Trott DJ, et al. Genomic analysis of phylogenetic group B2 extraintestinal pathogenic E. coli causing infections in dogs in Australia. Vet Microbiol. 2020;248:108783. doi:10.1016/j.vetmic.2020.108783.
  • Baniga Z, Hounmanou YMG, Kudirkiene E, Kusiluka LJM, Mdegela RH, Dalsgaard A. Genome-based analysis of extended-spectrum β-lactamase-producing Escherichia coli in the aquatic environment and Nile Perch (lates niloticus) of Lake Victoria, Tanzania. Front Microbiol. 2020;11:11. doi:10.3389/fmicb.2020.00108.
  • Valat C, Drapeau A, Beurlet S, Bachy V, Boulouis HJ, Pin R, Cazeau G, Madec J-Y, Haenni M. Pathogenic Escherichia coli in dogs reveals the predominance of ST372 and the human-associated ST73 extra-intestinal lineages. Front Microbiol. 2020;11:580. doi:10.3389/fmicb.2020.00580.
  • Foster PL. Adaptive mutation: implications for evolution. BioEssays: news and reviews in molecular, cellular and developmental biology. BioEssays. 2000;22(12):1067–1074. doi:10.1002/1521-1878(200012)22:12<1067:AID-BIES4>3.0.CO;2-Q.
  • Swings T, Van den Bergh B, Wuyts S, Oeyen E, Voordeckers K, Verstrepen KJ, Fauvart M, Verstraeten N, Michiels J. Adaptive tuning of mutation rates allows fast response to lethal stress in Escherichia coli. eLife. 2017;6:e22939. doi:10.7554/eLife.22939.
  • Gordo I, Demengeot J, Xavier K. Escherichia coli adaptation to the gut environment: a constant fight for survival. Future Microbiol. 2014;9(11):1235–1238. doi:10.2217/fmb.14.86.
  • Ghalayini M, Magnan M, Dion S, Zatout O, Bourguignon L, Tenaillon O, Lescat M. Long-term evolution of the natural isolate of Escherichia coli 536 in the mouse gut colonized after maternal transmission reveals convergence in the constitutive expression of the lactose operon. Mol Ecol. 2019;28(19):4470–4485. doi:10.1111/mec.15232.
  • Fitzhenry RJ, Reece S, Trabulsi LR, Heuschkel R, Murch S, Thomson M, Frankel G, Phillips AD. Tissue tropism of enteropathogenic Escherichia coli strains belonging to the O55 serogroup. Infect Immun. 2002;70(8):4362–4368. doi:10.1128/IAI.70.8.4362-4368.2002.
  • Aktan İ, Ragione RML, Woodward MJ. Colonization, persistence, and tissue tropism of Escherichia coli O26 in conventionally reared weaned lambs. Appl Environ Microb. 2007;73:691–698. doi:10.1128/AEM.01879-06.
  • Chong Y, Fitzhenry R, Heuschkel R, Torrente F, Frankel G, Phillips AD. Human intestinal tissue tropism in Escherichia coli O157: h7 – initial colonization of terminal ileum and Peyer’s patches and minimal colonic adhesion ex vivo. Microbiol. 2007;153(3):794–802. doi:10.1099/mic.0.2006/003178-0.
  • Sperandio V, Nguyen Y. Enterohemorrhagic E. coli (EHEC) pathogenesis. Front Cell Infect Microbiol. 2012;2. doi:10.3389/fcimb.2012.00090.
  • McCall L-I, Siqueira-Neto JL, McKerrow JH, Knoll LJ. Location, location, location: five facts about tissue tropism and pathogenesis. PLoS Pathog. 2016;12(5):e1005519–e. doi:10.1371/journal.ppat.1005519.
  • Gonzales-Siles L, Sjöling Å. The different ecological niches of enterotoxigenic Escherichia coli. Environ Microbiol. 2016;18(3):741–751. doi:10.1111/1462-2920.13106.
  • Fleckenstein JM, Hardwidge PR, Munson GP, Rasko DA, Sommerfelt H, Steinsland H. Molecular mechanisms of enterotoxigenic Escherichia coli infection. Microbes Infect. 2010;12(2):89–98. doi:10.1016/j.micinf.2009.10.002.
  • Le Gall T, Clermont O, Gouriou S, Picard B, Nassif X, Denamur E, Tenaillon O. Extraintestinal virulence is a coincidental by-product of commensalism in b2 phylogenetic group Escherichia coli strains. Mol Biol Evol. 2007;24(11):2373–2384. doi:10.1093/molbev/msm172.
  • Riley LW, Blanton RE. Distinguishing pathovars from nonpathovars: Escherichia coli*. Microbiol Spectr. 2020;8(4):8. doi:10.1128/microbiolspec.AME-0014-2020.
  • Schroeder BO. Fight them or feed them: how the intestinal mucus layer manages the gut microbiota. Gastroenterol Rep. 2019;7(1):3–12. doi:10.1093/gastro/goy052.
  • Pelaseyed T, Bergström JH, Gustafsson JK, Ermund A, Birchenough GMH, Schütte A, van der Post S, Svensson F, Rodríguez-Piñeiro AM, Nyström EEL, et al. The mucus and mucins of the goblet cells and enterocytes provide the first defense line of the gastrointestinal tract and interact with the immune system. Immunol Rev. 2014;260(1):8–20. doi:10.1111/imr.12182.
  • Johansson MEV, Sjövall H, Hansson GC. The gastrointestinal mucus system in health and disease. Nat Rev Gastro Hepat. 2013;10(6):352–361. doi:10.1038/nrgastro.2013.35.
  • Arnoldini M, Cremer J, Hwa T. Bacterial growth, flow, and mixing shape human gut microbiota density and composition. Gut Microbes. 2018;9:559–566. doi:10.1080/19490976.2018.1448741.
  • Pellock SJ, Redinbo MR. Glucuronides in the gut: sugar-driven symbioses between microbe and host. J Biol Chem. 2017;292(21):8569–8576. doi:10.1074/jbc.R116.767434.
  • Xavier NM, Fortuna A. Synthesis and biological properties of d-glucuronamide-containing compounds. Reference Module Chem, Molecular Sci Chem Eng. 2019; Elsevier.
  • Ye F, Shen H, Li Z, Meng F, Li L, Yang J, Chen Y, Bo X, Zhang X, Ni M, et al. Influence of the biliary system on biliary bacteria revealed by bacterial communities of the human biliary and upper digestive tracts. PLos One. 2016;11(3):e0150519. doi:10.1371/journal.pone.0150519.
  • Le Bouguénec C, Schouler C. Sugar metabolism, an additional virulence factor in enterobacteria. Int J Med Microbiol. 2011;301(1):1–6. doi:10.1016/j.ijmm.2010.04.021.
  • Gibreel TM, Dodgson AR, Cheesbrough J, Bolton FJ, Fox AJ, Upton M. High metabolic potential may contribute to the success of ST131 uropathogenic Escherichia coli. J Clin Microbiol. 2012;50(10):3202–3207. doi:10.1128/JCM.01423-12.
  • Vimont S, Boyd A, Bleibtreu A, Bens M, Goujon J-M, Garry L, Clermont O, Denamur E, Arlet G, Vandewalle A, et al. The CTX-M-15-producing Escherichia coli clone O25b: h4-ST131 has high intestine colonization and urinary tract infection abilities. PLos One. 2012;7(9):e46547. doi:10.1371/journal.pone.0046547.
  • Alqasim A, Emes R, Clark G, Newcombe J, La Ragione R, McNally A, Ahmed N. Phenotypic microarrays suggest Escherichia coli ST131 is not a metabolically distinct lineage of extra-intestinal pathogenic E. coli. PLos One. 2014;9(2):e88374. doi:10.1371/journal.pone.0088374.
  • Alqasim A, Jaffal AA, Almutairi N, Alyousef AA. Comparative phenotypic characterization identifies few differences in the metabolic capacity between Escherichia coli ST131 subclones. Saudi J Biol Sci. 2021;28(1):762–769. doi:10.1016/j.sjbs.2020.11.008.
  • Lenski RE, Rose MR, Simpson SC, Tadler SC. Long-term experimental evolution in Escherichia coli. i. adaptation and divergence during 2,000 generations. Am Nat. 1991;138(6):1315–1341. doi:10.1086/285289.
  • Wiser MJ, Ribeck N, Lenski RE. Long-term dynamics of adaptation in asexual populations. Science. 2013;342(6164):1364–1367. doi:10.1126/science.1243357.
  • Nurk S, Bankevich A, Antipov D, Gurevich A, Korobeynikov A, Lapidus A, Prjibelsky A, Pyshkin A, Sirotkin A, Sirotkin Y, et al. Assembling genomes and mini-metagenomes from highly chimeric reads. Berlin: Springer Berlin Heidelberg Berlin Heidelberg; 2013. pp. 158–170. 10.1007/978-3-642-37195-0_13.
  • Darling AC, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14(7):1394–1403. doi:10.1101/gr.2289704.
  • Beghain J, Bridier-Nahmias A, Le Nagard H, Denamur E, Clermont O. ClermonTyping: an easy-to-use and accurate in silico method for Escherichia genus strain phylotyping. Microb Genom. 2018;4(7):e000192. doi:10.1099/mgen.0.000192.
  • Wirth T, Falush D, Lan R, Colles F, Mensa P, Wieler LH, Karch H, Reeves PR, Maiden MCJ, Ochman H, et al. Sex and virulence in Escherichia coli: an evolutionary perspective. Mol Microbiol. 2006;60(5):1136–1151. doi:10.1111/j.1365-2958.2006.05172.x.
  • Joensen KG, Tetzschner AMM, Iguchi A, Aarestrup FM, Scheutz F. Rapid and easy in silico serotyping of Escherichia coli isolates by use of whole-genome sequencing data. J Clin Microbiol. 2015;53:2410–2426.
  • Roer L, Tchesnokova V, Allesøe R, Muradova M, Chattopadhyay S, Ahrenfeldt J, Thomsen MCF, Lund O, Hansen F, Hammerum AM, et al. Development of a web tool for Escherichia coli subtyping based on fimH Alleles. J Clin Microbiol. 2017;55(8):2538–2543. doi:10.1128/JCM.00737-17.
  • Bortolaia V, Kaas RS, Ruppe E, Roberts MC, Schwarz S, Cattoir V, Philippon A, Allesoe RL, Rebelo AR, Florensa AF, et al. ResFinder 4.0 for predictions of phenotypes from genotypes. J Antimicrob Chemother. 2020;75(12):3491–3500. doi:10.1093/jac/dkaa345.
  • Carattoli A, Zankari E, García-Fernández A, Voldby Larsen M, Lund O, Villa L, Møller Aarestrup F, Hasman H. In silico detection and typing of plasmids using PlasmidFinder and plasmid multilocus sequence typing. Antimicrob Agents Chemother. 2014;58(7):3895–3903. doi:10.1128/AAC.02412-14.
  • Joensen KG, Scheutz F, Lund O, Hasman H, Kaas RS, Nielsen EM, Aarestrup FM. Real-time whole-genome sequencing for routine typing, surveillance, and outbreak detection of verotoxigenic Escherichia coli. J Clin Microbiol. 2014;52(5):1501–1510. doi:10.1128/JCM.03617-13.
  • Brettin T, Davis JJ, Disz T, Edwards RA, Gerdes S, Olsen GJ, Olson R, Overbeek R, Parrello B, Pusch GD, et al. Rasttk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci Rep. 2015;5(1):8365–. doi:10.1038/srep08365.
  • Davis JJ, Wattam AR, Aziz RK, Brettin T, Butler R, Butler RM, Chlenski P, Conrad N, Dickerman A, Dietrich EM, et al. The PATRIC Bioinformatics Resource Center: expanding data and analysis capabilities. Nucleic Acids Res. 2020;48:D606–D12. doi:10.1093/nar/gkz943.