1,808
Views
2
CrossRef citations to date
0
Altmetric
Research Article

Genomic dissection of the microevolution of Australian epidemic Bordetella pertussis

, , , , , , , , & show all
Pages 1460-1473 | Received 05 Jan 2022, Accepted 09 May 2022, Published online: 01 Jun 2022

ABSTRACT

Whooping cough (pertussis) is a highly contagious respiratory disease caused by the bacterium Bordetella pertussis. Despite high vaccine coverage, pertussis has re-emerged in many countries including Australia and caused two large epidemics in Australia since 2007. Here, we undertook a genomic and phylogeographic study of 385 Australian B. pertussis isolates collected from 2008 to 2017. The Australian B. pertussis population was found to be composed of mostly ptxP3 strains carrying different fim3 alleles, with ptxP3-fim3A genotype expanding far more than ptxP3-fim3B. Within the former, there were six co-circulating epidemic lineages (EL1 to EL6). The multiple ELs emerged, expanded, and then declined at different time points over the two epidemics. In population genetics terms, both hard and soft selective sweeps through vaccine selection pressures have determined the population dynamics of Australian B. pertussis. Relative risk estimation suggests that once a new B. pertussis lineage emerged, it was more likely to spread locally within the first 1.5 years. However, after 1.5 years, any new lineage was likely to expand to a wider region. Phylogenetic analysis revealed the expansion of ptxP3 strains was also associated with replacement of the type III secretion system allele bscI1 with bscI3. bscI3 is associated with decreased T3SS secretion and may allow B. pertussis to reduce immune recognition. This study advanced our understanding of the epidemic population structure and spatial and temporal dynamics of B. pertussis in a highly immunized population.

Introduction

Bordetella pertussis is the causative agent of the contagious respiratory disease, whooping cough [Citation1]. Although the introduction of whole-cell vaccines (WCVs) has achieved a successful reduction in B. pertussis incidence, pertussis re-emerged in the late 1990s in many developed countries after the acellular vaccines (ACVs) replaced the WCV [Citation2–4]. Based on the World Health Organisation (WHO) estimates, there were 24.1 million pertussis cases in 2014 with 160,700 deaths in children less than 5-year-old, despite around 86% immunization coverage with pertussis vaccines globally [Citation5]. In Australia, pertussis epidemics have been occurring cyclically every three to five years. The largest pertussis epidemic started in 2007 and peaked with nearly 40,000 cases reported in 2011 [Citation6], followed by a smaller epidemic from 2013 to 2017.

Several factors are thought to have contributed to the re-emergence of pertussis. The adaptation to vaccine-induced immunity is believed to be an important factor for pertussis re-emergence since genotypic and phenotypic divergences between vaccine strains and circulating strains of B. pertussis, especially with respect to genes encoding vaccine antigens, have been reported globally [Citation4,Citation7–11]. For example, pertactin (Prn) is one of the antigens included in ACVs and strains which do not produce pertactin have emerged recently and became predominant in some countries [Citation12,Citation13]. A mouse infection study compared a Prn-negative isolate with a Prn-positive wild type isolate and showed that the former was more competitive in colonizing ACV vaccinated mice [Citation14]. Another example of vaccine-induced genetic adaption is the emergence and spread of strains carrying the ptxP3 allele across the globe in recent years [Citation4,Citation8]. The ptxP3 strains were found to produce a higher amount of pertussis toxin than ptxP1 strains suggesting higher virulence of the ptxP3 strains [Citation8]. Higher virulence of the B. pertussis strains may have evolved under the selective pressure from vaccines, similar to that observed in Haemophilus influenzae Serotype b strains [Citation15].

There are two main ACVs consisting of three and five components [Citation16]. The three component ACV consisted of Prn, pertussis toxin and filamentous haemagglutinin whereas the five component ACV additionally contained fimbriae Fim2 and Fim3. Australia predominantly uses the three component ACV [Citation6]. Variations in ACV antigen genes apart from the two key changes discussed above have been observed and have been associated with vaccine selection [Citation6]. For the two fimbrial subunits, Fim2 and Fim3, they are subject to phase variation [Citation17]. Most B. pertussis strains produce either Fim2, Fim3 or both. Fim serotypes have changed over time globally and the changes may or may not be vaccine-driven [Citation18]. Dissecting the selection pressures and evolutionary forces in the genetic and phenotypic changes in vaccine antigen genes is often difficult.

Variations in vaccine-related genes also reflected the changes in the population structure of B. pertussis. Comparative genomic studies of B. pertussis have been conducted to investigate the evolutionary changes contributing to their resurgence. Genome-wide single nucleotide polymorphisms (SNPs) typing was able to divide over 300 global B. pertussis isolates into six SNPs clusters (Cluster I to VI) [Citation7]. This, and other SNP typing studies provided phylogenetic evidence supporting the hypothesis that B. pertussis has evolved under vaccine selective pressure [Citation4,Citation7]. Subsequently, the emergence of the Cluster I lineage was found to be associated with the emergence of the ptxP3 allele [Citation9]. Genomic sequencing and phylogenetic analysis of isolates from the 2008–2012 Australian epidemic further defined five epidemic lineages (ELs) within Cluster I in Australia and the parallel evolution of Prn deficiency in different co-circulating lineages [Citation19]. Prn-negative isolates from four of these five ELs continued to circulate in the 2013–2017 Australian pertussis epidemic [Citation13]. Although genetic variation in these isolates has been studied, the temporal and geospatial dynamics of B. pertussis population structure changes and evolutionary forces behind have not been elucidated yet.

In addition to vaccine selection, comparative proteomic studies between a Cluster I and a Cluster II strain identified decreased secretion of the type III secretion system (T3SS) in the Cluster I strain [Citation20–22]. This decreased secretion was hypothesized to be associated with a mutation in bscI, which encodes the T3SS basal body protein. A mouse infection study with Cluster I and Cluster II strains showed that Cluster I was fitter in both ACV-immunized and unimmunized mice suggesting additional selection pressures [Citation23].

In this study, we used a large set of B. pertussis isolates to elucidate the population structure of the pathogen and the factors influencing its evolution during the epidemics in Australia. The fine-scale phylogenetic analyses were integrated with temporal and spatial metadata to examine how B. pertussis populations varied over time and space.

Materials and methods

Bacterial isolates and genomic DNA preparation

The 385 B. pertussis isolates used in this study were sampled from pathology laboratories servicing patients across different states in Australia. Among them, five isolates from 1997 to 2002 were included to represent strains circulating before the recent epidemics, 302 isolates were sampled from the 2008–2012 epidemic across five states: New South Wales (NSW), Western Australia (WA), Queensland (QLD), Victoria (VIC) and South Australia (SA) and 78 isolates were collected from the 2013–2017 epidemic from two states, WA and NSW, as the other states ceased culturing B. pertussis (Supplementary Table S1). Due to the poor sequence quality of four isolates from a previous study [Citation19], these isolates were re-sequenced in this study. Overall, a total of 284 isolates were sequenced in this study. The number of isolates from different states did not reflect the total pertussis incidence of the states. WA has a much smaller population than NSW but more isolates were collected from WA. All available clinical isolates from 2008 to 2017 were included. The number of isolates obtained was relatively small in comparison to the number of cases reported over the study period because pertussis culture had been replaced by direct PCR testing.

Isolates were revived from cryovials stored in freezers at −80°C and inoculated on Bordet-Gengou agar (Becton Dickinson, Sparks, MD, USA, supplemented with 7% horse blood) and aerobically incubated at 37°C for 4–5 days. Pure colonies were harvested for genomic DNA extraction using the phenol–chloroform method [Citation24].

Genome sequencing and data accessibility

Sequencing libraries were constructed using Nextera XT kit (Illumina). The fragment size distribution of the libraries was analysed using LabChip GX high Sensitivity DNA assay kit (Caliper Life Science, Hopkinton, MA). Whole-genome sequencing (WGS) was performed using Illumina NextSeq (2 × 150 bp) and MiSeq (2 × 300 bp) platform. Raw reads have been submitted to the NCBI GenBank database under the BioProject accession number PRJNA562796. All isolates were sequenced at the Marshall Centre for Infectious Diseases Research and Training (WA, Australia).

Detection of SNPs and phylogenetic analysis

De novo assembly was performed using SPAdes (version 3.14.1) [Citation25]. SNPs were identified using a combination of mapping to the B. pertussis reference strain Tohama I (NC_002929.2) using Burrow-Wheeler Alignment (BWA) tool (version 0.7.12) [Citation26], SAMtools (version 0.1.19) [Citation27] and alignment of SPAdes assembled genome to reference Tohama I using progressiveMauve (version snapshot_2015_02-25) [Citation28]. Only SNPs identified consistently by both contig-alignment and mapping methods were used for the downstream analyses. Phylogenetic trees were constructed using RAxML (version 8.2.12) [Citation29] with the complete genome of B. pertussis Tohama I as reference and outgroup, applying parameters -m GTRGAMMA and 1000 bootstrap samples. Determination of IS481 and IS1002 insertions was performed using a custom script as previously described in Safarchi et al. [Citation19] and Octavia et al. [Citation7].

The phylogenetic tree of international B. pertussis strains was constructed using 1452 available B. pertussis genomes to locate the spread of ELs. The phylogenetic tree was constructed using Quicktree pipeline (version 1.1) with default settings, in which -gtr -gamma model was used for tree construction with 100 bootstrap replicates [Citation30].

BEAST analysis

Divergence time of the nodes on the tree was estimated using the BEAST package (version 1.10.4). Considering that the sampling and sequencing of isolates may not cover all features for the whole epidemic in Australia, a fixed evolutionary model chosen for BEAST may not reflect the truth. However, a comprehensive modelling for estimating the population structure of B. pertussis epidemic using sampled genomes is not in the scope of this study. Thus, the model was chosen by statistical evaluation using only the sequenced data, following the strategy described in [Citation31], in which a total number of 24 models (four molecular clock models and six population size models) were set up and run independently using the same data with same MCMC parameters (1,000,000 chains excluding the first 10,000 chains from the posterior distribution as burn-in), and then Effective Sample Size (ESS) value of each model was calculated using Tracer (version 1.7.1). The strict clock with constant population size model was found to have the highest ESS value (ESS = 339, the only model combination with ESS over 200) and was thus selected. The final tree with divergence dates was annotated by TreeAnnotator within the BEAST package.

Lineage through time (LTT)

The lineage through time (LTT) was performed following Reznick et al. [Citation32]. Briefly, LTT refers to the total number of branches at a certain time point from the root to the top of the lineage. In this study, the divergence time of the nodes on the tree was extracted from the BEAST estimation as in . For a specific lineage, logarithm values of total number of branches were calculated at time points from the root to the top of the lineage. LTT curves were then generated using a custom script. The calculation was conducted using phytools package in R (version 3.5.1)[Citation33].

Figure 1. Phylogenetic relationship of Australian epidemic lineages (ELs). A phylogenetic tree of 385 B. pertussis isolates was constructed using the maximum likelihood method. The branch length was scaled by divergence time as generated by BEAST. The tree was rooted using the B. pertussis reference genome Tohama I as an outgroup. Three major lineages were coloured at the branch. The coloured lines and columns from left to right Prn presence or absence, ptxP allele types, the geographic source of isolates and period of isolation.

Figure 1. Phylogenetic relationship of Australian epidemic lineages (ELs). A phylogenetic tree of 385 B. pertussis isolates was constructed using the maximum likelihood method. The branch length was scaled by divergence time as generated by BEAST. The tree was rooted using the B. pertussis reference genome Tohama I as an outgroup. Three major lineages were coloured at the branch. The coloured lines and columns from left to right Prn presence or absence, ptxP allele types, the geographic source of isolates and period of isolation.

Relative risk assessment within and between jurisdictions

First, the probability was estimated that a pair of cases in geographically defined areas is within the same cluster of transmissions. Then relative risk was computed for the disease transmission as a function of geographic proximity, by comparing probabilities restricted in different ways as the same method shown in [Citation34]. Specifically, the analysis looks at relative risk of infection at different spatial scales within a state and between states.

The calculation of relative risk is based on the following notations. Define Dij to be the geographic distance between isolates i and j and let si and sj be the states associated with the isolates. Let Tij be the estimated time of the most recent common ancestor of isolates i and j. Let n be the total number of isolates being considered. The probability that a pair of geographically defined cases is within the same genetic cluster according to the specified time band is estimated with (1) θˆ(d1,d2,t1,t2)=i=1njinI(d1Dij<d2t1Tij<t2)i=1njinI(d1Dij<d2)(1) where the sum j1n is over the values of j between i and n except where i = j, and where I(x) is the indicator function which evaluates to 1 if x is true and 0 otherwise.

The relative risk is then estimated by (2) τˆ(d1,d2,t1,t2)=θˆ(d1,d2,t1,t2)θˆ(0,,t1,t2)(2) Based on the monthly number of pertussis notifications in Australia from January 1991 until December 2018 from the NNDSS (monthly pertussis notifications were standardized to a 30-day month [Citation35]), four cycle length interval (1.5, 3.5, 8 and 16 years) were generated by using the Fast Fourier Transform method [Citation35]. The geographical distance between isolates were mainly divided into five ranges: 0–1 km (same postcode area), 1–20 km (the Perth city area), 20–100 km (average distance to surrounding suburbs of Perth city), 100–260 km (cover the most of suburbs nearby) and >260 km. The > 260 km distance was used as the reference point, because very few pairs of isolates fell into this range of distance in WA.

The probability that a pair of interstate isolates, of which one belongs to state s, are genetically related under the t1, t2 time range for the time to most recent common ancestor (tMRCA) is (3) θ~(s,t1,t2)=i=1njinI(si=ssjst1Tij<t2)i=1njinI(si=ssjs)(3) which considers all pairs of isolates in which one is in state s and the other is outside s. For the relative risk the reference point is the probability that any two interstate pair of isolates is closely related: (4) θ~(t1,t2)=i=1njinI(sisjt1Tij<t2)i=1njinI(sisj)(4)

Thus, the relative risk (of interstate transmission for a particular state compared to all states) is (5) τˆ(s,t1,t2)=θˆ(s,t1,t2)θˆ(t1,t2)(5) To estimate the uncertainty of the relative risk, 100 times bootstrap resampling was done for each calculation to get the 95% CI intervals. For each time bootstrap process, 385 times independent and random samplings of isolates were performed from the pool of all the 385 isolates in the analysis to construct a bootstrap sample set for the relative risk calculation. All epidemiological factors were preserved in each resampled isolate.

Results

Phylogenetic lineages of B. pertussis circulating in Australia

All 302 isolates collected in Australia between 2008 and 2012 were sequenced in this study. Together with five isolates from 1997 to 2002 and 78 isolates collected from 2013 to 2017 sequenced previously [Citation13], a total of 385 isolates were included in the analysis. Comparing to Tohama I, the number of SNPs identified per genome ranged from 191 to 308. A phylogenetic tree was constructed using the maximum likelihood method based on 1501 SNPs (). Among these 385 isolates, 351 were classified into our previously identified SNP Cluster I [Citation7] which was characterized by the ptxP3 allele and six cluster specific SNPs (Supplementary Table 2 & 3). The isolates carrying ptxP3 allele were further divided into two sub-lineages demarcated by different fim3 alleles, with 249 isolates carrying fim3A allele and 85 isolates carrying fim3B allele. Note that some studies used fim3-1 and fim3-2 notation for fim3 alleles which is equivalent to fim3A and fim3B respectively. Apart from the ptxP3 lineage, 21 out of 34 isolates carried the fim3A* allele clustered into a defined ptxP1-fim3A* lineage (fim3A* denotes a synonymous mutation in the fim3A allele [Citation7]). Twelve isolates which did not belong to any cluster were distributed between the ptxP1-fim3A* lineage and the ptxP3 lineage. In addition, there was one ptxP1 isolate which shared little homology with the major ptxP1 clades and was located at the root of the tree with a long branch ().

Our previous studies [Citation19] identified five lineages which were named as epidemic lineages (ELs). The five ELs (EL1 to EL5) previously described [Citation13] remained distinctive with expanded number of isolates, while a new EL (EL6) containing 21 isolates was identified as a sister group with EL1, EL3 and EL5 (). All ELs were within the ptxP3-fim3A lineage, and the division of the ELs were supported by bootstrap values of >80% and specific SNPs (Supplementary Table 3). It should be noted that these lineages may have been circulating before the recent epidemics and thus the naming does not imply that they arose during the epidemic.

Our previous study showed that Prn-negative isolates emerged at the start of the first epidemic in 2008 before becoming predominant by the end of the epidemic in 2012. Prn-negative isolates continued to be predominant through the latest epidemic [Citation3,Citation13]. EL1 had the highest proportion of Prn-negative isolates which accounted for 96.7% (59/61), while EL5 and EL6 had the lowest proportion with 8.00% (2/25) and 9.52% (2/21), respectively. The proportion of Prn-negative isolates in EL2, EL3 and EL4 were 66.67% (12/18), 56.92% (37/65) and 45% (18/40) respectively. The ptxP3-fim3B lineage also contained Prn-negative isolates with 27.05% (23/85) ().

Diversification of the epidemic lineages through the epidemics

Phylogenetic and BEAST analyses showed different ELs arose and diverged at different times. The root-to-tip phylogenetic distances of individual isolates showed a positive linear correlation with their collection dates (R2 = 0.1048, P = 6.481e-11) (Supplementary Figure 1), suggesting that B. pertussis evolved through successive SNP accumulation over time. The divergence time of ptxP3 lineage was estimated to be around 1983 (95%CI, 1980–1987).

To further understand how these lineages emerged, expanded or disappeared, as well as how the diversity of B. pertussis changed during the succession of lineages, a lineage through time (LTT) analysis [Citation32,Citation36] was performed. As shown in A, the diversifications [log (number of lineages)] of lineage ptxP1-fim3A*, ptxP3-fim3B, and ptxP3-fim3A started from around 1994, 1995, and 1997, respectively. Ten years after ACV replaced WCV in Australia, their diversifications peaked around 2007. In these three lineages, ptxP1-fim3A* failed to flourish during the 2008–2012 epidemic and declined rapidly, while diversification of both ptxP3-fim3A and ptxP3-fim3B accelerated during the epidemic. B. pertussis isolates carrying ptxP3-fim3A appeared later but also persisted longer. The diversification of the six ELs within the ptxP3-fim3A lineage were also assessed using LTT (B & C). EL3, EL4 and EL5 started to diversify at similar time points. EL3 and EL4 persisted through the two epidemics while EL5 went extinct in the 2008–2012 epidemic. EL2 and EL6 started at similar times. EL2 declined gradually whereas EL6 disappeared abruptly. EL1 started the latest among the six ELs and declined gradually with the progression of the first epidemic. This genetic diversification analysis suggested that the pertussis epidemics from 2008 to 2017 consisted of overlapping expansion and decline of different lineages over time.

Figure 2. Lineage through time (LTT). LTT plot was based on the temporal distribution of nodes in the Australian B. pertussis phylogenetic tree. A&C: LTT for allelic types and the lineages identified from the phylogenetic tree. Y axis shows the log of the number of ancestral lineages; X axis shows the estimated emerging time of each lineage. Different lineages are coded by line colours. B: Schematic tree of Australian B. pertussis isolates showing phylogenetic relationships of allelic types and ELs.

Figure 2. Lineage through time (LTT). LTT plot was based on the temporal distribution of nodes in the Australian B. pertussis phylogenetic tree. A&C: LTT for allelic types and the lineages identified from the phylogenetic tree. Y axis shows the log of the number of ancestral lineages; X axis shows the estimated emerging time of each lineage. Different lineages are coded by line colours. B: Schematic tree of Australian B. pertussis isolates showing phylogenetic relationships of allelic types and ELs.

Patterns of intra-state and inter-state spread of B. pertussis in Australia

To examine how B. pertussis isolates spread across the country, we applied relative risk analysis [Citation34] as a measure of transmission speed within defined intervals of divergence time and geographical distance using all 385 isolates based on the divergence date and tree in . As shown in A, two B. pertussis isolates collected from individuals residing within 1 km were five times more likely to share an MRCA of ≤ 1.5 years compared to two isolates from pairs which were > 260 km apart (the reference risk point). The probability of sharing an MRCA (<1.5 years) dropped to the reference point as the spatial distance increased between pairs and the value approached the reference risk (when distance > 20 km). Furthermore, relative risks all approached the reference risk for the pair of isolates with a MRCA of ≥1.5 years (B–D).

Figure 3. Relative risk analysis of Australian B. pertussis isolates within and between states. A–D: Intrastate WA (Western Australia) relative risk by MRCA within a defined period. Each point represents the risk that a pair of isolates collected from particular cases have an MRCA within a defined evolutionary timeframe relative to the risk that a pair of distal isolates (defined as two isolates from WA separated by > 260 km) have an MRCA in the same evolutionary timeframe. E–H: Interstate relative risk by MRCA within a defined period. Each point represents the risk that a pair of isolates collected from particular cases have an MRCA within a defined evolutionary timeframe relative to the risk that a pair of isolates from different state have a MRCA in the same evolutionary timeframe. One of the pair of isolates was selected from the specific state as shown in X-axis for each column and the other isolate was selected from any other 4 states. Each panel represents a different MRCA interval: (A) MRCA < 1.5 years, (B) MRCA 1.5–3.5 years, (C) MRCA 3.5–8 years, (D) MRCA 8–16 years. Grey broken lines represent the reference relative risks (reference ratio divided by itself, thus always equals one, see methods). Error bars represent the 95% CIs from 100 times bootstrap.

Figure 3. Relative risk analysis of Australian B. pertussis isolates within and between states. A–D: Intrastate WA (Western Australia) relative risk by MRCA within a defined period. Each point represents the risk that a pair of isolates collected from particular cases have an MRCA within a defined evolutionary timeframe relative to the risk that a pair of distal isolates (defined as two isolates from WA separated by > 260 km) have an MRCA in the same evolutionary timeframe. E–H: Interstate relative risk by MRCA within a defined period. Each point represents the risk that a pair of isolates collected from particular cases have an MRCA within a defined evolutionary timeframe relative to the risk that a pair of isolates from different state have a MRCA in the same evolutionary timeframe. One of the pair of isolates was selected from the specific state as shown in X-axis for each column and the other isolate was selected from any other 4 states. Each panel represents a different MRCA interval: (A) MRCA < 1.5 years, (B) MRCA 1.5–3.5 years, (C) MRCA 3.5–8 years, (D) MRCA 8–16 years. Grey broken lines represent the reference relative risks (reference ratio divided by itself, thus always equals one, see methods). Error bars represent the 95% CIs from 100 times bootstrap.

Relative risk was also used to assess the speed of B. pertussis isolates spread across Australia. The analysis for relative risk of the <1.5 year pairs did not reach statistical significance at the 0.05 level possibly due to small number of isolates. When the MRCA period was >1.5 and < 3.5 years, it is clear that the relative risk of each state transmitting isolates to another state differed from the reference risk (E and F). In contrast, the relative risk for each of the states where the isolates transmitted to another state approached the reference risk when the MRCA period was > 3.5 years (G and H).

Relationship of Australian epidemic lineages with global B. pertussis

A phylogenetic tree () of 1452 B. pertussis genomes was constructed based on 5777 SNPs in these genomes (See Methods), including 302 newly sequenced Australian B. pertussis isolates in this study and 1150 publicly available genomes (Supplementary Table 4). Ten ptxP and fim3 genotypes were identified with four major genotypes ptxP1-fim3A/ptxP1-fim3A*, ptxP3-fim3A and ptxP3-fim3B (). These genotypes were phylogenetically clustered and the genotype divisions support the tree structure at large scale as expected, which is consistent with previous studies [Citation4,Citation37,Citation38].

Figure 4. Phylogenetic tree of Australian and global B. pertussis isolates. Phylogenetic tree of 1452 B. pertussis isolates based on genome-wide SNPs. The tree was rooted using a group of ptxP2 isolates related to early Dutch strain B189 (isolated in 1991). The five columns to the right of the tree show the country of isolation, presence or absence of Prn, ptxP-fim3 genotype, bscI allele and Australian lineages per colour legends. The background colours represent the ELs as indicated. For both tree branches and the 5th Column, grey background colour of individual or groups of isolates marks Australian isolates not belonging to any ELs. Bootstrap values from 100 replicates are indicated by the colour of the branches as shown. Black arrows mark three Chinese ptxP3 isolates.

Figure 4. Phylogenetic tree of Australian and global B. pertussis isolates. Phylogenetic tree of 1452 B. pertussis isolates based on genome-wide SNPs. The tree was rooted using a group of ptxP2 isolates related to early Dutch strain B189 (isolated in 1991). The five columns to the right of the tree show the country of isolation, presence or absence of Prn, ptxP-fim3 genotype, bscI allele and Australian lineages per colour legends. The background colours represent the ELs as indicated. For both tree branches and the 5th Column, grey background colour of individual or groups of isolates marks Australian isolates not belonging to any ELs. Bootstrap values from 100 replicates are indicated by the colour of the branches as shown. Black arrows mark three Chinese ptxP3 isolates.

These isolates were collected from 1935 to 2017 across 20 countries in five continents, Asia, Africa, North and South America, Europe and Oceania (, Supplementary Table 4). The largest number of isolates were from the US, Australia and Japan with 506, 404, and 180 isolates respectively. Consistent with a previous study [Citation37], the US isolates (annotated in blue in ) were distributed across the phylogenetic tree. Isolates from UK and France (annotated in dark/light green in ) were also generally distributed among ptxP3-fim3A and ptxP3-fim3B clades. In contrast, most (95%, 171/180) of the Japan B. pertussis isolates sampled between 1982 and 2013 (annotated in purple in ) were clustered in a single clade which was located near the root of the tree and were geographically restricted to Japan (). Similarly, most (94%, 47/50) isolates from North-western China (annotated in pink in ) were grouped together in one clade, except for three Chinese ptxP3 isolates on the global tree which were grouped with other ptxP3 isolates (). While ptxP1 isolates were predominantly in North-western China, it should be noted that the proportion of ptxP3 isolates was much higher with up to 50% in the Southern part of China (e.g. Shanghai, Zhejiang and Shenzhen) [Citation39–41].

The Australian isolates were spread across the tree with many instances of an Australian lineage consisting of one or more Australian isolates only sharing MCRA with isolates from another country (), suggesting independent import into Australia. However, more than half of the isolates (230 of 404 Australian isolates grouped into the six ELs) were grouped together as distinctive lineages, in particular EL1, EL2, EL5 and EL6 (color-coded on branches in ). However, EL3 was split into three clusters with one large (52 isolates), two small clusters of two and nine isolates, interspersed with isolates from other countries. Therefore, EL3 is not an exclusive Australian lineage and there were likely three independent origins of Australian isolates within EL3 with each cluster closest to different US isolates. Similarly, EL4 has also been split into three separate groups for Australian isolates with one large (32 isolates) and one small (six isolates) cluster and one singleton. The large cluster was closest to UK isolates while the smaller cluster was closest to US isolates. Therefore, EL3 and EL4 are large global lineages with sub-lineages within the lineage being Australian only. By contrast, within EL1, EL2, EL5 and EL6, there were also isolates from other countries among the Australian isolates suggesting export of these lineages to other countries after expansion in Australia.

Emergence and global expansion of ptxP3/bscI3 genotype

Analysis of the phylogenetic tree () of 1452 B. pertussis genomes revealed a major allelic replacement event in bscI (BP2249), which encodes the type III secretion system basal body protein BscI. From the 1452 genomes, five bscI alleles (designated as bscI1, bscI2, bscI3, bscI4 and bscI5) were observed. bscI1 is the allele found in Tohama I. Relative to bscI1, bscI2 has a non-synonymous SNP (G146A) which changes glycine to glutamic acid while bscI3 has a non-synonymous SNP (A341G) which alters tyrosine to cysteine (A). bscI4 and bscI5 also contains the A341G mutation and an additional C184A and C131 T mutation, respectively. C184A changes lysine to methionine in bscI4 while C131 T in bscI5 changes alanine to valine (A). A341G was predicted by PROVEAN to be a deleterious mutation while all other mutations were predicted to be neutral [Citation20,Citation42].

Figure 5. bscI alleles and their frequencies detected in 1452 sequenced B. pertussis isolates from across the globe. (A) bscI alleles with nucleotide and amino acid changes. The numbers at the top refer to the position of the underlined SNP, relative to the start of the bscI gene. Dots represent identical base or amino acid. The nucleotide sequence is shown in codons with the corresponding amino acid (in single letter format) shown below. (B) Number and percentage of ptxP3-bscI3 isolated since it was first detected in 1996. Red, green and purple bars depict the number of ptxP1-bscI1, ptxP3-bscI1 and ptxP3-bscI3 isolates. Twenty of the 1452 isolates were non-ptxP1/non-ptxP3 or non-bscI1/non-bscI3 and were not tallied in the graph. An additional 21 isolates had no collection date assigned and were also not tallied in the graph. The ptxP1-bscI1, ptxP3-bscI1 and ptxP3-bscI3 isolates were found across the globe. The line graph represents the rapid expansion of ptxP3-bscI3 strains.

Figure 5. bscI alleles and their frequencies detected in 1452 sequenced B. pertussis isolates from across the globe. (A) bscI alleles with nucleotide and amino acid changes. The numbers at the top refer to the position of the underlined SNP, relative to the start of the bscI gene. Dots represent identical base or amino acid. The nucleotide sequence is shown in codons with the corresponding amino acid (in single letter format) shown below. (B) Number and percentage of ptxP3-bscI3 isolated since it was first detected in 1996. Red, green and purple bars depict the number of ptxP1-bscI1, ptxP3-bscI1 and ptxP3-bscI3 isolates. Twenty of the 1452 isolates were non-ptxP1/non-ptxP3 or non-bscI1/non-bscI3 and were not tallied in the graph. An additional 21 isolates had no collection date assigned and were also not tallied in the graph. The ptxP1-bscI1, ptxP3-bscI1 and ptxP3-bscI3 isolates were found across the globe. The line graph represents the rapid expansion of ptxP3-bscI3 strains.

Of the five bscI alleles two, bscI1 and bscI3, were predominant representing 23.3% (338/1452) and 76.5% (1111/1452) of the isolates, respectively. For bscI2, bscI4 and bscI5, only one isolate was identified for each. bscI2 was detected in a ptxP1-fim3A isolate while bscI4 and bscI5 were detected in a ptxP3-fim3C and ptxP3-fim3B isolate, respectively. bscI4 and bscI5 appear to have evolved from a bscI3 strain as both shared the bscI3 mutation. The bscI1 allele has been present since 1935 (the earliest strain in this study) and was mostly associated with ptxP1 isolates (B and Supplementary Table 5). As for bscI3, it appeared to have emerged as a single event () in a ptxP3 strain with the earliest bscI3 isolate identified in 1996 from the Netherlands (Supplementary Table 5). Between 1988 and 1995, all ptxP3 isolates sequenced had the bscI1 allele, however after 1996, ptxP3-bscI3 strains rapidly expanded and replaced the existing ptxP3-bscI1 strains (B). The last ptxP3-bscI1 isolate sequenced was isolated in 2009. From 2010 onwards, the bscI1 allele has only been seen in isolates carrying ptxP1, with many of the recent ptxP1-bscI1 isolates sequenced being isolated from China.

Discussion

Australia experienced two major pertussis epidemics in the past two decades [Citation13]. In a previous study by Safarchi et al. [Citation19], a set of 27 isolates from the 2008–2012 epidemic was used to define five ELs, four of which expanded during the 2013–2017 epidemic [Citation13]. This study expanded the previous studies with a larger dataset of 385 isolates, collected from 1997 to 2017, including isolates from the two recent epidemics (2008–2012 and 2013–2017) and compared with over 1000 global isolates. The Australian genomic data was also augmented by background epidemiological information, including isolation dates and locations, to allow comprehensive phylogeographic analyses to be performed and thus provided a more complete picture of B. pertussis population structure in Australia. Although we have no clinical information on these isolates, it is likely that many isolates were from vaccinated cases as the vaccine coverage in Australia was high (> 92% for 1-year-old and > 95% for 2-year-old) [Citation43].

This study has shown that some major lineages dominated during the two epidemics in Australia but there were also many independent lineages detected (). These lineages had a shared MRCA with isolates from other countries and thus were likely to have been independently imported initially. Some lineages expanded while others died. Interestingly, the ELs were mostly imported as Prn-positive strains from other countries before expanding locally. These strains lost the Prn during the expansion in Australia, paralleling the global reports of Prn-negative strains arising independently [Citation11,Citation44,Citation45].

LTT analysis [Citation32] depicted the development, disappearance and genetic diversification of different lineages across the two epidemics. The first epidemic (2008–2012) was associated with one ptxP1 lineage and one ptxP3 lineage. The outcompeting of ptxP1 lineage by the ptxP3 lineage started well before the epidemic and was consistent with the hypothesis of vaccine selection pressure [Citation7,Citation8]. The diversification of the ptxP3 lineage began around 1995–1997 when ACV was first introduced in Australia [Citation3], then peaked around 2008 at the start of the epidemic. During this period, two overlapping types of vaccine selective pressure co-existed: the waning but broader WCV-induced immunity and the newly introduced but narrower ACV immunity. In population genetics terms, above observations might be explained by hard and soft selective sweeps [Citation46,Citation47]. The emergence of the ptxP3 lineage under vaccine selection led to hard selective sweeps across the population resulting in a population bottleneck, followed by rapid recovery of genetic diversity with multiple sub-lineages emerging within the ptxP3 lineage. Further, the parallel emergence of Prn-negative strains from the selection pressure from the ACV can be viewed as soft selective sweeps followed by accelerated diversification of these sub-lineages. Therefore, hard and soft selective sweeps through vaccine selection pressures could have shaped the current B. pertussis population dynamics.

Over the course of the two epidemics, ptxP3-fim3A lineage gradually overtook ptxP3-fim3B lineage in Australia. Since Australia used mostly an ACV without Fim3 component, it is unlikely that ACV selection pressure played any role. There is a possibility that WCV played a role in the selection of strains in the early years of ACV era as vaccination or initial priming with WCV had a longer-term protection against B. pertussis infection than ACVs [Citation48]. However, it is likely that, as the B. pertussis populations evolve, natural infection induced immunity may have affected the dynamics of spread of the two ptxP3-fim3 sub-lineages. It is also possible that the two sub-lineages differed in frequency in their countries of origin, affecting their chances of being spread to other continents including Australia (the “founder effect”). In the US, fim3B was more prevalent in 2006–2009 while fim3A was more prevalent in the UK during 2001–2012 in the ACV era [Citation49,Citation50]. It is possible that strains with the fim3B allele were imported into Australia around 1995 () and expanded during the first epidemic. Another possible explanation could be hitchhiking effect. When compared to fim3B isolates, a higher proportion of Australian fim3A isolates have lost Prn which may have led to better survival of the ptxP3-fim3A lineage in Australia as fim3A hitchhiked on the Prn negative trait ().

Further, LTT analysis of the six ELs within the ptxP3-fim3A lineage showed that the increase and decrease of the diversification of different ELs (EL1-6) occurred at different time points. The rise and fall of different ELs during the epidemic revealed that the pertussis epidemic was not due to a sole expansion of a single ptxP3 lineage in Australia. Rather, in the epidemic period, the B. pertussis population was composed of different lineages with overlapping circulation of multiple lineages over time. Examination of the origins of the lineages in the global phylogenetic tree showed that four lineages, EL1, EL2, EL5 and EL6 were likely to have been imported into Australia once and then expanded as each of these lineages contained mostly Australian isolates. However, EL3 was now seen as consisting of one large (n = 52) cluster and two smaller (n = 2 and n = 9) clusters of Australian only isolates in each case, suggesting that EL3 was a global lineage within which the clusters were actually imported into Australia and then underwent local expansion after importation (). EL4 has the same branching pattern. Within EL3, the large cluster expanded with emergence and expansion of a Prn-negative subcluster while the minor cluster contained no Prn-negative isolates (). The data suggest that EL3 has adapted to circulate in Australia with the emergence of the Prn-negative subcluster. In EL4, Prn-positive isolates and Prn-negative isolates started to expand in parallel in two independent sub-lineages, but only Prn-negative sub-lineages survived after 2008. By contrast, EL6 which persisted during the first epidemic and then abruptly declined had only two Prn-negative isolates (9%). The above result suggests that the lineages with lower proportion of Prn-negative (e.g. EL6) isolates may not be fit enough to survive the competition for respiratory niches with other lineages with predominantly Prn-negative isolates. This can be interpreted as additional supporting evidence that the succession of the B. pertussis lineages was affected by the selection pressure from the vaccines.

Since isolates from this study had residential postcode data, the local spread of isolates was examined using relative risk analysis. Isolates that shared an MRCA within 1.5 years from present were more likely to be localized within 1 km (the approximate radius of an average suburb). In other words, once a new B. pertussis lineage emerged, it was more likely to spread locally within the first 1.5 years. However, any new lineage emerged after 1.5 years was likely to transmit to a wider region, reflecting rapid spread of pertussis strains within a state. Relative risk analysis was also used to examine the inter-state spread of B. pertussis in Australia. Within 3.5 years, newly arisen lineages were more likely to stay within a state. However, after 3.5 years, new lineages can spread between states with equal chances and these lineages were no longer highly spatially structured across the country. The patterns of spread uncovered in this study may have implications on local pertussis infection dynamics. By inference, infection induced immunity by newly arisen strains within the first 3.5 years will tend to be localized. The timeframe also coincides with pertussis epidemic cycles of 3–4 years [Citation43]. Further modelling studies will be required to understand the effect of initial localized spread of newly arisen strains on pertussis epidemic dynamics.

Therefore, pertussis epidemics in Australia were characterized by the initial importation followed by expansions of different lineages over the epidemics. On the other hand, these lineages could also spread to other countries after a certain period of time, generating interweaving pertussis epidemics across the globe as seen with EL1 and EL6 within which isolates from other countries were found. Given that pertussis is a respiratory infection and is not confined by borders just like SARS-CoV-2 [Citation51], epidemics in one country may rapidly affect other countries. However, the relative risk analysis suggests that strains were geographically restricted within and between states in the initial 1.5–3.5 years. Therefore, there was a likely lag of pertussis epidemics between countries from 1.5 to 3.5 years. These epidemic patterns are consistent with the time needed for a locally arisen lineage to spread. A range of epidemiologic factors such as population density, population mobility and relative opportunity of exposure of population cohort can affect the rate of spread, but these factors were not within the framework of the relative risk analysis method and thus not incorporated or considered.

Phylogenetic analysis of 1452 Australian and international isolates revealed the emergence of bscI3 in ptxP3 strains. Although both ptxP1-bscI1 and ptxP3-bscI3 strains were circulating during the mid-1990s-2000s, the expansion of ptxP3 strains in many ACV countries, such as Australia, US, UK, Canada and France, was only associated with the bscI3 allele. This suggests that the bscI3 allele may provide additional selective advantages to ptxP3 strains. King et al [Citation52] showed that ptxP1 strains with a ptxP3 genetic background had higher colonization ability than wild type ptxP1 strains, suggesting additional adaptation besides ptxP3. Our mouse studies found that ptxP3 strains appeared to be fitter than ptxP1 strains in both ACV immunized mice and naïve mice [Citation23]. Further our previous proteomic studies suggests that the non-synonymous A341G mutation in bscI3 may result in decreased secretion of T3SS effectors [Citation20–22]. This may provide ptxP3 strains with reduced immune recognition allowing it to evade the general host immune system and may partly explain the increased fitness of Cluster I strains regardless of immunization status [Citation23]. Since 2010, all ptxP3 strains sequenced globally including fim3A/fim3B and Prn-positive/Prn-negative isolates were ptxP3-bscI3 (with the exception of two bscI4 and bscI5 isolates which were derived from bscI3) suggesting a selective advantage of the A341G mutation. ptxP3 strains were also found to predominant in Iran, a WCV country, which were all carrying bscI3 allele [Citation44]. There is a possibility that emergence of the bscI3 allele was associated with WCV selection if BscI or the T3SS was part of the WCV antigens. Strains used to produce WCV most likely carried bscI1 allele since all early circulation strains carrying bscI1. It is possible that bscI3 was hitch-hiking on ptxP3 but it remains to be seen whether the selective advantage provided by bscI3 is dependent on the ptxP3 allele. Most recent ptxP1-bscI1 strains were currently circulating in China and its dominance may be associated with macrolide resistance [Citation53].

Conclusions

In this study, genome-scale phylogenetic analysis with spatial and temporal metadata revealed the important dynamics of importation and expansion of B. pertussis lineages in Australia. During the two epidemics in Australia, multiple B. pertussis ELs were initially imported, expanded and then declined at different time points. Our results also suggest that, once a new strain emerged, it was more likely to only spread within a short spatial distance in the first years after emergence. However, after 3.5 years of evolutionary time, the different lineages were no longer spatially structured across the country. This study showed that the expansion of ptxP3 strains was also associated with the expansion of the bscI3 allele. This allele may reduce secretion of T3SS effectors and allow B. pertussis to reduce immune recognition and thus increase overall fitness of ptxP3 strains. Together, these findings uncovered important characteristics of the epidemic dynamics and population structural changes of B. pertussis and improved our understanding of pertussis epidemics and re-emergence in highly vaccinated host populations.

Author contributions

Ruiting Lan conceived the study, Zheng Xu, Dalong Hu and Laurence Luu performed the analysis. Sophie Octavia, Vitali Sintchenko, Frits R. Mooi and Laurence Don Wai Luu revised the manuscript. Mark M. Tanaka supported the calculation of the relative risk. Vitali Sintchenko, Jenny Robson and Anthony D Keil collected B. pertussis isolates. Ruiting Lan provided critical analysis and discussions. All authors discussed the results and contributed to the revision of the final manuscript.

Supplemental material

Supplemental Material

Download Zip (890.4 KB)

Acknowledgements

We thank Narelle Raven for technical assistance. Z.X. was supported by a University of New South Wales scholarship.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability

All Illumina sequencing reads analysed in this study were deposited in the NCBI database. Raw reads were deposited under the project accession number PRJNA562796. In addition, the other five Illumina sequencing reads collected from 1997 to 2002 were deposited under PRJNA280658 and 78 isolates from 2013 to 2017 Australian epidemic study were deposited under PRJNA432286. The global B. pertussis genomes were downloaded from Sequence Read Archive (SRA) in NCBI by searching “Bordetella pertussis” by the date of 6th Nov 2018.

Additional information

Funding

This work was supported by The National Health and Medical Research Council: [Grant Number 1146938].

Reference

  • World Health Organization. Generic protocol for estimating the burden of pertussis in young children [electronic resource]. Geneva: World Health Organization; 2005.
  • Clark TA. Changing pertussis epidemiology: everything old is new again. J Infect Dis. 2014;209(7):978–981.
  • Lam C, Octavia S, Ricafort L, et al. Rapid increase in pertactin-deficient Bordetella pertussis isolates, Australia. Emerg Infect Dis. 2014;20(4):626.
  • Bart MJ, Harris SR, Advani A, et al. Global population structure and evolution of Bordetella pertussis and their relationship with vaccination. MBio. 2014;5(2):e01074–14.
  • Yeung KHT, Duclos P, Nelson EAS, et al. An update of the global burden of pertussis in children younger than 5 years: a modelling study. Lancet Infect Dis. 2017;17(9):974–980.
  • Octavia S, Sintchenko V, Gilbert GL, et al. Newly emerging clones of Bordetella pertussis carrying prn2 and ptxP3 alleles implicated in Australian pertussis epidemic in 2008–2010. J Infect Dis. 2012;205(8):1220–1224.
  • Octavia S, Maharjan RP, Sintchenko V, et al. Insight into evolution of Bordetella pertussis from comparative genomic analysis: evidence of vaccine-driven selection. Mol Biol Evol. 2010;28(1):707–715.
  • Mooi FR, van Loo IH, Van Gent M, et al. Bordetella pertussis strains with increased toxin production associated with pertussis resurgence. Emerg Infect Dis. 2009;15(8):1206.
  • Lam C, Octavia S, Bahrame Z, et al. Selection and emergence of pertussis toxin promoter ptxP3 allele in the evolution of Bordetella pertussis. Infect Genet Evol. 2012;12(2):492–495.
  • Hovingh ES, Mariman R, Solans L, et al. Bordetella pertussis pertactin knock-out strains reveal immunomodulatory properties of this virulence factor. Emerg Microbes Infect. 2018;7(1):1–13.
  • Weigand MR, Peng Y, Loparev V, et al. The history of Bordetella pertussis genome evolution includes structural rearrangement. J Bacteriol. 2017;199(8):e00806–16.
  • Lan R, Octavia S. Chapter 10, vaccine-driven selection and the changing molecular epidemiology of Bordetella pertussis. In: Rohani P, Scarpino SV, editors. Pertussis: epidemiology, immunology, and evolution. New York, NY: Oxford University Press; 2019. p. 166–181.
  • Xu Z, Octavia S, Luu LDW, et al. Pertactin-Negative and filamentous hemagglutinin-negative Bordetella pertussis, Australia, 2013–2017. Emerg Infect Dis. 2019;25(6):1196.
  • Safarchi A, Octavia S, Luu LDW, et al. Pertactin negative Bordetella pertussis demonstrates higher fitness under vaccine selection pressure in a mixed infection model. Vaccine. 2015;33(46):6277–6281.
  • Meyler K, Meehan M, Bennett D, et al. Spontaneous capsule loss in Haemophilus influenzae serotype b associated with Hib conjugate vaccine failure and invasive disease. Clin Microbiol Infect. 2019;25(3):390–391.
  • Carvalho CF, Andrews N, Dabrera G, et al. National outbreak of pertussis in England, 2011–2012: A case-control study comparing 3-component and 5-component acellular vaccines with whole-cell pertussis vaccines. Clin Infect Dis. 2020;70(2):200–207.
  • Willems R, Paul A, Van Der Heide H, et al. Fimbrial phase variation in Bordetella pertussis: a novel mechanism for transcriptional regulation. EMBO J. 1990;9(9):2803–2809.
  • Gorringe AR, Vaughan TE. Bordetella pertussis fimbriae (Fim): relevance for vaccines. Expert Rev Vaccines. 2014;13(10):1205–1214.
  • Safarchi A, Octavia S, Wu SZ, et al. Genomic dissection of Australian Bordetella pertussis isolates from the 2008–2012 epidemic. Journal of Infection. 2016;72(4):468–477.
  • Luu LDW, Octavia S, Zhong L, et al. Comparison of the whole cell proteome and secretome of epidemic Bordetella pertussis strains from the 2008–2012 Australian epidemic under sulfate-modulating conditions. . Front Microbiol. 2018;9:2851.
  • Luu LDW, Octavia S, Zhong L, et al. Proteomic adaptation of Australian epidemic Bordetella pertussis. Proteomics. 2018;18(8):1700237.
  • Luu LDW, Octavia S, Aitken C, et al. Surfaceome analysis of Australian epidemic Bordetella pertussis reveals potential vaccine antigens. Vaccine. 2020;38(3):539–548.
  • Safarchi A, Octavia S, Luu LDW, et al. Better colonisation of newly emerged Bordetella pertussis in the co-infection mouse model study. Vaccine. 2016;34(34):3967–3971.
  • Bastin D, Romana L, Reeves P. Molecular cloning and expression in Escherichia coli K-12 of the rfb gene cluster determining the O antigen of an E. coli O111 strain. Mol Microbiol. 1991;5(9):2223–2231.
  • Bankevich A, Nurk S, Antipov D, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19(5):455–477.
  • Li H. Durbin R. Fast and accurate short read alignment with burrows–Wheeler transform. bioinformatics. 2009;25(14):1754–1760.
  • Li H, Handsaker B, Wysoker A, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25(16):2078–2079.
  • Darling AE, Mau B, Perna NT. Progressivemauve: multiple genome alignment with gene gain, loss and rearrangement. PloS one. 2010;5(6):e11147.
  • Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–1313.
  • Hu D, Liu B, Wang L, et al. Living trees: high-quality reproducible and reusable construction of bacterial phylogenetic trees. Mol Biol Evol. 2020;37(2):563–575.
  • Moura A, Criscuolo A, Pouseele H, et al. Whole genome-based population biology and epidemiological surveillance of listeria monocytogenes. Nature Microbiology. 2016;2(2):1–10.
  • Reznick DN, Ricklefs RE. Darwin's bridge between microevolution and macroevolution. Nature. 2009;457(7231):837–842.
  • Revell LJ. Phytools: an R package for phylogenetic comparative biology (and other things). Methods in Ecology and Evolution. 2012;3(2):217–223.
  • Salje H, Lessler J, Berry IM, et al. Dengue diversity across spatial and temporal scales: local structure and the effect of host population size. Science. 2017;355(6331):1302–1306.
  • Leong R, Wood J, Turner R, et al. Estimating seasonal variation in Australian pertussis notifications from 1991 to 2016: evidence of spring to summer peaks. Epidemiology & Infection. 2019;147:e155.
  • Rabosky DL, Lovette IJ. Density-dependent diversification in North American wood warblers. Proceedings of the Royal Society B: Biological Sciences. 2008;275(1649):2363–2371.
  • Bouchez V, Guglielmini J, Dazas M, et al. Genomic sequencing of Bordetella pertussis for epidemiology and global surveillance of whooping cough. Emerg Infect Dis. 2018;24(6):988.
  • Weigand MR, Williams MM, Peng Y, et al. Genomic survey of Bordetella pertussis Diversity, United States, 2000–2013. Emerg Infect Dis. 2019;25(4):780.
  • Fu P, Wang C, Tian H, et al. Bordetella pertussis infection in infants and young children in Shanghai, China, 2016–2017: clinical features, genotype variations of antigenic genes and macrolides resistance. Pediatr Infect Dis J. 2019;38(4):370–376.
  • Wu S, Hu Q, Yang C, et al. Molecular epidemiology of Bordetella pertussis and analysis of vaccine antigen genes from clinical isolates from Shenzhen, China. Ann Clin Microbiol Antimicrob. 2021;20(1):1–9.
  • Barkoff A-M, He Q. Molecular Epidemiology of Bordetella pertussis. In: Fedele G, Ausiello C, editors. Pertussis Infection and Vaccines. Advances in Experimental Medicine and Biology, Vol. 1183. Cham: Springer; 2019. p. 19–33.
  • Choi Y, Chan AP. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics. 2015;31(16):2745–2747.
  • Quinn HE, McIntyre PB. Pertussis epidemiology in Australia over the decade 1995-2005-trends by region and age group. Commun Dis Intell Q Rep. 2007;31(2):205–215.
  • Safarchi A, Octavia S, Nikbin VS, et al. Genomic epidemiology of Iranian Bordetella pertussis: 50 years after the implementation of whole cell vaccine. Emerg Microbes Infect. 2019;8(1):1416–1427.
  • Barkoff A-M, Mertsola J, Pierard D, et al. Pertactin-deficient Bordetella pertussis isolates: evidence of increased circulation in Europe, 1998 to 2015. Eurosurveillance. 2019;24(7):1700832.
  • Messer PW, Petrov DA. Population genomics of rapid adaptation by soft selective sweeps. Trends Ecol Evol. 2013;28(11):659–669.
  • Pennings PS, Kryazhimskiy S, Wakeley J. Loss and recovery of genetic diversity in adapting populations of HIV. PLoS Genet. 2014;10(1):e1004000.
  • Sheridan SL, Frith K, Snelling TL, et al. Waning vaccine immunity in teenagers primed with whole cell and acellular pertussis vaccine: recent epidemiology. Expert Rev Vaccines. 2014;13(9):1081–1106.
  • Schmidtke AJ, Boney KO, Martin SW, et al. Population diversity among Bordetella pertussis isolates, United States, 1935–2009. Emerg Infect Dis. 2012;18(8):1248.
  • Sealey KL, Harris SR, Fry NK, et al. Genomic analysis of isolates from the United Kingdom 2012 pertussis outbreak reveals that vaccine antigen genes are unusually fast evolving. J Infect Dis. 2014;212(2):294–301.
  • Wells CR, Sah P, Moghadas SM, et al. Impact of international travel and border control measures on the global spread of the novel 2019 coronavirus outbreak. Proc Natl Acad Sci USA. 2020;117(13):7504–7509.
  • King AJ, van der Lee S, Mohangoo A, et al. Genome-wide gene expression analysis of Bordetella pertussis isolates associated with a resurgence in pertussis: elucidation of factors involved in the increased fitness of epidemic strains. PloS one. 2013;8(6):e66150.
  • Xu Z, Wang Z, Luan Y, et al. Genomic epidemiology of erythromycin-resistant Bordetella pertussis in China. Emerg Microbes Infect. 2019;8(1):461–470.