2,560
Views
14
CrossRef citations to date
0
Altmetric
Articles

Emergence of multidrug-resistant Mycobacterium tuberculosis of the Beijing lineage in Portugal and Guinea-Bissau: a snapshot of moving clones by whole-genome sequencing

ORCID Icon, ORCID Icon, , ORCID Icon, ORCID Icon, ORCID Icon, , ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon show all
Pages 1342-1353 | Received 07 Oct 2019, Accepted 14 May 2020, Published online: 15 Jun 2020

ABSTRACT

The Beijing genotype comprises a highly disseminated strain type that is frequently associated with multidrug resistant (MDR) tuberculosis (TB) and increased transmissibility but, countries such as Portugal and Guinea-Bissau fall outside the regions phylogeographically associated with this specific genotype. Nevertheless, recent data shows that this genotype might be gradually emerging in these two countries as an underlying cause of primary MDR-TB. Here, we describe the emergence of Mycobacterium tuberculosis Beijing strains associated with MDR-TB in Portugal and Guinea-Bissau demonstrating the presence of the well described superclusters 100-32 and 94-32 in Portugal and Guinea-Bissau, respectively. Genome-wide analysis and comparison with a global genomic dataset of M. tuberculosis Beijing strains, revealed the presence of two genomic clusters encompassing isolates from Portugal and Guinea-Bissau, GC1 (n = 121) and GC2 (n = 39), both of which bore SNP signatures compatible with the 100-32/B0/W148 and 94-32/Central Asia Outbreak clades, respectively. Moreover, GC2 encompasses a cross-border cluster between Portugal, Guinea-Bissau and Brazil thus supporting migration-associated introduction of MDR-TB and subsequent clonal expansion at the community-level. The comparison with global Beijing datasets demonstrates the global reach of the disease and its complex dissemination across multiple countries while in parallel there are clear microevolutionary trajectories towards extensively drug resistant TB.

Introduction

Originally described by van Soolingen et al. in 1995, Mycobacterium tuberculosis Beijing lineage has shown a variable association with drug resistance coupled with a well-documented transmissibility and ability to cause tuberculosis (TB) outbreaks [Citation1]. In the 1990s, Beijing strains became widely known as the underlying cause of a major New York multidrug-resistant (MDR)-TB outbreak among patients co-infected with the Human Immunodeficiency Virus (HIV) [Citation2]. Other outbreaks involving drug resistant Beijing strains are often described in different regions throughout the globe [Citation3,Citation4]. This specific and widespread genotype was initially thought to have emerged in North-Central China more than one thousand years ago but, recent estimates point to Beijing lineage split approximately thirty thousand years ago in southern East Asia [Citation5,Citation6]. At the immunopathological level, the Beijing lineage has shown several potential selective advantages, particularly its “modern” sub-branch (i.e. strains harboring a IS6110 at the NTF locus and mutT4 codon 48 CGG > GGG substitution) which were shown in some studies to be associated with a reduced proinflammatory response and an increased virulence on mouse models of infection [Citation7,Citation8].

In Portugal, MDR-TB has been a major cause of public health concern particularly in the metropolitan area surrounding Lisbon. This concern is mainly due to two major monophyletic clades of the Latin American and Mediterranean (LAM) lineage (Lisboa3 and Q1), highly associated with MDR-TB, but also extensively drug resistant (XDR)-TB [Citation9]. Despite its well-documented transmissibility and ability to cause TB outbreaks, Beijing strains are seldom isolated in the country [Citation3,Citation4,Citation10].

Nevertheless, over the last 10 years we have been observing an increasing number of MDR-TB cases associated with Beijing strains (unpublished data). Recently, we developed CPLP-TB (http://cplp-tb.ff.ulisboa.pt), an online database aimed at tracking clones of M. tuberculosis expanding over Portuguese-speaking countries [Citation11]. Presently, CPLP-TB contains nine and eight Beijing isolates from Portugal and Guinea-Bissau, respectively, with available 24-loci Mycobacterial Interspersed Repetitive Unit-Variable Number of Tandem Repeat (MIRU-VNTR) profiles. In Guinea-Bissau, the emergence of Beijing strains associated with MDR-TB was previously reported by us although the route driving the emergence and spread of such strains to this country was unclear [Citation12].

Herein, we explore the source of the MDR-TB Beijing genotypes in both Portugal and Guinea-Bissau and unveil novel and previously uncharacterized routes for their emergence.

Methods

Clinical isolates and drug susceptibility testing

A total of seventeen M. tuberculosis clinical isolates belonging to the Beijing genotype as determined by spoligotyping were selected. From these, eight isolates were recovered from patients followed at the Hospital Raoul Follereau in Bissau, Guinea-Bissau, while the remaining nine isolates were obtained from patients at laboratories and hospitals across the Lisbon Health Region in Portugal between 2008 and 2014.

All isolates were subjected to susceptibility testing to all five first-line drugs by the BACTEC™ MGIT™ 960 System (Becton Dickinson Diagnostic Systems, Sparks, MD, USA) using standardized critical concentrations.

Genotyping and cluster identification

Twenty-four-loci MIRU-VNTR typing was carried out by multiplex amplification procedure described by Supply et al [Citation13]. Amplicon size determination was performed by capillary electrophoresis on an ABI 3130 Genetic Analyzer (Applied Biosystems®, Foster City, CA, USA) using a custom ROX-labelled molecular weight marker, MapMarker®1200 (Bioventures®, Murfreesboro, TN, USA), with 25 bands sized between 100 and 1200 bp.

Spoligotyping was performed by a single-tube multiplex PCR amplification of 43 spacer regions of the direct repeat (DR) locus using oligonucleotide primers DRa (5′-Biotin-GGTTTTGGGTCTGACGAC-3′) and DRb (5′-CCGAGAGGGGACGAAAC-3′) and 20 ng of genomic DNA. Amplicons were reverse hybridized on a membrane with amino-linked immobilized probes for each spacer as described previously by Kamerbeek et al. Detection was performed using the ECL® Chemiluminescence Detection System (GE Healthcare®, Cleveland, OH, USA) as per the manufacturer’s instructions. Spoligotyping profiles were assigned to lineage, clade and shared international type (SIT) using the rules described in SITVITWEB and SITVIT2 international databases [Citation10,Citation14,Citation15].

A comparative genotypic profile analysis was carried out including a total of 5482 isolates and 24-loci MIRU-VNTR profiles assembled from three large global studies, along with publicly available metadata [Citation16–18]. Hierarchical clustering analysis was conducted in R statistical software using the hclust function and clusters defined as groups of two or more isolates sharing identical 24-loci MIRU-VNTR profiles. MtbC15-9 type classification was performed using the MIRU-VNTRplus online database [Citation19]. The Hunter-Gaston index of diversity was computed as described previously [Citation20].

Whole genome sequencing and bioinformatic analysis

Whole genome sequencing (WGS) was carried out using an Illumina HiSeq 2500 paired-end (100/150 bp) platform. Single nucleotide polymorphisms (SNPs) were obtained by mapping raw sequence reads to the M. tuberculosis H37Rv genome (GenBank Ref. NC000962.3) using BWA-mem and by variant calling using the SAMtools/GATK software suites in established pipelines [Citation9,Citation21,Citation22]. A comparison against publicly available sequence data was carried out for all sequenced isolates against 5296 Lineage 2 M. tuberculosis strains with Illumina short-read sequence data available on the European Nucleotide Archive (ENA) until July 2017 (see Supplementary Table S1 for full list of ENA accessions) and, for which SNPs were then called using the same bioinformatics pipeline as above. SNP sites or samples having an excess of 10% missing calls were removed from the analysis [Citation21]. SNP sites within PE/PPE genes or those occurring at low mappability regions were also excluded from the analysis. The final dataset was composed of a total of 5180 isolates and 67,846 SNPs. Sequence data generated in this study is available on ENA under study accession ERP002611.

Genomic clustering was evaluated under both a five and twelve SNP distance threshold using R and the ape package [Citation23]. A maximum-likelihood phylogenetic tree was constructed for the complete dataset using FastTree with a Generalised Time Reversible (GTR) model (Supplementary Figure S1) [Citation24]. Additional maximum-likelihood phylogenetic trees were constructed using PhyML as implemented in Seaview for the genomic clusters found under a maximum distance of twelve SNPs. The phylogenetic context of the non-clustered isolates from Portugal or Guinea-Bissau was reconstructed by comparison with isolates within 100 SNPs. The best-fit nucleotide substitution models for each dataset were evaluated using the R package phangorn. The GTR model allowing across site rate variation and invariable sites was found to be the best-fit model for the GC1 (negative Log-likelihood: 99341.88), GC2 (negative Log-likelihood: 93879.79) and the GW000065 phylogenetic context (negative Log-likelihood: 100664.40); the GTR model allowing for invariable sites were found as the best-fit model for the PT000095 (negative Log-likelihood: 96291.71) and PT000025 (negative Log-likelihood: 94204.24) phylogenetic datasets. Tree visualization and annotation was performed using the Interactive Tree of Life tool [Citation25]. A map showing the geographical dispersion of genomic clusters of interest was constructed using Microreact [Citation26].

Minimum spanning trees (MSTs) were constructed using the goeBURST/Phyloviz software (available at http://online2.phyloviz.net) [Citation27].

Results

Emergence of Beijing family MDR-TB in Portugal and Guinea-Bissau is associated with distinct superclusters

To understand the dynamics of emergence and transmission of MDR-TB associated with the Beijing genotype in Portugal and Guinea-Bissau, we analysed seventeen Beijing isolates from Portugal (n = 9) and Guinea-Bissau (n = 8), all of which recently characterized in the scope of the CPLP-TB database. The public health importance of these strains in both countries is noteworthy as 7/9 and 7/8 of these isolates in Portugal and Guinea-Bissau, respectively, are MDR-TB strains (). However, no links between the Portuguese and Guinean Beijing strains were found except at the 12-loci MIRU-VNTR set (, Supplementary Table S2). In Portugal, the nine Beijing strains yielded seven distinct profiles (two clusters of two isolates, each; and five non-clustered isolates) (). In Guinea-Bissau, five distinct profiles were observed encompassing two MIRU-VNTR clusters with two and three isolates, respectively, and three non-clustered isolates (). Regarding the clustered isolates, a classical epidemiological survey approach failed to find epidemiological links between patients, except for two patients (P16 and P17) which are of two siblings sharing the same household. Interestingly, both patients are Portuguese nationals for which no further TB contacts were identified and, neither had any prior history of travelling outside the country.

Table 1. Beijing strains isolated in Portugal and Guinea-Bissau with available 24-loci MIRU-VNTR profiles.

Except for cases directly linked with immigrants, which is not relevant in the Guinea-Bissau dataset since all patients were of Guinean nationality, the absence of additional transmission clusters with established epidemiological links render unknown the routes by which MDR-TB Beijing strains are being introduced. To identify links with other Beijing clusters, we compared the genotypic profiles of all seventeen clinical isolates included in this study against a global dataset of 5,482 isolates assembled from three large studies [Citation16–18]. The comparison against this dataset yielded a total of 1,848 distinct profiles (Hunter-Gaston Index of Diversity, h = 0.9812). Four cross-border clusters, comprising six out of the nine Portuguese isolates, were identified matching MtbC15-9 type 100-32 (n = 302; 2 Portuguese [PT] isolates); 2083-32 (n = 18; 1 PT isolate); 4737–32 (n = 5; 2 PT isolates); and, 9387–32 (n = 3; 1 PT isolate) (). For Guinea-Bissau strains, two cross-border clusters were detected (n = 5) matching MtbC15-9 type 94-32 (n = 576; 3 Guinea-Bissau [GW] isolates) and 9124-32 (n = 6; 2 GW isolates) ().

Table 2. MIRU-VNTR (24-loci) obtained by comparison against a global Beijing dataset.

WGS enables high-resolution clustering snapshots of cross-border superclusters and the detection of novel clustering between Portugal and Guinea-Bissau

To obtain a more resolved snapshot, we sequenced the genomes of 14 clinical isolates (Guinea-Bissau, n = 8; Portugal, n = 6) and compared them to 5,296 Lineage 2 strains available on ENA until July 2017 (Supplementary Table S1). The final dataset, after removal of low coverage samples and those having an excess of missing calls was composed of 5,181 isolates (including M. tuberculosis H37Rv as an outgroup strain) and 67,846 SNP sites. The global phylogenetic tree obtained for the entire dataset along with the examination of drug resistance genotypes showed a wide distribution and a high predominance of isolates showing a mutational profile compatible with MDR-TB (n = 1,447) and XDR-TB (n = 155) (Supplementary Figure S1). Moreover, the overall tree structure is consistent with multiple emergence of drug resistance and closely grouped clusters of MDR-TB.

The genomic clustering analysis using a maximum distance of 12 SNPs [Citation23] enabled the identification of two genomic clusters (GC1 and GC2) associated with nine out of the 14 initially sequenced strains (). GC2 (partially depicted in see Supplementary Figure S2 for the full MST) encompasses a total of 39 isolates including seven isolates sequenced in this study, all which from Guinea-Bissau. Interestingly, this cluster also comprises two additional isolates from Guinea-Bissau and another from Portugal (). Further cluster discrimination using a five SNP threshold showed that all isolates from Guinea-Bissau herein studied plus two isolates also from Guinea-Bissau, one from Portugal and one from Brazil formed a more restricted genomic cluster (GC2.1) that comprised a monophyletic sub-branch within GC2 that is parallel to other sub-branches that encompass strains circulating mostly in Russia and on other neighbouring countries such as Georgia, Moldova or Azerbaijan. The topological structure of the tree is therefore suggestive of dissemination from these regions towards Guinea-Bissau, Portugal and Brazil. All GC2 harboured the sigE silent mutation at codon 98 (genome position 1364706, G > A), associated with type 94-32, which suggests possible descent from this cluster with concomitant divergence at the MIRU-VNTR loci () [Citation28]. More specifically, all GC2 strains also bore the intergenic specific SNPs for the Central Asia Outbreak specific sub-branch of the 94-32 type [Citation16].

Figure 1. Genomic Clusters (GC) 1 and 2 comprising isolates characterized in the present study and herein defined as strains/nodes within 12 SNPs of distance between each other. Each GC is here partially represented as a Minimum Spanning Tree (MST) as to highlight nodes/strains closer to the analysed Beijing clinical isolates from Portugal and Guinea-Bissau (see Supplementary Figure S2 for full MSTs). Lines connecting each node represent a link of ≤12 SNPs where each dot represents the genetic distance corresponding to one SNP; lines depicted in black and grey represent distances ranging between 1–5 and 7–12 SNPs, respectively. GC1 comprises 121 isolates (displayed in a truncated form that highlights the positioning of Beijing strains isolated in Portugal) and includes isolates PT000078, PT000080 (both representing a known transmission case among the same household and involving two siblings, MtbC15-9 type 4737-32) and PT000013. PT000078 and PT000080 whose patients had no history of travel and of known TB contacts, were found to be in proximity with an imprisoned immigrant from Eastern Europe (ERR1034819), which is eight SNPs apart from another undetected immigrant from Eastern Europe in Portugal (ERR1034838). PT000013 pertains to a Moldovan immigrant in Portugal and is herein linked to an isolate originating from Moldova, seven years after this case has been detected in Portugal and is 3–4 SNPs apart of cases detected in Tajikistan. GC1 thus highlight the epidemiological impact and emergence of MDR-TB strains belonging to this GC that appear to originate from a complex transmission network that mainly spans across former Soviet states as well as its successful spread outside its endemic settings. GC2 depicts a cross-border cluster of previously unlinked cases between Portugal, Guinea-Bissau and Brazil that are also linked (≤12 SNPs) with isolates from former Soviet states. Overall, the high resolution offered by whole genome sequencing enabled the identification of previously epidemiologically unlinked cases and the identification of new routes for the spread of MDR-TB Beijing strains.

Figure 1. Genomic Clusters (GC) 1 and 2 comprising isolates characterized in the present study and herein defined as strains/nodes within 12 SNPs of distance between each other. Each GC is here partially represented as a Minimum Spanning Tree (MST) as to highlight nodes/strains closer to the analysed Beijing clinical isolates from Portugal and Guinea-Bissau (see Supplementary Figure S2 for full MSTs). Lines connecting each node represent a link of ≤12 SNPs where each dot represents the genetic distance corresponding to one SNP; lines depicted in black and grey represent distances ranging between 1–5 and 7–12 SNPs, respectively. GC1 comprises 121 isolates (displayed in a truncated form that highlights the positioning of Beijing strains isolated in Portugal) and includes isolates PT000078, PT000080 (both representing a known transmission case among the same household and involving two siblings, MtbC15-9 type 4737-32) and PT000013. PT000078 and PT000080 whose patients had no history of travel and of known TB contacts, were found to be in proximity with an imprisoned immigrant from Eastern Europe (ERR1034819), which is eight SNPs apart from another undetected immigrant from Eastern Europe in Portugal (ERR1034838). PT000013 pertains to a Moldovan immigrant in Portugal and is herein linked to an isolate originating from Moldova, seven years after this case has been detected in Portugal and is 3–4 SNPs apart of cases detected in Tajikistan. GC1 thus highlight the epidemiological impact and emergence of MDR-TB strains belonging to this GC that appear to originate from a complex transmission network that mainly spans across former Soviet states as well as its successful spread outside its endemic settings. GC2 depicts a cross-border cluster of previously unlinked cases between Portugal, Guinea-Bissau and Brazil that are also linked (≤12 SNPs) with isolates from former Soviet states. Overall, the high resolution offered by whole genome sequencing enabled the identification of previously epidemiologically unlinked cases and the identification of new routes for the spread of MDR-TB Beijing strains.

Figure 2. Maximum-likelihood based phylogenetic trees for GC1 and GC2encompassing 121 and 39 clinical isolates, respectively. The global phylogenetic tree is shown annotated with: tip colours for GC1 (red) and GC2 (yellow) isolates; a first inner colour strip highlighting isolates sequenced in this study from Portugal (blue) and Guinea-Bissau (green); a second colour strip highlighting genotypic based classification of isolates as susceptible (green), any drug resistance other than MDR-/XDR-TB (yellow), MDR-TB (red) and, XDR-TB (dark red); a third colour strip highlights sub-clusters at the five SNP distance threshold: GC1.1 (light blue), GC1.2 (red) and GC2.1 (yellow); sublineage and country of isolation; most common drug resistance associated mutations; and, presence of kdpD binucleotide deletion marker for the Russian B0/W148 (blue triangles) and the sigE silent mutation at codon 98 associated with type 94-32 (green triangles). A map illustrates the geographical dissemination of GC1 (red) and GC2 (yellow) across Asia, Europe, Africa and South America.

Figure 2. Maximum-likelihood based phylogenetic trees for GC1 and GC2encompassing 121 and 39 clinical isolates, respectively. The global phylogenetic tree is shown annotated with: tip colours for GC1 (red) and GC2 (yellow) isolates; a first inner colour strip highlighting isolates sequenced in this study from Portugal (blue) and Guinea-Bissau (green); a second colour strip highlighting genotypic based classification of isolates as susceptible (green), any drug resistance other than MDR-/XDR-TB (yellow), MDR-TB (red) and, XDR-TB (dark red); a third colour strip highlights sub-clusters at the five SNP distance threshold: GC1.1 (light blue), GC1.2 (red) and GC2.1 (yellow); sublineage and country of isolation; most common drug resistance associated mutations; and, presence of kdpD binucleotide deletion marker for the Russian B0/W148 (blue triangles) and the sigE silent mutation at codon 98 associated with type 94-32 (green triangles). A map illustrates the geographical dissemination of GC1 (red) and GC2 (yellow) across Asia, Europe, Africa and South America.

The largest genome-wide-based cluster (GC1; partially depicted in see Supplementary Figure S2 for the full MST) was comprised of 121 M. tuberculosis Beijing isolates and included three isolates sequenced in this study (PT000078, PT000080 and PT000013). The first two clinical isolates belong to the two siblings (P16 and P17) with unknown TB contacts. Notably, WGS data showed these samples to be extremely close (one SNP difference) to a previously undetected isolate in Portugal three years before. An epidemiological investigation showed that the latter was obtained from an imprisoned Eastern European immigrant. No epidemiological link could be established between the imprisoned patient and the siblings, except for the fact that the husband of one of the siblings worked as a prison guard and was often responsible for transportation of imprisoned individuals; however this prison guard was never diagnosed with TB. The third isolate (PT000013) recovered from a Moldova immigrant in Portugal was also part of this large transmission network and genetically close (≤5 SNPs) to isolates originating from Moldova and Tajikistan. GC1 sub-clustering under a five SNP threshold confirmed the genomic clustering of the siblings with the imprisoned patient with the latter assuming a basal positioning to this monophyletic sub-cluster (GC1.1; ), while the PT000013 isolate was also found to be part of a monophyletic cluster (GC1.2; ) composed of five additional isolates from Tajikistan (n = 4) and Moldova (n = 1). These findings corroborate, on the one hand, that the dissemination of strains leading to PT000078 and PT000080 result from the clonal expansion of MDR-TB Beijing isolates in Portugal but that, on the other hand, strains belonging to this same cluster are thought to have disseminated from former Soviet states towards Eastern Europe and Western Europe, therein represented by Portugal. All strains in this GC were positive for the kdpD binucleotide deletion marker for the Russian B0/W148, part of the MIRU-VNTR 100-32 supercluster () [Citation16,Citation29]. Albeit belonging to MIRU-VNTR type 4737-32, PT00078 and PT00080 also bear this latter genetic marker highlighting its likely descent from 100-32 supercluster. In comparison to GC2, GC1 showed a more widespread distribution across Europe ().

For non-clustered isolates GW000065, PT000095, PT000025 and PT000242, the respective phylogenetic contexts were investigated by examining the topological structure of the phylogenetic trees constructed for isolates within 100 SNPs of these isolates (). PT000242 and PT000025 were part of a sub-branch of sublineage 2.2.1 that is parallel to strains isolated in China and Vietnam. PT000095 is also phylogenetically positioned closer to strains circulating in Eastern hemisphere countries such as Vietnam and Singapore. Lastly, the GW000065 isolate from Guinea-Bissau is part of a sub-branch of sublineage 2.2.1 that appears to be disseminating in Africa and is parallel to other sub-branches mostly found in Vietnam which also suggests dissemination to Africa from the Eastern part of the globe ().

Molecular basis of resistance within GC1 and GC2 reveals multiple evolutionary trajectories towards resistance amplification and XDR-TB

We examined the molecular basis of resistance within the GCs herein detected (). All isolates in GC2 (n = 39) bore the classical katG S315T, rpoB S450L and rpsL K43R mutations associated with isoniazid (INH), rifampicin (RIF) and streptomycin (STR) resistance, respectively; embB M306V mutation was putatively associated with EMB resistance in 37 isolates, 11 of which corresponding to the Guinea-Bissau/Portugal/Brazil sub-branch of GC2 concomitantly showed the embA C-16G mutation; and, a D63A mutation on pncA potentially conferring PZA resistance was found in the latter eleven isolates. Furthermore, a frameshift ethA mutation (182delG) and a thyX promoter mutation (C-16T) were also detected among the same eleven GC2 isolates which, is therefore potentially linked with ETH and PAS resistance, respectively.

Table 3. Mutations detected across drug resistance associated genes in GC1 and GC2 isolates.

Contrarily to GC2, GC1 (n = 121) exhibited a remarkably higher mutational diversity associated with drug resistance. Nevertheless, two mutations were common across all GC1 isolates: rpsL K43R and katG S315T. Considering the three GC1 isolates sequenced in this study, all possessed a rpoB S450L mutation but diverged on the mutational profile of other loci: the siblings P16 and P17 isolates’ both showed embB M306I, rpoB V496A, and a single nucleotide deletion on ethA (887delA); the remaining isolate possessed a pncA D63A mutation. Globally, GC1 isolates showed 98 distinct mutations on drug resistance associated loci, and 4/121 showed mutational profiles compatible with XDR-TB (). Associated with PZA resistance, the pncA gene showed the highest mutational diversity: 29 mutations, two of which occurring on the promoter region ().

Figure 3. Maximum-likelihood based phylogenetic trees illustrating the phylogenetic context surrounding the non-clustered isolates GW000065, PT000095, PT000025 and PT000242. Three phylogenetic trees are shown annotated with: a first inner colour strip highlighting isolates sequenced in this study from Portugal (blue) and Guinea-Bissau (green); a second colour strip highlighting genotypic based classification of isolates as susceptible (green), any drug resistance other than MDR/XDR-TB (yellow), MDR-TB (red) and, XDR-TB (dark red); sublineage and country of isolation; most common drug resistance associated mutations; and, presence of kdpD binucleotide deletion marker for the Russian B0/W148 (blue triangles) and the sigE silent mutation at codon 98 associated with type 94-32 (green triangles).

Figure 3. Maximum-likelihood based phylogenetic trees illustrating the phylogenetic context surrounding the non-clustered isolates GW000065, PT000095, PT000025 and PT000242. Three phylogenetic trees are shown annotated with: a first inner colour strip highlighting isolates sequenced in this study from Portugal (blue) and Guinea-Bissau (green); a second colour strip highlighting genotypic based classification of isolates as susceptible (green), any drug resistance other than MDR/XDR-TB (yellow), MDR-TB (red) and, XDR-TB (dark red); sublineage and country of isolation; most common drug resistance associated mutations; and, presence of kdpD binucleotide deletion marker for the Russian B0/W148 (blue triangles) and the sigE silent mutation at codon 98 associated with type 94-32 (green triangles).

Comparing the mutations found across GC1 and GC2 isolates to the complete Beijing dataset assembled for this study (Supplementary Table S3) it is noteworthy that: (i) the ethA 182delG and 887delA in eleven GC2 and in six GC1 isolates, respectively, were not found in any other Beijing isolates; (ii) the rpoB V496A mutation was detected only in the GC1 sub-branch encompassing PT000078 and PT000080; and (iii) the pncA D63A was only detected in 17 isolates across the entire dataset, eleven of which from GC2 and six from GC1. In contrast, rpoB S450L, katG S315T and rpsL K43R mutations were highly prevalent as these were detected in 1246 (77.8%), 1,392 (86.9%) and 778 (48.6%) MDR isolates, respectively, and thereby confirming these as the most prevalent mutations in drug resistant isolates, particularly in former Soviet states [Citation16]. Moreover, the thyX C-16T and embA C-16G mutation were well represented in eleven GC2 (28.2%) isolates whereas across MDR-TB and in the entire dataset these mutations were detected only in 62 (3.8%) MDR-TB isolates (total: 72 [1.4%] across all isolates) for the thyX C-16T mutation and, in 15 (0.9%) MDR-TB isolates (total: 17 [0.3%] across all isolates) for the embA C-16G mutation.

Discussion

The increasing rates of MDR-TB and XDR-TB are a major roadblock to the WHO TB Elimination goals. Despite having the M. tuberculosis genetic diversity characterized to different extents, Portugal and Guinea-Bissau present different healthcare systems and challenges regarding TB control. A common challenge pertains the introduction and transmission of non-endemic strains such as the Beijing family with no study previously attempting to establish epidemiological links between these countries and investigate the routes by which these strains may be introduced in the current epidemiological scenario of each country.

Most Beijing strains detected in Portugal and Guinea-Bissau were in fact not clustered using classical typing methods. This latter observation suggests recent emergence of these strains, particularly in Portugal. A higher degree of clustered Beijing isolates was observed for Guinea-Bissau therefore suggesting ongoing transmission of Beijing strains but the route by which these strains were imported into Guinea-Bissau is unclear as all patients were of Guinean nationality. It is known that the Beijing is the predominant genotype across Central, Eastern, South-Eastern Asia and North Eurasia and, that it has been detected in Northern, Eastern and Southern Africa [Citation10,Citation15]. This data suggests that MDR-TB Beijing strains are now also emerging in West African countries such as Guinea-Bissau.

To understand and place the origin of Beijing isolates in Portugal and Guinea-Bissau in a broader epidemiological context, we compiled a global dataset of 5,482 clinical isolates. Isolates belonging to superclusters 100-32 and 94-32 were detected in Portugal and Guinea-Bissau, respectively. Supercluster 100-32 (more specifically, its major B0/W148 variant) is highly associated with MDR-TB in Russia [Citation29] and is often isolated from immigrants in Europe, whereas supercluster 94-32 (Central Asian/Russian clone), second to 100-32 in Russia and less associated with MDR-TB, is more prevalent in former Soviet Central Asia [Citation17]. Beyond these, the presence of types 2083-32, 4737-32 and 9387-32 in Portugal suggests other possible sources of Beijing strains. The presence of type 9124-32 in Guinea-Bissau is also suggestive of dissemination from Russia or eastern European countries, where it is more prevalent. All clusters but type 9387-32 had been previously associated with drug resistance which stresses the ability of MDR-TB Beijing strains to expand beyond its endemic settings. This initial analysis based on cluster dispersion is indicative of different possible sources for the Beijing strains in these settings. One possible link between Guinea-Bissau and Russia lies in the close bilateral relations that once existed between former Soviet Union and Guinea-Bissau which promoted scholarships for Guinean students and military training in the Soviet Union. However, the main limitation of this MIRU-VNTR based analysis lies in its poor discriminatory power when compared to WGS and poor ability to infer on the directionality of the transmission dynamics. The fact that all MIRU-VNTR clusters were defined based on 24-loci along with a stringent definition of cluster based on completely similar profiles does limit the discordance between methods [Citation30]. Further discriminatory power could have been obtained through the inclusion of 1980, 3232, 3820 and 4120 hypervariable VNTR loci since the standardized 24-loci MIRU-VNTR typing set alone is in many circumstances unable to resolve closely related clones such as the 94-32 and 100-32 [Citation31,Citation32]. While the standard 24-loci set enabled the discriminatory of the Beijing isolates in a non-endemic setting such as Portugal, the inexistence of global datasets with hypervariable allelic also hampers global comparison using this classical typing method.

Nevertheless, WGS did allow for a more resolved scenario in a global context and, enabled the identification of two genomic clusters, herein named GC1 and GC2, that included isolates sequenced in this study. In fact, GC2 encompassed a cross-border clustering event between Portugal, Guinea-Bissau and Brazil, with the number of immigrants and arrivals from Guinea-Bissau to Portugal alerting for a previously unrecognized source of MDR-TB Beijing strains in this country [Citation11]. Moreover, the phylogenetic context of GC2 further demonstrated that emergence of MDR-TB Beijing in Guinea-Bissau is likely driven from strains imported from Eastern Europe or former Soviet countries to Guinea-Bissau and later mobilized to other Portuguese-speaking countries. Interestingly, GC2 is herein shown to be part of the Central Asia Outbreak clade which is a specific sub-branch of the Central Asia clade, more associated with MDR-TB and extensive geographic dissemination beyond its place of origin [Citation33]. Contrarily to GC2, GC1 shows a widespread distribution across Central Asia and Europe and, has been detected in Portugal as a result of two separate and independent dissemination events: (i) one involving four cases, two of which detected in the present study, that given the apparent genetic distances suggest clonal expansion already in Portugal but whose original geographical source is unknown; and (ii) a single case phylogenetically nested within strains isolated in Tajikistan and Moldova that may have spread into Europe via other former Soviet countries. In fact, GC1 appears to be dominated by strains circulating within former Soviet states, comprising a reservoir of MDR-TB Beijing strains emerging in Europe and likely to be part of Clade B/East-Europe2/B0-W148 cluster [Citation6,Citation16,Citation34]. The mutational diversity associated with GC1 corroborates its extensive dissemination and multiple evolutionary trajectories leading to resistance amplification towards XDR-TB from a common ancestor already resistant to INH and STR. This observation is also corroborated by the detection of different XDR-TB and pre-XDR-TB Beijing strains in Spain among sex workers, one of which (BJR1/ERR1665402) herein found to belong to GC1 [Citation35]. While the latter did not disseminate in the community, GC1 secondary cases occurring in Portugal are herein observed and, stress the importance of screening and prompt access to TB care for immigrants [Citation36].

The phylogenetic context of the non-clustered isolates suggests that the three non-clustered isolates from Portugal are representative of strains that appear to circulate mostly in East Asian countries such as China and Vietnam. Also, the same can be said for the single Guinea-Bissau non-clustered isolate although this isolate appears to represent an entire clade disseminating across Africa but more deeply rooted across Asian countries. The migrant population in Guinea-Bissau (n = 1933, according to the last population census in 2009) is mostly comprised of immigrants from Western African countries which can pose as vehicle for the dissemination of these strains but the migration profile of the country is poorly characterized [Citation37]. Portugal on the other hand has observed a steep increase in the foreign population with residency status originating from Asia: from 24,269, in 2007, to 54,508 legal residents in 2017 [Citation38]. The emergence of non-clustered Beijing strains in Portugal can therefore result from the increasing international migratory flow from high-prevalence settings for Beijing strains other than Eastern European countries.

In conclusion, the present study corroborates the emergence of MDR-TB Beijing strains in Portugal and Guinea-Bissau linked with former Soviet Countries, as well as previously unidentified cross-border clustering between Guinea-Bissau and Portugal concerning the epidemiology of MDR-TB Beijing strains. The latter is of special concern as the Lusophone migratory system may constitute an additional gateway for intercontinental dissemination of MDR-TB Beijing strains to countries such as Brazil or Angola, that appear to have a very low prevalence of Beijing strains. Also, we stress the role of WGS as a high-resolution typing tool that can assist the tracking of international relevant clones and its transmission dynamics.

Supplemental material

Supplementary_TableS1.pdf

Download ()

Acknowledgments

Financial support was provided by the European Society of Clinical Microbiology and Infectious Diseases, for which we would like to would like to acknowledge the Study Group for Mycobacterial Infections. This study was supported in part by UID/DTP/04138/2019 from Fundação para a Ciência e Tecnologia (FCT), Portugal. DM, IC and MV work was funded by project PTDC/BIA-MIC/30692/2017 and the Global Health and Tropical Medicine (GHTM) Research Center (Grant UID/Multi/04413/2013) from FCT. JP [CEECIND/00394/2017] and DM are supported by FCT through Estímulo Individual ao Emprego Científico. AP was supported by a faculty baseline funding from KAUST [BAS/1/1020-01-01]. IM was supported by Russian Science Foundation (grant 19-14-00013). DM, IC and MV are thankful to projects “ForDILAB-TB” and “A implementação de um novo método de identificação rápida do complexo M. tuberculosis nos Laboratórios de Referência da Tuberculose de Maputo e Beira” from Fundação Calouste Gulbenkian and the Community of the Portuguese Speaking Countries (CPLP). TGC received funding from the MRC UK (Grant no. MR/K000551/1, MR/M01360X/1, MR/N010469/1, MR/R020973/1, MR/R025576/1, MR/S01988X/1, MR/S03563X/1) and BBSRC UK (BB/R013063/1). SC and MH received funding from the Medical Research Council UK grants (MR/R020973/1, MR/R025576/1, MR/S01988X/1, MR/S03563X/1) and the BBSRC UK (BB/R013063/1). JPh is funded by the Medical Research Council UK grants (MR/S03563X/1).

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by the Biotechnology and Biological Sciences Research Council [grant number BB/R013063/1]; Fundação para a Ciência e a Tecnologia [grant number UID/DTP/04138/2019,UID/Multi/04413/2013]; Medical Research Council [grant numbers MR/K000551/1, MR/M01360X/1, MR/N010469/1, MR/R0209]; Russian Science Foundation [grant number 19-14-00013].

References

  • van Soolingen D, Qian L, de Haas PE, et al. Predominance of a single genotype of Mycobacterium tuberculosis in countries of east Asia. J Clin Microbiol. 1995;33(12):3234–3238.
  • Bifani PJ, Plikaytis BB, Kapur V, et al. Origin and interstate spread of a New York City multidrug-resistant Mycobacterium tuberculosis clone family. JAMA. 1996;275(6):452–457.
  • Gurjav U, Erkhembayar B, Burneebaatar B, et al. Transmission of multi-drug resistant tuberculosis in Mongolia is driven by Beijing strains of Mycobacterium tuberculosis resistant to all first-line drugs. Tuberculosis (Edinb). 2016;101:49–53.
  • Marais BJ, Mlambo CK, Rastogi N, et al. Epidemic spread of multidrug-resistant tuberculosis in Johannesburg, South Africa. J Clin Microbiol. 2013;51(6):1818–1825.
  • Mokrousov I. Genetic geography of Mycobacterium tuberculosis Beijing genotype: a multifacet mirror of human history? Infect Genet Evol. 2008;8(6):777–785.
  • Luo T, Comas I, Luo D, et al. Southern East Asian origin and coexpansion of Mycobacterium tuberculosis Beijing family with Han Chinese. Proc Natl Acad Sci U S A. 2015;112(26):8136–8141.
  • Ribeiro SC, Gomes LL, Amaral EP, et al. Mycobacterium tuberculosis strains of the modern sublineage of the Beijing family are more likely to display increased virulence than strains of the ancient sublineage. J Clin Microbiol. 2014;52(7):2615–2624.
  • van Laarhoven A, Mandemakers JJ, Kleinnijenhuis J, et al. Low induction of proinflammatory cytokines parallels evolutionary success of modern strains within the Mycobacterium tuberculosis Beijing genotype. Infect Immun. 2013;81(10):3750–3756.
  • Perdigao J, Silva H, Machado D, et al. Unraveling Mycobacterium tuberculosis genomic diversity and evolution in Lisbon, Portugal, a highly drug resistant setting. BMC Genomics. 2014;15(991).
  • Demay C, Liens B, Burguiere T, et al. SITVITWEB – a publicly available international multimarker database for studying Mycobacterium tuberculosis genetic diversity and molecular epidemiology. Infect Genet Evol. 2012;12(4):755–766.
  • Perdigao J, Silva C, Diniz J, et al. Clonal expansion across the seas as seen through CPLP-TB database: A joint effort in cataloguing Mycobacterium tuberculosis genetic diversity in Portuguese-speaking countries. Infect Genet Evol. 2019;72:44–58.
  • Rabna P, Ramos J, Ponce G, et al. Direct detection by the Xpert MTB/RIF Assay and Characterization of Multi and Poly drug-resistant tuberculosis in Guinea-Bissau, West Africa. PLoS ONE. 2015;10(5):e0127536.
  • Supply P, Allix C, Lesjean S, et al. Proposal for standardization of optimized mycobacterial interspersed repetitive unit-variable-number tandem repeat typing of Mycobacterium tuberculosis. J Clin Microbiol. 2006;44(12):4498–4510.
  • Couvin D, Rastogi N. Tuberculosis - A global emergency: Tools and methods to monitor, understand, and control the epidemic with specific example of the Beijing lineage. Tuberculosis (Edinb). 2015;95(Suppl 1):S177–S189.
  • Couvin D, David A, Zozio T, et al. Macro-geographical specificities of the prevailing tuberculosis epidemic as seen through SITVIT2, an updated version of the Mycobacterium tuberculosis genotyping database. Infect Genet Evol. 2019;72:31–43.
  • Merker M, Blin C, Mona S, et al. Evolutionary history and global spread of the Mycobacterium tuberculosis Beijing lineage. Nat Genet. 2015;47(3):242–249.
  • Skiba Y, Mokrousov I, Ismagulova G, et al. Molecular snapshot of Mycobacterium tuberculosis population in Kazakhstan: a country-wide study. Tuberculosis (Edinb). 2015;95(5):538–546.
  • Yin QQ, Liu HC, Jiao WW, et al. Evolutionary history and ongoing transmission of phylogenetic Sublineages of Mycobacterium tuberculosis Beijing genotype in China. Sci Rep. 2016;6(34353).
  • Allix-Beguec C, Harmsen D, Weniger T, et al. Evaluation and strategy for use of MIRU-VNTRplus, a multifunctional database for online analysis of genotyping data and phylogenetic identification of Mycobacterium tuberculosis complex isolates. J Clin Microbiol. 2008;46(8):2692–2699.
  • Hunter PR, Gaston MA. Numerical index of the discriminatory ability of typing systems: an application of Simpson’s index of diversity. J Clin Microbiol. 1988;26(11):2465–2466.
  • Coll F, Phelan J, Hill-Cawthorne GA, et al. Genome-wide analysis of multi- and extensively drug-resistant Mycobacterium tuberculosis. Nat Genet. 2018;50(2):307–316.
  • Coll F, McNerney R, Guerra-Assuncao JA, et al. A robust SNP barcode for typing Mycobacterium tuberculosis complex strains. Nat Commun. 2014;5:4812.
  • Walker TM, Ip CL, Harrell RH, et al. Whole-genome sequencing to delineate Mycobacterium tuberculosis outbreaks: a retrospective observational study. Lancet Infect Dis. 2013;13(2):137–146.
  • Price MN, Dehal PS, Arkin AP. FastTree 2 – approximately maximum-likelihood trees for large alignments. PLoS ONE. 2010;5(3):e9490.
  • Letunic I, Bork P. Interactive tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019;47(W1):W256–W2W9.
  • Argimon S, Abudahab K, Goater RJE, et al. Microreact: visualizing and sharing data for genomic epidemiology and phylogeography. Microb Genom. 2016;2(11):e000093.
  • Nascimento M, Sousa A, Ramirez M, et al. PHYLOViz 2.0: providing scalable data integration and visualization for multiple phylogenetic inference methods. Bioinformatics. 2017;33(1):128–129.
  • Mokrousov I, Chernyaeva E, Vyazovaya A, et al. Rapid Assay for detection of the epidemiologically Important Central Asian/Russian strain of the Mycobacterium tuberculosis Beijing genotype. J Clin Microbiol. 2018;56(2):e01551–17.
  • Mokrousov I, Narvskaya O, Vyazovaya A, et al. Russian “successful” clone B0/W148 of Mycobacterium tuberculosis Beijing genotype: a multiplex PCR assay for rapid detection and global screening. J Clin Microbiol. 2012;50(11):3757–3759.
  • Nikolayevskyy V, Kranzer K, Niemann S, et al. Whole genome sequencing of Mycobacterium tuberculosis for detection of recent transmission and tracing outbreaks: A systematic review. Tuberculosis (Edinb). 2016;98:77–85.
  • Allix-Beguec C, Wahl C, Hanekom M, et al. Proposal of a consensus set of hypervariable mycobacterial interspersed repetitive-unit-variable-number tandem-repeat loci for subtyping of Mycobacterium tuberculosis Beijing isolates. J Clin Microbiol. 2014;52(1):164–172.
  • Murase Y, Izumi K, Ohkado A, et al. Prediction of Local transmission of Mycobacterium tuberculosis isolates of a predominantly beijing lineage by use of a variable-number tandem-repeat typing method incorporating a consensus set of hypervariable loci. J Clin Microbiol. 2018;56(1):e01016–17.
  • Shitikov E, Vyazovaya A, Malakhova M, et al. Simple Assay for detection of the Central Asia Outbreak clade of the Mycobacterium tuberculosis Beijing genotype. J Clin Microbiol. 2019;57(7):e00215–19.
  • Casali N, Nikolayevskyy V, Balabanova Y, et al. Microevolution of extensively drug-resistant tuberculosis in Russia. Genome Res. 2012;22(4):735–745.
  • Perez-Lago L, Martinez-Lirola M, Garcia S, et al. Urgent Implementation in a Hospital setting of a Strategy To Rule Out secondary cases Caused by imported extensively drug-resistant Mycobacterium tuberculosis strains at Diagnosis. J Clin Microbiol. 2016;54(12):2969–2974.
  • Pereira C, Gomes P, Taveira R, et al. Insights on the Mycobacterium tuberculosis population structure associated with migrants from Portuguese-speaking countries over a three-year period in Greater Lisbon, Portugal: Implications at the public health level. Infect Genet Evol. 2019;71:159–165.
  • Instituto Nacional de Estatística. Recenseamento Geral da População e Habitação – Guiné Bissau. Bissau: Instituto Nacional de Estatística; 2009.
  • Gabinete de Estratégia e Economia. População Estrangeira Residente em Portugal. Lisboa: Ministério da Economia; 2018.