Original Research Articles

Examining DNA fingerprinting as an epidemiology tool in the tuberculosis program in the Northwest Territories, Canada

Article: 20067 | Received 11 Nov 2012, Accepted 17 Mar 2013, Published online: 08 May 2013

Abstract

Background

Tuberculosis (TB) is an important public health problem in the Northwest Territories (NWT), particularly among Canadian Aboriginal people.

Objective

To analyse the transmission patterns of tuberculosis among the population living in the NWT, a territorial jurisdiction located within Northern Canada.

Methods

This population-based retrospective study examined the DNA fingerprints of all laboratory-confirmed cases of TB in the NWT, Canada, between 1990 and 2009. An isolate from each lab-confirmed case was genotyped using IS6110 restriction fragment length polymorphism (RFLP). A pattern was assigned to each DNA fingerprint, and isolates with indistinguishable fingerprint patterns were assigned to a cluster. Social network analysis (SNA) was used to examine direct linkages among cases determined through conventional contact tracing (CCT), their DNA fingerprint and home community.

Results

Of the 225 lab-confirmed cases identified, the study was limited to 195 subjects due to DNA fingerprinting data availability. The mean age of the cases was 43.8 years (±22.6) and 120 (61.5%) were male. The Dene (First Nations) accounted for 171 of the cases (87.7%); 8 cases (4.1%) were Inuit, 2 cases (1.0%) were Metis, 7 cases (3.6%) were Immigrants, 6 cases (3.1%) were non-Aboriginal and 1 case had unknown ethnicity. One hundred and eighty-six (95.4%) subjects were clustered, resulting in 8 clusters. Trend analysis showed significant relationships between DNA fingerprint clustering and the risk factors of unemployment (p=0.020), geographic location (p≤0.001) and homelessness (p≤0.001). Other significant risk factors included excessive alcohol consumption, prior infection with Mycobacterium tuberculosis and prior contact with a case of TB.

Conclusions

This study demonstrates how DNA fingerprinting and SNA can be additional epidemiological tools, along with CCT method, to determine transmission patterns of TB.

Tuberculosis (TB) is an important public health problem in the Northwest Territories (NWT), particularly among Canadian Aboriginal people. TB was first reported in the NWT by the early missionaries in the later years of the 19th century (Citation1). TB was epidemic in the early 1940s in the NWT with a reported 42 deaths per 10,000 population (Citation1). TB continues to be an endemic disease among the Aboriginal population (Dene, Inuit and Metis) who comprise roughly half of the NWT population.

Despite effective antibiotic treatment, standardised clinical management programs and rigorous contact tracing, the rate of TB in the NWT averages 20 cases per 100,000 population (Citation2), 4 times the national rate (Citation3). Outbreaks have been reported among populations living in remote communities throughout the NWT. The TB rates among the Aboriginal population are twice the overall NWT rate (Citation2). Continued transmission of this disease can be attributed to late identification of a respiratory case of TB resulting in subsequent progression of the disease to an infectious advanced stage allowing high amounts of M. tuberculosis in the respiratory tract to be expelled into the air. Conventional contact tracing (CCT) remains an important method to stop the chain of transmission of TB in the NWT.

DNA fingerprinting is a tool that can be used to evaluate gaps in the CCT method and determine clonal relatedness of M. tuberculosis isolates (Citation4–Citation6). A case and the contacts they infect share an indistinguishable DNA fingerprint. Contact tracing investigations are significantly enhanced if TB cases share an indistinguishable DNA fingerprint in addition to the traditional epidemiological links determined through CCT.

Another useful approach is social network analysis (SNA), a mathematical tool that includes visualisation of people and places and the connections between them (Citation7–Citation9). Because of the lengthy latency period of TB and its airborne mode of transmission, CCT may not capture all of the contacts. SNA has been used to determine socialising patterns by directing focus on locations and activities contributing to potential transmission (Citation10).

The objective of this study was to better understand the transmission patterns of tuberculosis among the Northern Canadian population living in the NWT.

The aims of this study were to determine: (a) whether unknown transmission among the studied cases not previously identified through CCT can be identified by examination of DNA fingerprinting patterns; and (b) whether specific TB risk factors related to demographics, social and behavioural risk factors, and clinical aspects are associated with DNA fingerprinting patterns.

Materials and methods

We conducted a 20-year retrospective population-based study examining DNA fingerprinting patterns of isolates from reported NWT TB cases between January 1990 and December 2009 matched to the epidemiological and demographic data. DNA fingerprinting analysis of each M. tuberculosis isolate corresponded to a single TB case reported during the study period.

Epidemiological data

Demographic and epidemiologic data were obtained from medical records of all patients diagnosed with TB at the Office of the Chief Public Health Officer (OCPHO). All data were collected by staff at the OCPHO and stored in hard copy and electronic copy in the integrated Public Health Information system (iPHIS), a web-based data management application.

Demographic, social and behavioural risk factors, and clinical aspects included: age, gender, ethnicity, employment status, amount of alcohol consumption, illicit drug use, smoking, homeless status, HIV and past TB exposure history including prior contact with an active TB case and previous latent tuberculosis infection (LTBI).

DNA fingerprint analysis

Molecular typing of the NWT M. tuberculosis isolates has been a routine procedure at the Provincial Laboratory for Public Health (ProvLab), Alberta Health Services, since before the onset of this study. The ProvLab uses an international standardised protocol for IS6110 restriction fragment length polymorphism [IS6110-RFLP] (Citation11). Images of the IS6110-RFLP patterns were digitised and stored in databases managed using the BioNumerics software (version 5.1; Applied Maths, USA). RFLP fingerprint pattern numbers were assigned to each isolate, and cluster analysis was performed with BioNumerics. Dendrograms were generated in BioNumerics using the unweighted pair group method with arithmetic mean (UPGMA), a Dice similarity coefficient, an additional 1.0% similarity coefficient and 1.5% optimisation.
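To make the dendrogram construction more concrete, the following is a minimal Python sketch of a Dice coefficient between two band patterns and a naive UPGMA merge. It is not the BioNumerics implementation: the function names, the relative `tolerance` used for band matching, the greedy matching rule and the fragment sizes are all illustrative assumptions.

```python
from itertools import combinations


def dice_similarity(bands_a, bands_b, tolerance=0.01):
    """Dice coefficient 2*m / (n_a + n_b) for two lists of band sizes.

    Two bands match when their sizes differ by at most `tolerance`
    (relative). The tolerance value is an assumption for illustration.
    """
    unmatched = sorted(bands_b)
    matches = 0
    for band in sorted(bands_a):
        for other in unmatched:
            if abs(band - other) <= tolerance * max(band, other):
                unmatched.remove(other)  # each band is matched at most once
                matches += 1
                break
    total = len(bands_a) + len(bands_b)
    return 2.0 * matches / total if total else 0.0


def upgma(patterns):
    """Naive UPGMA: repeatedly merge the two groups with the highest
    average pairwise Dice similarity. Returns the merge history as
    (members_1, members_2, similarity) tuples."""
    names = list(patterns)
    sim = {frozenset(pair): dice_similarity(patterns[pair[0]], patterns[pair[1]])
           for pair in combinations(names, 2)}
    groups = [(name,) for name in names]
    history = []

    def average(g1, g2):
        # Arithmetic mean over all cross-group isolate pairs.
        total = sum(sim[frozenset((a, b))] for a in g1 for b in g2)
        return total / (len(g1) * len(g2))

    while len(groups) > 1:
        g1, g2 = max(combinations(groups, 2), key=lambda pair: average(*pair))
        history.append((g1, g2, average(g1, g2)))
        groups = [g for g in groups if g not in (g1, g2)] + [g1 + g2]
    return history


# Hypothetical fragment sizes (kb) for three isolates, not study data.
example_patterns = {
    "iso1": [1.2, 2.5, 3.1, 4.0, 5.6, 7.2],
    "iso2": [1.2, 2.5, 3.1, 4.0, 5.6, 7.2],
    "iso3": [1.2, 2.5, 3.3, 4.0, 6.0, 8.5],
}
example_history = upgma(example_patterns)
```

Each entry of `example_history` corresponds to one node of a dendrogram, joined at the stated similarity. For real data, an established clustering library would be a safer choice than this quadratic-time sketch.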

Definition of clustering

A cluster of M. tuberculosis isolates was defined as isolates with the same number of IS6110 copies (greater than 5), with IS6110 fragments of identical molecular weight and greater than 85% band agreement, within the timeframe of 1990–2009.
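One possible reading of this definition can be written as a pairwise check. The sketch below is an assumption about how the criteria combine (equal copy number, more than 5 copies, band agreement above 85%); the function names, the matching `tolerance` and the example band sizes are hypothetical and do not come from the laboratory protocol.

```python
def band_agreement(bands_a, bands_b, tolerance=0.01):
    """Fraction of bands (relative to the longer pattern) that have a
    partner of matching size in the other pattern."""
    remaining = list(bands_b)
    matched = 0
    for band in bands_a:
        partner = next(
            (b for b in remaining if abs(band - b) <= tolerance * max(band, b)),
            None,
        )
        if partner is not None:
            remaining.remove(partner)  # a band can be paired only once
            matched += 1
    longest = max(len(bands_a), len(bands_b))
    return matched / longest if longest else 0.0


def same_cluster(bands_a, bands_b, min_copies=6, min_agreement=0.85):
    """True when both patterns have the same copy number, more than
    5 copies, and band agreement above 85% (assumed interpretation)."""
    return (
        len(bands_a) == len(bands_b)
        and len(bands_a) >= min_copies
        and band_agreement(bands_a, bands_b) > min_agreement
    )


# Hypothetical 8-band patterns (kb), not study data.
reference = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5]
one_band_differs = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 5.5]
two_bands_differ = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 6.0, 7.0]
```

With these inputs, `reference` and `one_band_differs` agree on 7 of 8 bands and pass the threshold, while `two_bands_differ` agrees on 6 of 8 and does not.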

Statistical analysis

Data were analysed using the Statistical Package for the Social Sciences software version 17.0 (SPSS Inc., Chicago IL). For univariate analysis, the potential TB risk factors of each case were examined by grouping the genotype of the matched isolate as belonging to a DNA fingerprint cluster or not clustered (unique). Each risk variable (demographic, social and behavioural risks and clinical aspects) was compared against the outcome variable of DNA fingerprint cluster grouping. Bivariate analysis was used to test association using the Chi-squared test or Fisher's exact test. P values <0.05 were considered statistically significant. Statistical power was increased by using three outcome groups: each of the 2 dominant DNA clusters, and a third group combining the cases belonging to other clusters with the unique DNA fingerprints.
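For readers who want to see the two bivariate tests spelled out, here is a minimal Python sketch for a 2×2 table (risk factor present or absent by cluster grouping). It stands in for the SPSS procedures rather than reproducing them, applies no continuity correction, and uses hypothetical counts that are not taken from the study.

```python
from math import comb, erfc, sqrt


def chi_squared_2x2(a, b, c, d):
    """Pearson chi-squared (no continuity correction) for [[a, b], [c, d]].

    Returns (statistic, p_value). With 1 degree of freedom the p-value
    equals erfc(sqrt(statistic / 2)).
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("a row or column total is zero")
    stat = n * (a * d - b * c) ** 2 / denom
    return stat, erfc(sqrt(stat / 2))


def fisher_exact_2x2(a, b, c, d):
    """Two-sided Fisher's exact test: sum the hypergeometric probabilities
    of all tables with the same margins that are no more likely than the
    observed table."""
    row1, row2, col1 = a + b, c + d, a + c
    n = row1 + row2

    def prob(x):
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    observed = prob(a)
    smallest, largest = max(0, col1 - row2), min(row1, col1)
    return sum(
        p
        for p in (prob(x) for x in range(smallest, largest + 1))
        if p <= observed * (1 + 1e-9)  # tolerance for floating-point ties
    )


# Hypothetical counts: risk factor present/absent in two groupings.
stat, p_chi = chi_squared_2x2(10, 40, 2, 48)
p_fisher = fisher_exact_2x2(10, 40, 2, 48)
```

Fisher's exact test is usually preferred when expected cell counts are small, which is likely for the less common risk factors in a data set of this size.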

Social network analysis

SNA permitted the visualisation of patterns or connections between cases and communities, focused on the 2 dominant DNA fingerprint clusters. SNA was used as a tool to examine TB transmission within a population through person-to-person and person-to-place mapping and by showing recent transmission. Recent transmission was defined as 2 linked cases reported within 2 years of each other. Known exposure, based on the CCT records of each case, was examined through the iPHIS database. The system allowed each case to be cross-referenced with reported contact to other cases. PAJEK (Citation12), an SNA application, was used to visualise the network and measure the connections between cases and communities. Both methods, SNA and CCT, can detect evidence of transmission, but depending on the socialisation patterns of the case(s) being studied, one or both methods may provide more conclusive findings of transmission patterns (Citation7).
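The network construction described here can be sketched in a few lines of Python. The example below labels case-to-case links as recent or not using the 2-year rule and writes the case and community nodes in Pajek's plain-text `.net` layout. The function names, the edge weights and the three example cases are hypothetical; the actual export used in the study is not described in the text.

```python
def classify_links(report_year, contacts, window_years=2):
    """Label each case-to-case contact 'recent' when the two cases were
    reported within `window_years` of each other, otherwise 'remote'."""
    labelled = {}
    for a, b in contacts:
        gap = abs(report_year[a] - report_year[b])
        labelled[(a, b)] = "recent" if gap <= window_years else "remote"
    return labelled


def to_pajek(case_community, labelled_links):
    """Serialise cases, communities and their links as Pajek .net text."""
    cases = sorted(case_community)
    communities = sorted(set(case_community.values()))
    labels = [f"case {c}" for c in cases] + [f"community {k}" for k in communities]
    index = {label: i for i, label in enumerate(labels, start=1)}

    lines = [f"*Vertices {len(labels)}"]
    lines += [f'{i} "{label}"' for label, i in index.items()]
    lines.append("*Edges")
    for case in cases:
        case_id = index[f"case {case}"]
        community_id = index[f"community {case_community[case]}"]
        lines.append(f"{case_id} {community_id} 1")
    for (a, b), kind in sorted(labelled_links.items()):
        weight = 2 if kind == "recent" else 1  # heavier line for recent links
        first, second = index[f"case {a}"], index[f"case {b}"]
        lines.append(f"{first} {second} {weight}")
    return "\n".join(lines)


# Hypothetical cases: report year, home community and contact pairs.
example_years = {1: 1995, 2: 1996, 3: 2001}
example_homes = {1: "B", 2: "B", 3: "A"}
example_links = classify_links(example_years, [(1, 2), (2, 3)])
example_net = to_pajek(example_homes, example_links)
```

Encoding recency as an edge weight lets a drawing tool render recent links with heavier lines, in the spirit of the heavy and light lines used in the network figures.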

The research proposal was reviewed and approved by the University of Alaska Anchorage Institutional Review Board and the Aurora Research Institute (Research Licence # 1280, NWT).

Results

Between 1 January 1990 and 31 December 2009, there were 225 laboratory-confirmed cases reported in the NWT. However, the study was limited to 195 subjects because the DNA fingerprint data were not available for 30 of the isolates at the laboratory. Clustering analysis was performed on isolates with IS6110 RFLP data; 95% (186/195) of the cases in this study were grouped into clusters labelled NWT1–NWT8 (Table I).

Table I DNA cluster frequencies

Fig. 1 shows a dendrogram of the clustering analysis of the IS6110 RFLP patterns of the strains in this study.

Fig. 1 Dendrogram of clustering analysis of the IS6110 RFLP patterns of the strains in this study. Dendrogram nodes and associated sub-branches that fit the cluster definition (≥85% pattern similarity) are highlighted in bold. The NWT clusters are represented as follows: • NWT1; * NWT2; ♦ NWT3; ⊙ NWT4; ▪ NWT5; ▴ NWT6; NWT7; NWT8.


The 2 dominant DNA fingerprint clusters were NWT1 and NWT2, which included 40.5% (79/195) and 40.0% (78/195) of the isolates, respectively (Table I). The clusters NWT3 to NWT8 and an additional 9 unique DNA fingerprints (those that did not cluster) were amalgamated into a grouping called “others”. Detailed case characteristics are shown in Table II.
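The grouping arithmetic can be cross-checked with a short Python tally. The totals for NWT1, NWT2, the unique fingerprints and the clustered cases are the figures reported in the text; the size of the “others” grouping (38) and of clusters NWT3 to NWT8 combined (29) are derived here by subtraction and should be read as inferred rather than reported values.

```python
TOTAL_CASES = 195          # cases with DNA fingerprint data
UNIQUE_FINGERPRINTS = 9    # isolates that did not cluster

group_sizes = {"NWT1": 79, "NWT2": 78}
# Everything outside the 2 dominant clusters forms the "others" grouping.
group_sizes["others"] = TOTAL_CASES - sum(group_sizes.values())

clustered_cases = TOTAL_CASES - UNIQUE_FINGERPRINTS
minor_cluster_cases = group_sizes["others"] - UNIQUE_FINGERPRINTS  # NWT3 to NWT8


def percent(count, total=TOTAL_CASES):
    """Percentage of the study cases, rounded to one decimal place."""
    return round(100 * count / total, 1)


shares = {name: percent(size) for name, size in group_sizes.items()}
clustered_share = percent(clustered_cases)
```

The derived “others” size of 38 is consistent with the percentages quoted for that grouping elsewhere in the Results, for example 6 cases reported as 15.8%.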

Table II Description of case's characteristics from DNA clusters and bivariate analysis

Due to the presentation of the majority of isolates meeting the clustering definition, the statistical analysis involved comparing the dominant DNA clusters to one another and each of the dominant clusters with the amalgamated grouping titled, “others”. Comparison among the 3 groupings was a method used to determine whether a significant association of DNA fingerprint clustering existed with the risk factors examined in the study. The ethnicity of the cases was primarily Dene among the 2 most dominant clusters, NWT1 and NWT2 representing 75 (94.9%) and 71 (91.0%) of the cases, respectively. Ethnicity frequency among the “other” DNA fingerprints included a higher proportion of cases representing Inuit and Immigrant populations with 6 (15.8%) cases for each. Incorporating all of the examined cases, 87.7% (171/195) were Dene, followed by 4.1% (8/195) Inuit, 3.6% (7/195) Immigrant, 3.1% (6/195) non-aboriginal, 1.0% (2/195) Métis and 0.5% (1/195) of unknown ethnic group. The mean age of the 3 groupings NWT1, NWT2 and “others” was 42, 44 and 48 years, respectively. Gender was evenly distributed among the NWT1 cluster but was predominately male among the NWT2 and “others” groupings.

Unemployment status varied among the 3 groupings, NWT1, NWT2 and “others”, with 12 (15.2%), 27 (34.6%) and 8 (21.0%), respectively. Children and students were excluded from this analysis, while the employed group included homemakers and retired individuals, on the assumption that these 2 categories did not seek employment. The TB cases originated from 24 of the 33 communities in the NWT. In the overall analysis of all of the 195 TB cases, the 3 communities representing the highest number of cases were Community A with 21.5% (42/195), Community B with 14.4% (28/195) and Community C with 25.1% (49/195). NWT1 cases were predominately in Communities A and B, representing 36.7 and 34.2%, respectively, while the majority of the cases in the NWT2 cluster and “others” grouping were represented in the remaining communities. Due to the low populations in the isolated communities, anonymity of the community name was required in this study.

Harmful alcohol drinking, which included those who had reported frequent heavy drinking or a history of alcohol dependency, was reported in more than 39.5% of cases among the 2 dominant DNA clusters and the “others” grouping. Homelessness was reported among all 3 groupings, with NWT2 having the highest frequency of 16 cases (20.5%).

Regarding clinical aspects, nearly half of the cases grouped in NWT1, NWT2 and “others” had reported evidence of LTBI, indicating that close to half of the cases may have been reactivations. The majority of the cases, averaging 85%, were diagnosed with respiratory TB, and the remainder with non-respiratory TB. Approximately half of the cases had a recorded HIV test, all reported as negative.

Bivariate analysis using the Chi-squared and Fisher's exact tests was used to examine association among the two dominant DNA fingerprint clusters (NWT1 and NWT2) and the remaining DNA fingerprints as “others”. In Table II, the analysis between NWT1 and NWT2 showed significant association among the risk factors of age (p=0.047), community (p=0.001) and homelessness (p=0.003). NWT1 versus “others” DNA fingerprints had significance for ethnicity (p≤0.001), community (p≤0.001) and prior contact with a case (p≤0.001). NWT2 versus “others” DNA fingerprints showed significance for ethnicity (p≤0.001), employment (p=0.020), community (p≤0.001), homelessness (p≤0.001) and prior contact with a case (p<0.001).

SNA was done on the cases without reported records of LTBI among the two dominant DNA clusters, NWT1 and NWT2, representing 47 cases in each cluster. These cases were selected primarily to lessen the possibility of the case having previous exposure to cases not included in this study. As well, the cases with exposure to another case within two years were considered recent transmission. In separate SNA examinations of the two dominant DNA fingerprint clusters, each case was assigned a unique identification number within the DNA cluster and each community was assigned a unique letter; both are referred to as “nodes”. Each case was assigned a colour code for their DNA fingerprint cluster. Fig. 2 demonstrates the relationship between cases and their connections with communities for the NWT1 DNA cluster.

Fig. 2 SNA of NWT1 DNA fingerprint cluster. The yellow circles represent the cases and the red squares represent the communities. The heavy black lines represent cases reported within 2 years, while the lighter lines represent exposure between the cases exceeding 2 years.


Fig. 2 shows the relationship between the 47 cases reported between 1990 and 2004, with their isolate's DNA fingerprint classified as NWT1. The cases were distributed among six communities (A, B, C, E, H and I), all located around the Great Slave Lake area. Note that a few of the cases have multiple heavy black lines, indicating recent transmission among cases. Case “41” was an index case resulting in an outbreak in Community B starting in 1995. Recent contact was reported among cases “5”, “35”, “36”, “39”, “44” and “45”. As well, note the direct link of case “8” between Community A and Community E, cases “7” and “17” to Community B and cases “25” and “27” to Community C. Although many cases were directly linked to one community, the social patterns show spread to other communities.

Seven communities were associated with the distribution of 47 cases matching to isolates grouped in NWT2, shown in Fig. 3. The cases in this figure were reported between 1991 and 2009. NWT2 case numbers do not match with assigned case numbers in NWT1. Case “35” was the index case reported in 2007 from a homeless shelter outbreak in Community C. Twelve cases (“1”, “4”, “9”, “10”, “12”, “15”, “17”, “18”, “25”, “31”, “42” and “47”) had direct transmission reported within the 2-year criterion and were linked to 5 communities (Communities A, C, D, F and G).

Fig. 3 SNA of NWT2 DNA fingerprint cluster. The green circles represent the cases and the red squares represent the communities. The heavy black lines represent cases reported within 2 years, while the lighter lines represent exposure between the cases exceeding 2 years.


Discussion

Early case detection and timely completion of treatment are the most important measures to stop the spread of TB in a community. CCT focuses on the concentric model for determining risk of contracting TB, where household members are usually considered at the highest risk of acquiring the infection (Citation13). The rationale for investigating contacts of a TB patient is that the infection can be spread through airborne droplet nuclei containing M. tuberculosis (Citation13). The identification and differentiation of the strains of M. tuberculosis by IS6110-RFLP has provided a better understanding of the epidemiology of the transmission of TB in the NWT. Although this study did not determine the direction of transmission among cases, it was able to determine associations of indistinguishable DNA fingerprints or clustering with some risk factors such as age, ethnicity, unemployment, excessive alcohol consumption, geographic location, homelessness and previous exposure to TB cases. This study does identify one unknown cluster, NWT8 consisting of 2 cases with indistinguishable DNA fingerprints, not identified through CCT.

The most important outcome of this study is the development of a database of DNA fingerprint patterns on all culture confirmed cases of TB in the NWT for the last 20 years. The DNA fingerprint registry will be invaluable in prospective analysis of outbreaks to assist with linking to known outbreaks and determining new ones. TB is a disease often associated with marginalised populations. In this study, among the 195 cases, over 90% of the cases were of Aboriginal ethnicity, 24.1% unemployed, 46.7% excessive alcohol consumers, 32.8% illicit drug users and 9.7% declared as homeless at some time during the progression of disease and treatment.

In this study, a large proportion of the cases' isolates belonged to a cluster, 186/195 (95.4%). In contrast, a 2-year study among cases of TB in the Canadian provinces of British Columbia, Alberta, Saskatchewan and Manitoba only had 32.1% of their cases grouped into clusters (Citation14). The remaining 67.9% were unique DNA fingerprints. The DNA fingerprinting homogeneity identified in this study suggests 2 things: first, the population is fairly non-transient in the NWT, meaning the circulating strains of M. tuberculosis are limited and endemic, and/or second, there is a high amount of transmissibility among cases in the NWT with endemic strains. The SNA demonstrates that both are feasible explanations.

SNA demonstrated that there is a strong relationship between cases within communities and among other communities. Further study could be done using SNA to demonstrate temporal spacing of the transmission of TB among the study group with further analysis of the genomes of the endemic strains. SNA allows the focus of the investigation to shift from individual case investigations to broader population-based examination of commonalities such as common networks of drug use or places of social congregation.

In conclusion, this study demonstrates how DNA fingerprinting and SNA can be additional epidemiological tools, along with the CCT method, to determine transmission patterns of TB. The 3 tools complement one another and each provides significant additional information to a TB investigation, which could be applied to prospective and retrospective investigations for TB transmission patterns. In this study, TB is most prevalent among marginalised populations in the NWT, and future control efforts need to focus on social networking patterns related to geographic location, alcohol consumption, exposure to a case, unemployment and homelessness.

TB remains a serious problem among the Aboriginal population in the NWT. Close to half of the cases had evidence of being infected long before progression to active disease; they had evidence of previous LTBI. A high degree of strain homogeneity and previous infection with M. tuberculosis raise the question of whether large-scale testing and treatment of latent infection might be an effective way of dramatically reducing TB rates in some of the isolated communities in the NWT. Another option may be to drill down to the population at highest risk for contracting TB and target screening and treatment programs at that group.

Acknowledgements

Funding for this research was supported by the Institute for Circumpolar Health Research and Department of Health and Social Services, Government of Northwest Territories.

References

  • Grzybowski S, Styblo K, Dorken E. Tuberculosis in Eskimos. Tubercle. 1976; 57: S1–58.
  • Case C. Annual review of tuberculosis for 2006 and 2007. EpiNorth. 2008; 3–5. [cited 2012 Nov 10]. Available from: http://www.hss.gov.nt.ca/sites/default/files/2008vol20issue2.pdf.
  • Public Health Agency of Canada. Tuberculosis in Canadian-born Aboriginal peoples, special report of the Canadian tuberculosis committee. Ottawa: Public Health Agency of Canada. 2002 [cited 2012 Nov 10]. Available from: http://www.phac-aspc.gc.ca/publicat/tbcbap-tbpac/special_report-eng.php.
  • Castro KG, Jaffe HW. Rationale and methods for the national tuberculosis genotyping and surveillance network. Emerg Infect Dis. 2002; 8: 1188–91.
  • Blackwood KS, Al-Azem A, Elliott LJ, Hershfield ES, Kabani AM. Conventional and molecular epidemiology of tuberculosis in Manitoba. BMC Infect Dis. 2003; 3: 1–11.
  • Hernandez-Garduno E, Kunimoto D, Wang L, Rodrigues M, Elwood RK, Black W, et al. Predictors of clustering of tuberculosis in greater Vancouver: a molecular epidemiologic study. CMAJ. 2002; 167: 349–52.
  • Andre M, Ijaz K, Tillinghast JD, Krebs VE, Diem LA, Metchock B, et al. Transmission network analysis to complement routine tuberculosis contact investigation. Am J Public Health. 2007; 97: 470–7.
  • Cook VJ, Sun SJ, Tapia J, Muth SQ, Arguello F, Lewis BL, et al. Transmission network analysis in tuberculosis contact investigations. J Infect Dis. 2007; 196: 1517–27.
  • McElroy P, Rothenberg R, Varghese R, Woodruff R, Minns GO, Muth SQ, et al. A network-informed approach to investigating a tuberculosis outbreak: implications for enhancing contact investigations. Int J Tuberc Lung Dis. 2003; 7(Suppl 12): S486–S93.
  • Murray EJ, Marais BJ, Mans G, Beyers N, Ayles H, Godfrey-Faussett P, et al. A multidisciplinary method to map potential tuberculosis transmission “hot spots” in high-burden communities. Int J Tuberc Lung Dis. 2009; 13: 767–74.
  • van Soolingen D, Fremer K, Hermans PWM, Palomino JC, Leao SC, Ritacco V. Molecular epidemiology: breakthrough achievements and future prospects. Tuberculosis – from basic science to patient care. 2007; Brazil: Bernd Sebastian Komps and Patricia Boucillier. 315–340. [cited 2012 Nov 10]. Available from: http://www.pneumonologia.gr/articlefiles/molepid_2007.pdf.
  • Batagelj V, Mrvar A. Pajek – program for large network analysis. 2004 [cited 2011 Nov 10]. Available from: http://vlado.fmf.uni-lj.si/pub/networks/pajek/doc/pajekman.htm.
  • Long R, Schwartzman K, Long R, Ellis E. Transmission and pathogenesis of tuberculosis. Canadian tuberculosis standards, 6th ed. Canadian Lung Association, Canadian Thoracic Society and Tuberculosis Prevention and Control Centre for Infectious Disease Prevention and Control, Public Health Agency of Canada. Ottawa, ON; 2007 [cited 2012 Nov 10]: [p. 38–41]. Available from: http://www.phac-aspc.gc.ca/tbpc-latb/pubs/tbstand07-eng.php.
  • FitzGerald JM, Fanning A, Hoepnner V, Hershfield E, Kunimoto D. Canadian Molecular Epidemiology of TB Study Group. The molecular epidemiology of tuberculosis in western Canada. Int J Tuberc Lung Dis. 2003; 7: 132–8.