1,729
Views
13
CrossRef citations to date
0
Altmetric
Review Article

Genetic exchange and reassignment in Porphyromonas gingivalis

, &
Article: 1457373 | Received 24 Aug 2017, Accepted 19 Mar 2018, Published online: 12 Apr 2018

Figures & data

Table 1. Occurrences of transposase, fimbrilin and CRISPR features in 35 P. gingivalis genomes*

Figure 1. Major and minor fimbrilin proteins encoded in the 35 P. gingivalis genomic sequences. All the fimbrilin proteins were detected in the following manner: The keyword FimA/Mfa1/fimbrilin were used to select from the proteins predicted in the 35 genomes, which were annotated by the NCBI prokaryotic genome annotation pipeline [Citation54]. The keywords were also used to search for P. gingivalis FimA from the NCBI protein databases. These “seed” proteins were then searched against all the proteins in the 35 genomes to identify additional fimbrilin candidates. These potential candidates matched with the seed proteins at ≥ 90% sequence identity and ≥ 90% length coverage. The additional candidates were combined with other annotated fimbrilin proteins and aligned with the “mafftℍ program [Citation55]. The aligned sequences were used to build a phylogenetic tree with the “FastTree” program [Citation56]. For each fimbrilin shown in the tree, the Genbank accession number was printed first, followed with the strain label (in blue), the original functional annotation (by NCBI), and the size of the protein sequences in amino acids (red)

Figure 1. Major and minor fimbrilin proteins encoded in the 35 P. gingivalis genomic sequences. All the fimbrilin proteins were detected in the following manner: The keyword FimA/Mfa1/fimbrilin were used to select from the proteins predicted in the 35 genomes, which were annotated by the NCBI prokaryotic genome annotation pipeline [Citation54]. The keywords were also used to search for P. gingivalis FimA from the NCBI protein databases. These “seed” proteins were then searched against all the proteins in the 35 genomes to identify additional fimbrilin candidates. These potential candidates matched with the seed proteins at ≥ 90% sequence identity and ≥ 90% length coverage. The additional candidates were combined with other annotated fimbrilin proteins and aligned with the “mafftℍ program [Citation55]. The aligned sequences were used to build a phylogenetic tree with the “FastTree” program [Citation56]. For each fimbrilin shown in the tree, the Genbank accession number was printed first, followed with the strain label (in blue), the original functional annotation (by NCBI), and the size of the protein sequences in amino acids (red)

Figure 2. Syntenic analysis of 9 P. gingivalis genomic sequences. Of the 35 currently available P. gingivalis genomics sequences, eight are considered finished with one final successfully assembled contig. The sequence of JCVI SC001 appears to have a one-contig circular sequence under the Genbank Accession number CM001843, however it is a pseudo-contig generated by ordering the 284 unassembled contigs using the sequence of strain TDC60 as the template. Thus, the syntenies, illustrated as the same color across different genomes, of JCVI SC001 appear to be in similar order of those in TDC60 and may not be the true order in this genome. Syntenies were detected with the “MUAVE” program (version 2.4.0) [Citation57] and were illustrated as rectangles of the same colors across all genomes. Two groups of syntenies were indicated with blue and red arrows underneath to exemplify the different rearrangement and inversion events in different genomes for this species

Figure 2. Syntenic analysis of 9 P. gingivalis genomic sequences. Of the 35 currently available P. gingivalis genomics sequences, eight are considered finished with one final successfully assembled contig. The sequence of JCVI SC001 appears to have a one-contig circular sequence under the Genbank Accession number CM001843, however it is a pseudo-contig generated by ordering the 284 unassembled contigs using the sequence of strain TDC60 as the template. Thus, the syntenies, illustrated as the same color across different genomes, of JCVI SC001 appear to be in similar order of those in TDC60 and may not be the true order in this genome. Syntenies were detected with the “MUAVE” program (version 2.4.0) [Citation57] and were illustrated as rectangles of the same colors across all genomes. Two groups of syntenies were indicated with blue and red arrows underneath to exemplify the different rearrangement and inversion events in different genomes for this species