Figures & data
Characteristics and phenotype data for Campylobacter concisus strains used in the study
Columns: Full stars represent Faecal samples, empty stars represent Oral samples. Squares represent presences in genome assemblies and circles presence in plasmid assemblies. Red represents Exo9 whilst blue represents ZOT presence. Full shapes indicate presence, empty shapes indicate absence. For plasmids, there are some samples with no shape, this indicates that plasmidSPAdes software was unable to assemble any plasmids for the sample
![Fig. 1 Phylogenetic tree based on whole genome sequences of Campylobacter concisus strains used in this study, incorporating all published genomes.Columns: Full stars represent Faecal samples, empty stars represent Oral samples. Squares represent presences in genome assemblies and circles presence in plasmid assemblies. Red represents Exo9 whilst blue represents ZOT presence. Full shapes indicate presence, empty shapes indicate absence. For plasmids, there are some samples with no shape, this indicates that plasmidSPAdes software was unable to assemble any plasmids for the sample](/cms/asset/6ed398a0-6186-4a68-b724-b41e1cd78e47/temi_a_12040025_f0001_ob.jpg)
Clinical source of strains based on ZOT and Exotoxin 9 positivity as well as the percentage of isolates that belonged to Genomospecies (GS) I and II
Contains VFDB hits from the VFDB all dataset. Virulence factors were detected using the Comprehensive Antibiotic Resistance Database (CARD) and the Virulence Factor Database (VFDB). There are two colour columns representing metadata for the samples. The first represents the disease presentation of the host with a legend available. The second represent the genomospecies (GS) of the isolate with “red” referring to GSI and “blue” to GSII
![Fig. 2 Presence of virulence factors in genomes of isolates sequenced for this study.Contains VFDB hits from the VFDB all dataset. Virulence factors were detected using the Comprehensive Antibiotic Resistance Database (CARD) and the Virulence Factor Database (VFDB). There are two colour columns representing metadata for the samples. The first represents the disease presentation of the host with a legend available. The second represent the genomospecies (GS) of the isolate with “red” referring to GSI and “blue” to GSII](/cms/asset/9d67dcea-2b13-47fb-b0f8-e6b9b4a92c70/temi_a_12040025_f0002_ob.jpg)
There are two colour columns representing metadata for the samples. The first represents the host disease presentation with a legend available. The second represent the GS of the isolate with red referring to GSI and blue to GSII
![Fig. 3 T4SS and T6SS presence within the assembled genomes.There are two colour columns representing metadata for the samples. The first represents the host disease presentation with a legend available. The second represent the GS of the isolate with red referring to GSI and blue to GSII](/cms/asset/dae99d06-055d-473a-a05c-357ae66ac723/temi_a_12040025_f0003_ob.jpg)
Fig. 4 a Pangenome summary of all C. concisus used in this study. The figure displays the number of genes found to be shared across a certain percentage of samples in different groupings of isolates. This figure uses a bin size of 5% on the x-axis. GS Genomospecies, and b Pangenome summary of all Campylobacter. This was produced by carrying out pangenome analysis of all the C. concisus isolates and one reference assembly for each non-C. concisus species that was available. The figure displays how many genes were found to be shared across a certain percentage of samples in different groupings of isolates. The figure depicts a bin size of 5% on the x-axis
![Fig. 4 a Pangenome summary of all C. concisus used in this study. The figure displays the number of genes found to be shared across a certain percentage of samples in different groupings of isolates. This figure uses a bin size of 5% on the x-axis. GS Genomospecies, and b Pangenome summary of all Campylobacter. This was produced by carrying out pangenome analysis of all the C. concisus isolates and one reference assembly for each non-C. concisus species that was available. The figure displays how many genes were found to be shared across a certain percentage of samples in different groupings of isolates. The figure depicts a bin size of 5% on the x-axis](/cms/asset/e8e5d9f0-fc74-4ee0-81f9-a469e5dcef9c/temi_a_12040025_f0004_ob.jpg)
Hierarchies which were shared by five or fewer isolates were not included. There are two colour columns representing metadata for the samples. The first represents host disease presentation with a legend available. The second represent the GS of the isolate with “red” referring to GSI and “blue” to GSII
![Fig. 5 Plasmid KEGG Orthology (ko0001) KEGG BRITE hierarchies present within the amino acid sequences present in plasmid sequences of samples.Hierarchies which were shared by five or fewer isolates were not included. There are two colour columns representing metadata for the samples. The first represents host disease presentation with a legend available. The second represent the GS of the isolate with “red” referring to GSI and “blue” to GSII](/cms/asset/3d02b275-523a-42bc-9033-3138b59d4c6b/temi_a_12040025_f0005_ob.jpg)
Phylogenetic tree of faecal/oral paired samples based on the whole genome and b Heatmap of genes found to be core only in oral samples and core in only faecal samples from oral/faecal paired samples. Columns: Full stars represent faecal samples, empty stars represent oral samples. Squares represent presence in genome assemblies and circles presence in plasmid assemblies. “Red” represents Exo9 whilst “blue” represents ZOT presence. Full shapes indicate presence, empty shapes indicate absence. For plasmids, there are some samples with no shape. This indicates that plasmids were not assembled for this sample. Heatmap: There are two colour rows representing metadata for the samples. The first represents the host disease presentation with a legend available. The second represent the GS of the isolate with “red” referring to GSI and “blue” to GSII
![Fig. 6 Phylogenetic tree. aPhylogenetic tree of faecal/oral paired samples based on the whole genome and b Heatmap of genes found to be core only in oral samples and core in only faecal samples from oral/faecal paired samples. Columns: Full stars represent faecal samples, empty stars represent oral samples. Squares represent presence in genome assemblies and circles presence in plasmid assemblies. “Red” represents Exo9 whilst “blue” represents ZOT presence. Full shapes indicate presence, empty shapes indicate absence. For plasmids, there are some samples with no shape. This indicates that plasmids were not assembled for this sample. Heatmap: There are two colour rows representing metadata for the samples. The first represents the host disease presentation with a legend available. The second represent the GS of the isolate with “red” referring to GSI and “blue” to GSII](/cms/asset/17924611-df8d-4523-b83e-46e2149a3c58/temi_a_12040025_f0006_ob.jpg)