1,011
Views
5
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

The complete mitochondrial genome of the cat flea, Ctenocephalides felis

, , , , &
Pages 3422-3424 | Received 09 Jul 2020, Accepted 08 Sep 2020, Published online: 05 Oct 2020

Abstract

The cat flea, Ctenocephalides fells, is widely recognized as a global veterinary pest and a vector of pathogenic bacteria. We recently reported on the C. felis nuclear genome, which is characterized by over 38% protein coding gene duplication, extensive tRNA gene family expansion, and remarkable gene copy number variation (CNV) between individual fleas. Herein, we describe the assembly of the C. felis mitochondrial genome, a novel resource for comparative genomics of fleas and other insects. The order and content of mitochondrial genes is highly consistent with four previously sequenced flea mitochondrial genomes, limiting CNV to siphonapteran nuclear genomes.

With over 2,500 identified species across the globe, fleas are notorious veterinary pests and vectors of pathogens, including Rickettsia typhi (murine typhus), R. felis (murine typhus-like illness), Bartonella henselae (cat-scratch disease), and myxoma virus (Myxomatosis) (Bertagnoli and Marchandeau Citation2015; McElroy et al. Citation2010; Mullen and Durden Citation2002). Speciation of fleas is reliant on distinguishing morphological features; however, studies have also used certain mitochondrial genes for systematic analyses (Lawrence et al. Citation2019; McKern et al. Citation2008; Whiting et al. Citation2008). To date, four full mitochondrial genomes have been sequenced in the order Siphonaptera, representing a potential untapped source of genomic variation for clearer evolutionary inferences (Cameron Citation2015; Hystrichopsylla Citation2019; Tan et al. Citation2018; Xiang et al. Citation2017). We assembled the mitochondria genome of the cat flea, Ctenocephalides felis, using reads generated from our sequencing of the cat flea nuclear genome (Driscoll et al. Citation2020). While the cat flea nuclear genome exhibits unprecedented genome plasticity evinced by excessive gene duplication, the mitochondrial genome of the cat flea is consistent with other Siphonaptera mitochondrial genomes sequenced to date and contains no evidence of genome rearrangements or duplications.

To sequence the genome of the cat flea, unfed female C. felis (n = 250) from the Elward Laboratory colony (Soquel, CA, USA) were obtained in January 2018 and pooled for high molecular weight DNA extraction followed by long-read sequencing on the PacBio Sequel. The sample DNA was deposited in the arthropod repository at the University of Maryland Baltimore under accession Cf102787-2018.

Corrected PacBio reads were assembled with Canu (version 1.5) (Koren et al. Citation2017) in ‘pacbio-raw’ mode; the expected mitochondrial genome was compiled into a single 20,873 bp contig at 2267x coverage. Even with high coverage, the mitogenome could not be circularized informatically or with PCR techniques due to two distinct, contiguous AT-rich repeat regions spanning nearly 5000 bases combined. The mitochondrial genome was preliminarily annotated with a combination of MITOS (using the invertebrate genetic code with default parameters) (Bernt et al. Citation2013) and GeSeq (with default parameters and flea mitochondrial genomes as reference sequences) (Tillich et al. Citation2017). The complete flea mitochondrial genomes included in the GeSeq analyses were Jellisonia amadoi (NC_022710.1), Ceratophyllus wui (MG886872.1), Dorcadi ioffi (MF124314.1) and Hystrichopsylla weida qinlingensis (MH259703.1). Both MITOS and GeSeq did not predict complete open reading frames for any protein coding genes, resulting in truncated gene predictions. After annotation, the Canu-assembled mitochondrial genome contained multiple split genes in six protein coding genes requiring further investigation. To supplement the preliminary annotation, open reading frame analyses and BlastN were used to identify full open reading frames for protein coding genes extending in most cases in both 5′ and 3′ directions. Manual sequence analysis revealed split genes that appeared to be missing bases within homopolymer stretches resulting in truncated open reading frames. Targeted PCR amplification and Sanger sequencing resolved deletions at sites often containing stretches of four or more A or T. Additionally, paired-end 250 basepair read Illumina sequencing of C. felis from the same Elward Laboratory colony resolved the remaining deletions after examining read pileups for evidence of additional encoded bases.

The C. felis mitochondrial genome (Genbank accession number: MT594468) encodes the full repertoire of 37 genes, including 22 tRNAs, 13 protein coding genes, and 2 rRNAs with the conserved synteny observed in other Siphonaptera mitogenomes and the general insect mitochondrial gene order (Cameron Citation2015). The major strand is composed of 83.1% A + T and all protein coding genes are similar in size to homologs in other fleas with no evidence of gene truncations or major rearrangements. There are few non-coding positions within the conserved block of encoded genes. The singular intergenic spacer encoded is 50 bases in length and occurs between trnS2 and nad1. The cat flea mitogenome is at least 2000 bases greater in length than Ceratophyllus wui, the next largest sequenced flea mitogenome (Tan et al. Citation2018). The increase in the length of the C. felis mitochondrial genome is due to the longer AT-rich repeat regions flanking the core 37 gene segment, not due to additional internal spacer regions. All protein coding genes begin with canonical start codons (ATN) with the exception of atp8 starting with a TTG start codon. Stop codons are equally split between full TAA stop codons and incomplete T stop codons with the exception of ctyb, which has a TAG stop codon.

Phylogeny estimation of full proteomes from the five siphonapteran mitochondrial genomes corroborates previously determined flea relationships (Whiting et al. Citation2008), barring the lack of complete mitochondrial genome sequences for most of the siphonapteran families (). In contrast to the extreme genome size variation reported for C. felis, as well as the rat flea Xenopsylla cheopis (Driscoll et al. Citation2020), the observed genome stasis and phylogenetic utility of flea mitochondrial genomes implicates these resources as prudent tools for future analyses on flea systematics and epidemiology of flea-borne diseases.

Figure 1. Phylogeny estimation based on five siphonapteran proteomes. Protein sequences (n = 12: ATP synthase F0 subunits 6 & 8, cytochrome b, cytochrome c oxidase subunits I & III, and NADH dehydrogenase subunits 1-6 & 4 L) from 6 sequenced siphonapteran mitogenomes (Ctenocephalides felis, Jellisonia amadoi, Ceratophyllus wui, Dorcadia ioffi, and Hystrichopsylla weida qinlingensis) plus Panorpa debilis (Mecopteran) were independently aligned with MUSCLE v3.8.31 using default parameters. Aligned sequences were concatenated into a single data set (3,505 positions) for phylogeny estimation using RAxML v8.2.4, under the gamma model of rate heterogeneity, WAG model of substitution, and estimation of the proportion of invariant sites. Branch support was assessed with 500 pseudo-replications. Final ML optimization likelihood was -26546.055122.

Figure 1. Phylogeny estimation based on five siphonapteran proteomes. Protein sequences (n = 12: ATP synthase F0 subunits 6 & 8, cytochrome b, cytochrome c oxidase subunits I & III, and NADH dehydrogenase subunits 1-6 & 4 L) from 6 sequenced siphonapteran mitogenomes (Ctenocephalides felis, Jellisonia amadoi, Ceratophyllus wui, Dorcadia ioffi, and Hystrichopsylla weida qinlingensis) plus Panorpa debilis (Mecopteran) were independently aligned with MUSCLE v3.8.31 using default parameters. Aligned sequences were concatenated into a single data set (3,505 positions) for phylogeny estimation using RAxML v8.2.4, under the gamma model of rate heterogeneity, WAG model of substitution, and estimation of the proportion of invariant sites. Branch support was assessed with 500 pseudo-replications. Final ML optimization likelihood was -26546.055122.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The data for this study are openly available in GenBank at https://www.ncbi.nlm.nih.gov/nucleotide/ under accession number MT594468.

Additional information

Funding

Research reported in this publication was supported by the National Institute of Health (NIH)/National Institute of Allergy and Infectious Diseases (NIAID) grants R01AI017828 and R01AI126853 to AFA, R01AI122672 to KRM, and R21AI26108 and R21AI146773 to JJG. TPD and VIV were supported by startup funding provided to TPD by West Virginia University.

References

  • Bernt M, Donath A, Jühling F, Externbrink F, Florentz C, Fritzsch G, Pütz J, Middendorf M, Stadler PF. 2013. MITOS: improved de novo metazoan mitochondrial genome annotation. Mol Phylogenet Evol. 69(2):313–319.
  • Bertagnoli S, Marchandeau S. 2015. Myxomatosis. Rev Sci Tech Int off Epizoot. 34:549–556, 539–547.
  • Cameron SL. 2015. The complete mitochondrial genome of a flea, Jellisonia amadoi (Siphonaptera: Ceratophyllidae). Mitochondr DNA. 26(2):289–290.
  • Driscoll TP, Verhoeve VI, Gillespie JJ, Johnston JS, Guillotte ML, Rennoll-Bankert KE, Rahman MS, Hagen D, Elsik CG, Macaluso KR, et al. 2020. A chromosome-level assembly of the cat flea genome uncovers rampant gene duplication and genome size plasticity. BMC Biol. 18(1):70.
  • Tan L, Yao X, Liu J, Huang Q, and Lei C. Hystrichopsylla weida qinlingensis mitochondrion, complete genome 2019. http://www.ncbi.nlm.nih.gov/nuccore/NC_042380.1. Accessed 19 Jun 2020.
  • Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27(5):722–736.
  • Lawrence AL, Webb CE, Clark NJ, Halajian A, Mihalca AD, Miret J, D'Amico G, Brown G, Kumsa B, Modrý D, et al. 2019. Out-of-Africa, human-mediated dispersal of the common cat flea, Ctenocephalides felis: the hitchhiker's guide to world domination. Int J Parasitol. 49(5):321–336.
  • McElroy KM, Blagburn BL, Breitschwerdt EB, Mead PS, McQuiston JH. 2010. Flea-associated zoonotic diseases of cats in the USA: bartonellosis, flea-borne rickettsioses, and plague. Trends Parasitol. 26(4):197–204.
  • McKern JA, Szalanski AL, Austin JW, Gold RE. 2008. Genetic diversity of field populations of the cat flea, Ctenocephalides felis, and the human flea, Pulex irritans, in the South Central United States. J Agric Urban Entomol. 25(4):259–263.
  • Mullen G, Durden L.Medical and veterinary entomology. Amsterdam, Boston: Academic Press; 2002
  • Tan L, Guan X, Zhang L, Zhu F, Lei C. 2018. The complete mitochondrial genome of the flea Ceratophyllus wui (Siphonaptera: Ceratophyllidae). Mitochondr DNA B. 3(1):401–402.
  • Tillich M, Lehwark P, Pellizzer T, Ulbricht-Jones ES, Fischer A, Bock R, Greiner S. 2017. GeSeq-versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 45(W1):W6–11.
  • Whiting MF, Whiting AS, Hastriter MW, Dittmar K. 2008. A molecular phylogeny of fleas (Insecta: Siphonaptera): origins and host associations. Cladistics. 24(5):677–707.
  • Xiang H-T, Wen F-Q, Wang G-L. 2017. The complete nucleotide sequence of the mitochondrial genome of Dorcadia ioffi (Siphonaptera: Vermipsyllidae). Mitochondr DNA B. 2(2):389–390.