Abstract
In the present study, the nearly complete mitochondrial genome of Euphaea ochracea was described and its phylogenetic position in the family Euphaeidae was analyzed. Here, we recovered 13 protein-coding genes, 22 transfer RNAs, 2 ribosomal RNAs and a partial control region, resulting in a mitogenome length of 15,545bp. All protein-coding genes were initiated by the typical ATN codon except nad3 and nad1, which utilizes the TTG codon. Four protein-coding genes (cox1, cox2, cox3 and nad5) are terminated by an incomplete stop codon T, while others end with either a TAA or TAG codon. The intergenic spacer region, S5, is absent in this mitogenome, supporting the lack of this region as a specific character in damselflies. Phylogenetic analysis showed that the newly sequenced E. ochracea is phylogenetically closer to E. ornata with a high support value.
Introduction
Equipped with unparalleled characteristics, Odonata (dragonflies and damselflies) are among the most ancient groups of winged insects and have served as prominent model organisms in the study of insect evolution (Bybee et al. Citation2016; Jiang et al. Citation2021). Odonates are also excellent health indicators of aquatic ecosystems and are well-distributed in tropical and subtropical regions (Chee Yen and Mohamed Dawood, Citation2021). The ochraceous darkies, Euphaea ochracea is a damselfly species from the zygopteran family Euphaeidae and often identified based on the golden tip of their wings (male) (Abdul Aziz and Mohamed, Citation2019) (). To date, there are four mitogenomes available for this genus in GenBank, however, the mitogenome data for this species have yet to be reported publicly. Here, we sequenced and characterized a nearly complete mitochondrial genome of E. ochracea from Malaysia. The information presented in this work will provide baseline mitogenome data to understand the phylogenetic relationships within Euphaeidae and the study of insect genome evolution.
Materials
The adult sample of E. ochracea was collected from Ayer Hitam Forest Reserve Johor, Malaysia (2.025680, 102.794151) and was identified by a certified taxonomist, Dr. Aqilah Awg Abdul Rahman. The specimen was deposited at Universiti Tun Hussein Onn Malaysia (UTHM) (https://uthm.edu.my; Dr. Aqilah Awg Abdul Rahman; [email protected]) with the voucher ID DIO031.
Methods
Genomic DNA extraction and sequencing
The total genomic DNA was extracted from the muscle tissue of the hind leg using the Qiagen Blood and Tissue Kit (Qiagen, Valencia, CA), following the manufacturer’s instruction, and fragmented via a Bioruptor® system. Library preparation was done using the NEBNext® UltraTM II DNA Library Prep Kit for Illumina® prior to sequencing using the Illumina NovaSeq 6000 (PE150).
Mitogenome assembly, annotation, and visualization
The raw reads were firstly assessed for its quality with FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) and were trimmed for sequencing adapters, as well as low-quality reads using AdapterRemoval (Schubert et al. Citation2016). With the cox1 sequence of E. ochracea retrieved in the BOLD database (SICO 922-18.COI-5P) as the bait reference sequence, the trimmed reads were then assembled using NOVOPlasty v.4.2 (Dierckxsens et al. Citation2017). Next, the bwa-samtools pipeline was used to assess the assembled mitogenome by mapping the clean raw DNA reads (adapter free) to the mitogenome sequences. The sequence was then viewed on Tablet (Milne et al. Citation2009) to manually check for indels and sequence coverage. The nearly complete mitogenome was annotated using MITOS v2 web server (Bernt et al. Citation2013). The predicted protein-coding genes were further verified with the Open Reading Frame (ORF) Finder server (https://www.ncbi.nlm.nih.gov/orffinder/) following Miga et al. (Citation2022), by aligning the genes with those of the other Euphaea species using Jalview 2 v11.1.4 (Waterhouse et al. Citation2009) to improve the annotation quality. Then, the mitogenome map was produced by Proksee (https://proksee.ca/), an updated version of the CGView web server (Grant and Stothard Citation2008).
Phylogenetic analysis
Thirteen protein-coding gene sequences from 10 Odonata mitogenome as representatives were downloaded from GenBank, extracted, and aligned in batches using MAFFT (Katoh et al. Citation2002) before trimming with Gblocks (Castresana Citation2000) prior to concatenation. Ischnura pumilio (KC878732) (Lorenzo-Carballa et al. Citation2014) was used as the outgroup and this workflow was performed in PhyloSuite v1.2.2 (Zhang et al. Citation2020). Phylogenetic analysis was performed based on the Maximum-Likelihood method using the concatenated 13 protein-coding genes in MEGA 11 with 1000 bootstrap values (Tamura et al. Citation2021).
Results
The nearly complete mitogenome of E. ochracea is 15,545bp in length (GenBank accession no.: ON165247), in which 13 protein-coding genes (PCGs), 22 transfer RNAs (tRNAs), 2 ribosomal RNAs (rRNAs) and partial control region were recovered (). The control region which is also known as an AT-rich, the repetitive sequence can be difficult to resolve due to the underrepresentation of AT-rich sequences in Illumina data (Shen et al. Citation2015). Hence, the E. ochracea is presented as a linear mitogenome in GenBank. The mitogenome has a nucleotide composition of A (42.53%), T (28.19%), C (17.05%) and G (12.24%). The nucleotide skew statistics also showed high occurrences of A over T (A–T skew: 0.203) and C over G (G–C skew:0.164). Nine PCGs and 14 tRNAs are encoded on the heavy-strand (H-strand), and the remaining genes are on the light strand (L-strand). The protein-coding gene sequences are 11,145bp in length, while the transfer RNAs are 1,496bp, ranging from 64 bp (tRNA-Cys, tRNA-Arg and tRNA-Ser) to 72 bp (tRNA-Tyr, tRNA-Leu, tRNA-Lys). The 12S and 16S rRNAs are 747 bp and 1,289bp, respectively. Among the 13 PCGs, two PCGs (nad3 and nad1) were initiated by TTG codon, while others utilized the standard ATN start codon. Additionally, incomplete stop codons were observed in four PCGs (cox1, cox2, cox3 and nad5), while the remaining PCGs were terminated by either TAA or TAG stop codon. In E. ochracea, we detected four intergenic spacer regions (s1-s4), consistent with other Euphaea species, and short nucleotide overlaps between 8 gene junctions. A BLAST analysis was done on the cox1 sequence of this species which showed a 98% similarity with the cox1 sequence of E. ochracea from Myanmar (MN345956).
Phylogenetic analysis showed that the majority of the nodes have bootstrap support value higher than 94% (). The newly sequenced E. ochracea in this study resides together with the other Euphaea species in the family Euphaeidae, with a high bootstrap value of 100 and was phylogenetically closer to E. ornata (KF718295) (Cheng et al. Citation2018). Based on the analysis performed here, the family Euphaeidae has a close relationship with the family Pseudolestidae in Zygoptera.
Discussion and conclusion
This study provided a nearly complete mitogenome of Euphaea ochracea and was deposited in GenBank with accession no. ON165247. The nearly complete mitogenome displayed a gene order of tRNA-Ile, tRNA-Gln, and tRNA-Met, which is conserved and identical to other previously sequenced Euphaeidae damselflies. The absence of the s5 intergenic spacer region also supports the lack of this region as a specific character in damselflies (Xu et al. Citation2018). The phylogenetic relationship of Euphaeidae obtained from the analyses is also consistent with other previous studies, where Euphaeidae is phylogenetically closer to the family Pseudolestidae (Jiang et al. Citation2021; Wang et al. Citation2021). The draft mitogenome of E. ochracea presented in this work can provide useful DNA molecular data for further phylogenetic and evolutionary analysis in Euphaeidae.
Authors’ contributions
MM: Writing-original draft preparation, data curation. PNSJ: Laboratory work. MM, PNSJ, FMS: Conceptualization, data validation, reviewing, editing. MSS: Conceptualization. SP, HR, MABL, YJW, FMS: Resources (facilities) and funding acquisition. FMS: Supervision.
Permission
This work was done with the permission by the Forestry Department of Peninsular Malaysia under the Research permit JH/100 Jld. 30 (18).
Disclosure statement
No potential conflict of interest was reported by the author(s)
Data availability statement
The genome sequence data that support the findings of this study are openly available in GenBank of NCBI at (https://www.ncbi.nlm.nih.gov/) under accession no. ON165247. The associated BioProject, SRA and BioSample numbers are PRJNA753627, SRR15422665 and SAMN20720553 respectively.
Additional information
Funding
References
- Abdul Aziz MAA, Mohamed M. 2019. Annotated checklist of odonates (Insecta: Odonata) in Sungai Bantang Recreational Forest, Bekok, Johor, Malaysia. IOP Conf Ser: Earth Environ Sci. 269(1):012002.
- Bernt M, Donath A, Jühling F, Externbrink F, Florentz C, Fritzsch G, Pütz J, Middendorf M, Stadler PF. 2013. MITOS: improved de novo metazoan mitochondrial genome annotation. Mol Phylogenet Evol. 69(2):313–319.
- Bybee S, Córdoba-Aguilar A, Duryea MC, Futahashi R, Hansson B, Lorenzo-Carballa MO, Schilder R, Stoks R, Suvorov A, Svensson EI, et al. 2016. Odonata (dragonflies and damselflies) as a bridge between ecology and evolutionary genomics. Front Zool. 13(1):1–20.
- Castresana J. 2000. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol. 17(4):540–552.
- Chee Yen C, Mohamed Dawood M. 2021. Dragonflies and Damselflies (Odonata) of Kadamaian, Kinabalu Park, Sabah. J Trop Biol Conserv. 18:71–79.
- Chen MY, Chaw SM, Wang JF, Villanueva RJT, Nuñeza OM, Lin CP. 2015. Mitochondrial genome of a flashwing demoiselle, Vestalis melania from the Philippine Archipelago. Mitochondrial DNA. 26(5):720–721.
- Cheng YC, Chen MY, Wang JF, Liang AP, Lin CP. 2018. Some mitochondrial genes perform better for damselfly phylogenetics: species- and population-level analyses of four complete mitogenomes of Euphaea sibling species. Syst Entomol. 43(4):702–715.
- Dierckxsens N, Mardulyn P, Smits G. 2017. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45(4):e18.
- Grant JR, Stothard P. 2008. The CGView server: a comparative genomics tool for circular genomes. Nucleic Acids Res. 36(Web Server issue):181–184.
- Jiang B, Li J, Zhang Y, Sun Y, He S, Yu G, Lv G, Mikolajewski DJ. 2021. Complete mitochondrial genomes of two damselfly species in coenagrionidae and phylogenetic implications. Mitochondrial DNA Part B: Resour. 6(8):2445–2448.
- Katoh K, Misawa K, Kuma KI, Miyata T. 2002. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30(14):3059–3066.
- Lin CP, Chen MY, Huang JP. 2010. The complete mitochondrial genome and phylogenomics of a damselfly, Euphaea formosa support a basal Odonata within the Pterygota. Gene. 468(1–2):20–29.
- Lorenzo-Carballa MO, Thompson DJ, Cordero-Rivera A, Watts PC. 2014. Next generation sequencing yields the complete mitochondrial genome of the scarce blue-tailed damselfly, Ischnura pumilio. Mitochondrial DNA. 25(4):247–248.
- Macher J-N, Drakou K, Papatheodoulou A, Hoorn B v d, Vasquez M. 2020. The mitochondrial genomes of 11 aquatic macroinvertebrate species from Cyprus. Metabarcoding and Metagenomics. 4:91–96.
- Miga M, Jahari PNS, Vei Siang C, Kamarudin KR, Shamsir MS, Tokiman L, Parimannan S, Rajandas H, Mohamed F, Salleh FM. 2022. The complete mitochondrial genome data of the common Rose butterfly, Pachliopta aristolochiae (Lepidoptera, Papilionoidea, Papilionidae) from Malaysia. Data Brief. 40:107740.
- Milne I, Bayer M, Cardle L, Shaw P, Stephen G, Wright F, Marshall D. 2009. Tablet-next generation sequence assembly visualization. Bioinformatics. 26(3):401–402.
- Schubert M, Lindgreen S, Orlando L. 2016. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC Research Notes. 9(88):1–7.
- Shen J, Cong Q, Grishin N. v 2015. The complete mitochondrial genome of Papilio glaucus and its phylogenetic implications. Meta Gene. 5:68–83.
- Tamura K, Stecher G, Kumar S. 2021. MEGA11: molecular evolutionary genetics analysis Version 11. Mol Biol Evol. 38(7):3022–3027.
- Wang LJ, Hsu MH, Wang CH, Chung CH, Sung CH. 2021. The complete mitochondrial genome of Mnais tenuis Oguma, 1913 (Odonata: Calopterygidae) and its phylogenetic implications. Mitochondrial DNA Part B: Resour. 6(5):1648–1649.
- Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. 2009. Jalview Version 2-A multiple sequence alignment editor and analysis workbench. Bioinformatics. 25(9):1189–1191.
- Xu S, Guan Z, Huang Q, Xu L, Vierstraete A, Dumont HJ, Lin Q. 2018. The mitochondrial genome of Atrocalopteryx melli Ris, 1912 (Zygoptera: Calopterygidae) via ion torrent PGM NGS sequencing. Mitochondrial DNA Part B: Resources. 3(1):115–117.
- Zhang D, Gao F, Jakovlić I, Zou H, Zhang J, Li WX, Wang GT. 2020. PhyloSuite: an integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol Ecol Resour. 20(1):348–355.