766
Views
2
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

The complete chloroplast genome sequence of Casuarina equisetifolia

, , , , &
Pages 3046-3048 | Received 08 May 2021, Accepted 07 Aug 2021, Published online: 01 Oct 2021

Abstract

Casuarina equisetifolia, as windbreaks, soil erosion, and sand dune stabilization with high resistant to typhoon force winds, drought and salinization, belongs to the Casuarinaceae family. In this study, the complete chloroplast genome of C. equisetifolia was sequenced by Illumina sequencing platform and annotated by Geneious Prime. The complete chloroplast genome size is 156,128 bp in length, with a large single copy region (LSC: 86,192 bp) and a small single-copy region (SSC: 18,462 bp), which was separated by a pair of 25,737 bp inverted repeated regions (IRs). The chloroplast genome of C. equisetifolia encodes total 127 genes, including 82 protein-coding genes, 37 tRNA genes, and eight rRNA genes. The phylogenomic relationship analysis suggested that the Casuarinaceae family, which includes C. equisetifolia, was more closely related to the family of Betulaceae.

Casuarina equisetifolia is a member of the Casuarinaceae family and an invasive woody species, native to Malaysia, southern Asia, Australia, and Oceania (Hata et al. Citation2016). Nowadays, it has been successfully introduced to and gradually domesticated mainly in the southern Chinese coastal regions (Wei et al. Citation2021). It has high economic value due to providing fuel wood, land reclamation, dune stabilization, and scaffolding for construction, shelter belts, and pulp and paper production (Karthikeyan et al. Citation2013). However, there was very few genetic and genomic studies on the C. equisetifolia, which limits the study and utilization of C. equisetifolia. Herein, the complete chloroplast genome sequence of C. equisetifolia was assembled and characterized to provide more understanding of its evolution and genetic identification.

In this study, fresh leaves of C. equisetifolia were sampled from the Yangkou Forest Farm, in Fujian province, China, which located at 117.30–118.14 E, 26.39 − 27.12 N. The voucher specimen (voucher number MMH2020) was preserved at Key Laboratory of Forest Genetics and Biotechnology of Ministry of Education of China, Nanjing Forestry University. The total genomic DNA were extracted from fresh leaves and sequenced by a HiSeq Xten platform with the PE150 strategy (Novogene, Nanjing, China). After the sequencing, a total of ∼4.96 G raw data were obtained, then filtered and trimmed using SAMtools (Li Citation2011) and Fastp (Chen et al. Citation2018) to get the ∼4.94 G clean data. The chloroplast genomes de novo assembly was performed by software package velvet (Version. 1.2.10) (Zerbino et al. Citation2009) and annotated by Geneious Prime (Version. 2020.2.4) and tRNA-SCAN (Chan and Lowe Citation2019), with the chloroplast genome of putative closely related species Alnus cremastogyne (MH628453.1), Betula alnoides (MK888853.1) and Carpinus putoensis (KX695124.1) as references. The annotated complete chloroplast genome of C. equisetifolia was deposited in GenBank under the accession number of MZ032230.

A typical quadripartite structure was found in the complete chloroplast genome of C. equisetifolia with a length of 156,128 bp and ∼37% GC content. It contained a large single-copy region (LSC) of 86,192 bp with ∼34% GC content, a small single-copy region (SSC) of 18,462 bp with ∼30% GC content, and two inverted repeat regions (IRs) of 25,737 bp with ∼42% GC content. The genome annotation revealed that it contained total 127 functional genes, including 82 protein-coding genes, 37 tRNA genes, and eight rRNA genes.

Because of the lack of complete chloroplast genome data of species in the same family, 47 published species among the Fagales order, which C. equisetifolia resides within, were selected for a comprehensive phylogenetic analysis to determine the phylogenetic relationship of C. equisetifolia. Four species (Licania canescens, Garcinia paucinervis, Suriana maritima, and Polygala japonica) belonged to the Malpighiales and Fabales orders, which are the members of the Fabid clade together with the Fagales order, were used as outgroups. Fifty-one coding sequences (CDS) of protein-coding genes, which were commonly annotated in all 52 plants species, were selected and aligned using MEGA 7.0 (Kumar et al. Citation2016). The phylogenetic inference was generated based on maximum-likelihood (ML) analysis with the GTR model in RAxML v1.0.0 (https://raxml-ng.vital-it.ch/) (Kozlov et al. Citation2019). The phylogenetic tree showed that the members of Fagales order we selected has been divided into two groups. One group only contained the Fagaceae family, and the another one was consisted of the Betulaceae, Casuarinaceae, Fagaceae, Juglandaceae, Myricaceae family. In our study, the Casuarinaceae family, where only C. equisetifolia chloroplast was annotated in, was more closely related to the family of Betulaceae (). The chloroplast genome sequence of C. equisetifolia in this study will be useful for further analysis on molecular markers and molecular breeding, and reminds us that the cognition of Casuarinaceae family is still very limited.

Figure 1. Maximum-likelihood (ML) tree based on CDSs of protein-coding gene of 47 plant species belonged to the Fagales order and four outgroups. Numbers below the branches represent the bootstrap support values (%). The Fagales order is subdivided in several plant families, indicated by color coding: the Betulaceae (blue), Casuarinaceae (orange), Fagaceae (purple), Juglandaceae (green), Myricaceae (red). Four outgroups are indicated in gray.

Figure 1. Maximum-likelihood (ML) tree based on CDSs of protein-coding gene of 47 plant species belonged to the Fagales order and four outgroups. Numbers below the branches represent the bootstrap support values (%). The Fagales order is subdivided in several plant families, indicated by color coding: the Betulaceae (blue), Casuarinaceae (orange), Fagaceae (purple), Juglandaceae (green), Myricaceae (red). Four outgroups are indicated in gray.

Disclosure statement

No potential conflict of interest was reported by the authors.

Data availability statement

The genome sequence data that support the findings of this study are openly available in GenBank of NCBI at (https://www.ncbi.nlm.nih.gov/) under the accession no. MZ032230. The associated BioProject, SRA, and Bio-Sample numbers are PRJNA726353, SRR14368234, and SAMN18928268, respectively.

Additional information

Funding

This work was supported by National Forestry and Grassland Administration [2020133106] and Priority Academic Program Development of Jiangsu Higher Education Institutions.

References

  • Chan PP, Lowe TM. 2019. tRNAscan-SE: searching for tRNA genes in genomic sequences. Methods Mol Biol. 1962:1–14.
  • Chen S, Zhou Y, Chen Y, Jia G. 2018. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 34(17):i884–i890.
  • Hata K, Kawakami K, Kachi N. 2016. Increases in soil water content after the mortality of non-native trees in oceanic island forest ecosystems are due to reduced water loss during dry periods. Sci Total Environ. 545–546:372–380.
  • Karthikeyan A, Chandrasekaran K, Geetha M, Kalaiselvi R. 2013. Growth response of Casuarina equisetifolia Forst. rooted stem cuttings to Frankia in nursery and field conditions. J Biosci. 38(4):741–747.
  • Kozlov AM, Darriba D, Flouri T, Morel B, Stamatakis A. 2019. RAxML-NG: a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics. 35(21):4453–4455.
  • Kumar S, Stecher G, Tamura K. 2016. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 33(7):1870–1874.
  • Li H. 2011. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics. 27(21):2987–2993.
  • Wei Y, Zhang Y, Meng J, Wang Y, Zhong C, Ma H. 2021. Transcriptome and metabolome profiling in naturally infested Casuarina equisetifolia clones by Ralstonia solanacearum. Genomics. 113(4):1906–1918.
  • Zerbino DR, Mcewen GK, Margulies EH, Ewan B, Salzberg SL. 2009. Pebble and rock band: heuristic resolution of repeats and scaffolding in the velvet short-read de novo assembler. PLoS One. 4(12):e8407.