516
Views
0
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

The complete chloroplast genome sequence of Casearia glomerata Roxb. (Malpighiales; Salicaceae) from Fujian, China

, , , , ORCID Icon &
Pages 1932-1933 | Received 23 Mar 2021, Accepted 24 May 2021, Published online: 14 Jun 2021

Abstract

Casearia glomerata Roxb. is classified in Salicaceae and has a high economic value. Herein, we report the complete chloroplast (cp) genome of C. glomerata using Illumina sequence data. The cp genome is 156,809 bp in length and contains a large single-copy (LSC) region of 84,888 bp and a small single-copy (SSC) region of 17,039 bp separated by two inverted repeat (IR) regions of 27,441 bp each. It contained a total of 123 genes, with an overall GC content of 36.81%. The phylogenetic analysis showed that C. glomerata is closely related to Casearia velutina. This study provides important sequence information for species identification and its phylogenetic relationship in the Salicaceae

Casearia glomerata Roxb. is a shrub classified in the plant family Salicaceae. It is widely distributed from Central Himalayas to Fujian, Guangdong, Guangxi, Hainan, Taiwan, Yunnan in China (Yang and Zmarzty Citation2007). Casearia glomerata has yellow-green flowers and orange-red fruits, which possess a high economic value to garden cultivation (Chen Citation1953). Chloroplasts (cps) are essential in plant cells and play a crucial role in sustaining life (Xiong et al. Citation2009). Due to the high conservation of cp genomes compared to the nuclear and mitochondrial genomes, cp genome sequences have often been used for phylogenetic studies and species identification (Khan et al. Citation2010). Currently, no study had been published on the complete cp genome sequence of C. glomerata. Here, we performed high-throughput sequencing on a specimen of C. glomerata from China to determine its cp genome structure and evolutionary relationship to Salicaceae.

The fresh leaves of C. glomerata were collected from dense forests on the coast of Haitan Island in Fuzhou, Fujian Province, China (25°49′68″N, 119°71′18″E). The voucher specimen is deposited at Fujian Agriculture and Forestry University (No. FZ-FJ2020-10A, FAFU, HuiHUANG: [email protected]). The genomic DNA was extracted using Plant Genomic DNA Kit, DP305 (TIANGEN, Beijing, China). The sequencing library was produced using the Illumina Truseq DNA Sample Preparation Kit (Illumina, San Diego, CA) according to the manufacturer's recommendations. The prepared library was loaded on the Illumina Novaseq 6000 platform for PE 2 × 150 bp sequencing at Novogene (Beijing, China). The raw data were used to assemble the complete cp genome using the GetOrganelle pipeline (Jin et al. Citation2020). Genome annotation was performed with PGA (Qu et al. Citation2019) by comparing the sequences with the cp genome of Casearia velutina, GenBank Accession Number MN078141. The annotated genome sequence was deposited in GenBank under Accession Number MW428079.

The circular cp genome of C. glomerata was 156,809 bp and contains a larger single-copy (LSC) region of 84,888 bp in length, a smaller single-copy (SSC) region of 17,039 bp in length, and a pair of inverted repeats (IRs) of 27,441 bp. There were 131 genes predicted in this genome, of which 79 are protein-coding genes, 36 tRNA, and eight ribosomal RNA genes. The base content of the C. glomerata cp genome is A (31.21%), T (31.98%), C (18.70%), G (18.11%), and the overall GC content of the cp genome is 36.81%.

The cp genome of C. glomerata was aligned with other nine cp genomes of Salicaceae and Arabidopsis thaliana was designated as the outgroup to construct the phylogenetic tree. All complete cp genomes were aligned with the MAFFT version 7.388 using default settings (Katoh and Standley Citation2013). The phylogenetic analysis was conducted based on maximum likelihood (ML) analyses implemented in IQ-TREE version 2.1.2 with the TVM + F+R5 nucleotide substitution model, which was selected by ModelFinder (Kalyaanamoorthy et al. Citation2017; Minh et al. Citation2020). The support for the inferred ML tree was inferred by bootstrapping with 1000 replicates. The phylogenetic analysis suggested that C. glomerata is closely related to C. velutina (). This cp genome will provide an important resource for addressing taxonomic issues and studying the molecular evolution of Casearia.

Figure 1. Maximum-likelihood (ML) tree based on nine cp genome sequences from the Salicaceae with Arabidopsis thaliana designated as outgroups. Numbers on the nodes are bootstrap values based on 1000 replicates. Casearia glomerata was marked in red.

Figure 1. Maximum-likelihood (ML) tree based on nine cp genome sequences from the Salicaceae with Arabidopsis thaliana designated as outgroups. Numbers on the nodes are bootstrap values based on 1000 replicates. Casearia glomerata was marked in red.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The genome sequence data that support the findings of this study are openly available in GenBank of NCBI at (https://www.ncbi.nlm.nih.gov/) under the accession no MW428079. The associated BioProject, SRA, and Bio-Sample numbers are PRJNA688994 SAMN17193100, and SRR13340381, respectively.

Additional information

Funding

This work was supported by the [Technology Project of Fujian Science and Technology Department] under grant [No. 2018Y3006]; and [Fujian Agriculture and Forestry University Science and Technology Innovation special project] under grant [No. KCXRC268A].

References

  • Chen R. 1953. Chinese tree taxonomy. Shanghai: Science and Technology Press.
  • Jin JJ, Yu WB, Yang JB, Song Y, Depamphilis CW, Yi TS, Li DZ. 2020. Get organelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21(1):1–31.
  • Kalyaanamoorthy S, Minh BQ, Wong TK, Von Haeseler A, Jermiin LS. 2017. Model finder: fast model selection for accurate phylogenetic estimates. Nat Methods. 14(6):587–589.
  • Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30(4):772–780.
  • Khan A, Khan IA, Asif H, Azim MK. 2010. Current trends in chloroplast genome research. Afr J Biotechnol. 9(24):3494–3500.
  • Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, Von Haeseler A, Lanfear R. 2020. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 37(5):1530–1534.
  • Qu XJ, Moore MJ, Li DZ, Yi TS. 2019. PGA: a software package for rapid, accurate, and flexible batch annotation of plastomes. Plant Methods. 15(1):50.
  • Xiong AS, Peng RH, Zhuang J, Gao F, Zhu B, Fu XY, Xue Y, Jin XF, Tian YS, Zhao W, et al. 2009. Gene duplication, transfer, and evolution in the chloroplast genome. Biotechnol Adv. 27(4):340–347.
  • Yang QE, Zmarzty S. 2007. Flacourtiaceae. In: Wu Z, Raven P, Hong D, editor. Flora of China (Clusiaceae through Araliaceae). St. Louis (MO): Missouri Botanical Garden Press; p. 112–137.