443
Views
0
CrossRef citations to date
0
Altmetric
Plastome Report

The complete chloroplast genome of the Syzygium buxifolium Hook. Et Arn.1833 and its phylogenetic analysis

, &
Pages 851-855 | Received 26 Dec 2023, Accepted 18 Jun 2024, Published online: 27 Jun 2024

Abstract

Syzygium buxifolium. Hook. Et Arn.1833 is a member of the Myrtaceae family. This species is used in traditional Chinese medicines. It possesses numerous synonyms, reflecting the ambiguity in its taxonomy. The chloroplast genome has been widely used for species identification and phylogenetic analysis. Regrettably, there is a lack of information regarding the chloroplast genome of S. buxifolium. Here, we intend to obtain the chloroplast genome of S. buxifolium to resolve its classification problems. In particular, we utilized Illumina sequencing technology to sequence, GetOrganelle to assemble, and CPGAVAS2 to characterize the chloroplast genome of S. buxifolium. The chloroplast genome of S. buxifolium had a length of 158,581 bp and consisted of 111 unique genes, comprising 78 protein-coding genes, 29 transfer RNA (tRNA) genes, and four ribosomal RNA (rRNA) genes. In addition, we identified 86 Simple Sequence Repeats, 345 tandem repetitive sequences, and 34 dispersed repetitive sequences using modules implemented in CPGAVAS2. Lastly, we carried out phylogenetic analysis using Phylosuite. The results indicated a close relationship between S. buxifolium and S. grijsii. This study offers novel genetic data for the molecular identification and subsequent phylogenetic analysis of the Syzygium genus.

1. Introduction

Syzygium buxifolium Hook. Et Arn.1833 (Schoch et al. Citation2020), a member of the Myrtaceae family, is a plant species found in sparse woods or shrub communities in low mountainous regions. It may grow up to 5 meters tall and is widely distributed in Guangdong, Hunan, Fuzhou, Guizhou, and other locations (Song et al. Citation1999). The complete plant of S. buxifolium has been utilized medicinally. Its roots can treat asthma, encourage dampness, strengthen the spleen, and disperse blood stasis, while its leaves cleanse and purify (Zhao et al. Citation2006). In traditional Chinese medicines, S. buxifolium is mainly used to treat asthma, cancer spread, swelling, discomfort, burns, falls, swelling, and other diseases (Qin et al. Citation2002; Zhao et al. Citation2006). S. buxifolium’s primary bioactive elements are flavonoids, triterpenes, sterols, and essential oils (Zhou et al. Citation1998; Huong et al. Citation2023). Pharmacological studies have revealed that S. buxifolium has antibacterial, antiviral, antioxidant, anticancer, anti-aging, hypotensive, lipid-lowering, and immunomodulatory properties (Chen et al. Citation2016; Ding et al. Citation2018). However, due to their similar morphologies, Syzygium grijsii (Hance) Merr. & L.M. Perry and S. buxifolium are often confused (Huang Citation2003). Accurate identification of S. buxifolium materials is essential to ensure the efficacy and safety of their medicinal products.

The chloroplast is a plant organelle containing multiple essential proteins involved in photosynthesis and other metabolic processes (Keeling Citation2004). The chloroplast genome typically consists of four components: a large single-copy (LSC), a short single-copy (SSC), and two inverted repeats (IRs) regions (Palmer Citation1991; Wicke et al. Citation2011). The chloroplast genome is smaller than the nuclear and mitochondrial genomes and is not prone to recombination and nucleotide changes (Daniell et al. Citation2016). Phylogenetic analyses utilizing protein-coding and whole chloroplast genomes provide novel perspectives and significant advancements in our comprehension of plant evolution (Asaf et al. Citation2020). The non-coding region of chloroplast genomes has been used for plant species identification (Ngai et al. Citation2023). Nevertheless, the chloroplast genome of S. buxifolium has yet to be sequenced.

This study aims to sequence, assemble, and annotate the chloroplast genome of S. buxifolium in order to resolve taxonomic confusion. Additionally, by utilizing the chloroplast genome data to construct a phylogenetic tree, this research provides reliable and accurate genetic data support for species identification, molecular marker development, and systematic evolutionary studies of the genus Syzygium.

2. Materials and methods

Fresh leaves of S. buxifolium were gathered in Zhaiqian Town, Guidong County, Chenzhou City, Hunan Province, China (N25˚59′26ʺ, E113˚54′39ʺ) (). Healthy, pest-free mature leaves were picked, rinsed with tap water, dried, and utilized to extract DNA using the CTAB technique. The leftover leaves were covered in aluminum foil and stored at −80 °C. One sample was deposited at the Institute of Medicinal Plant Development (http://www.implad.ac.cn/, [email protected]) with the voucher number 20230711027.

Figure 1. Photo of S. buxifolium. Lichai Yuan took the picture at Zhaiqian town, Guidong County, Chenzhou City, Hunan Province, China (N25˚59′26ʺ, E113˚54′39ʺ). A: Leaf; B: Inflorescence; C: branchlet.

Figure 1. Photo of S. buxifolium. Lichai Yuan took the picture at Zhaiqian town, Guidong County, Chenzhou City, Hunan Province, China (N25˚59′26ʺ, E113˚54′39ʺ). A: Leaf; B: Inflorescence; C: branchlet.

We fragmented the genomic DNA into roughly 500 bp long insert fragments. We then used the NEBNext Ultra library construction kit for Illumina to construct the sequencing library. The library was sequenced on an Illumina platform using the PE150 mode at Novogene Bioinformatics Technology Co. Ltd. (Beijing, China). We used Trimmomatic (v0.35) (v0.35) (Bolger et al. Citation2014) to filter the raw data and eliminate adapters and low-quality bases. Clean data were assembled using the GetOrganelle v1.7.5 software (Jin et al. Citation2020). Gepard (Krumsiek et al. Citation2007) aligned the chloroplast genome to Arabidopsis thaliana (NC_000932.1). We used CPGAVAS2 (Shi et al. Citation2019) to annotate the chloroplast genome and manually corrected the annotation results using Apollo (Lee et al. Citation2009). The circular plastome was visualized using the CPGview tool (Liu et al. Citation2023).

Then, using the PhyloSuite program (Zhang et al. Citation2020) (Table S1), we built two trees using the whole chloroplast genome sequences of S. buxifolium and 17 other Syzygium species, with Eugenia reinwardtiana (Blume) DC (NC_084367.1) as outgroup. We used MAFFT to align these nucleic acid sequences (Katoh et al. Citation2019). Next, we built a tree using the maximum likelihood (ML) method implemented in IQ-tree v. 1.6.12 (Nguyen et al. Citation2015). In parallel, we constructed the Bayesian inference (BI) tree. The ML tree was built with the TVM + F + I + G4 model, and the BI tree was constructed with the GTR + I + G model. Both models were selected based on BIC. Lastly, the phylogenetic trees were shown using iTol (Letunic and Bork Citation2021).

3. Results

A total of 10.78 GB of raw data was produced. After filtering, 10.71 GB of clean data was acquired. We assembled the sequencing data and obtained two circular sequences of the S. buxifolium’s chloroplast genome. The two sequences differed in the SSC region’s orientation. The two sequences were compared with the chloroplast genome of Arabidopsis thaliana. For downstream analysis, we selected the sequence whose SSC region’s orientation is identical to that of the A. thaliana (Figure S1).

To validate the correctness of the assembled genome sequence, we mapped the sequencing reads to the assembled sequence. We obtained a 1527 × to 5654 × depth across the assembled genome, with an average depth of 3703.39 × (Figure S2). The results demonstrated the dependability of the assembly result.

With a total length of 158,581 bp, the chloroplast genome of S. buxifolium was a circular sequence with a conserved quadripartite structure (). It was composed of two inverted repeat regions (26,069 bp), a large single copy region (88,036 bp), and a small single copy region (18,407 bp). The chloroplast genome had a total G/C composition of 36.76%. The LSC region’s G/C composition (34.85%) was notably higher than that of the SSC region (30.94%) but lower than that of the IR regions (42.75%) (Table S2).

Figure 2. The genomic map of the chloroplasts in S. buxifolium. There are six tracks on the map. The first track (A) displays the forward and backward repeats connected by red and green arcs, viewed from the center outward. The tandem repeats are displayed as brief blue bars on track B, the second track. The microsatellite sequences are displayed as brief green and yellow bars on the third track (C). On the fifth track (D), the small single-copy (SSC), inverted repeat (IRa and IRb), and large single-copy (LSC) areas are displayed. Plotting of the GC content across the genome is done on track six (E). Plotting of genes is done on track seven (F). The gene name is followed by parenthesis with the optional codon use bias. Depending on how they are classified functionally, genes are colored. Transcription proceeds clockwise and counterclockwise for the inner and outer genes, respectively. The lower left corner displays the genes’ functional categorization.

Figure 2. The genomic map of the chloroplasts in S. buxifolium. There are six tracks on the map. The first track (A) displays the forward and backward repeats connected by red and green arcs, viewed from the center outward. The tandem repeats are displayed as brief blue bars on track B, the second track. The microsatellite sequences are displayed as brief green and yellow bars on the third track (C). On the fifth track (D), the small single-copy (SSC), inverted repeat (IRa and IRb), and large single-copy (LSC) areas are displayed. Plotting of the GC content across the genome is done on track six (E). Plotting of genes is done on track seven (F). The gene name is followed by parenthesis with the optional codon use bias. Depending on how they are classified functionally, genes are colored. Transcription proceeds clockwise and counterclockwise for the inner and outer genes, respectively. The lower left corner displays the genes’ functional categorization.

The chloroplast genome of S. buxifolium encoded 111 unique genes, including four rRNA genes, 29 tRNA genes, and 78 unique protein-coding genes. Two protein-coding genes (ycf3 and clpP) included two introns, whereas twelve protein-coding genes (atpF, ndhA, ndhB (x2), petB, petD, rpl16, rpl2 (×2), rpoC1, rps16, and ycf1) contained one intron each. (Figure S3). Trans-splicing was applied to the rps12 genes (Figure S4). Furthermore, one intron was present in eight tRNA-coding genes (trnA-UGC (×2), trnG-UCC, trnI-GAU (×2), trnK-UUU, trnL-UAA, and trnV-UAC) (Table S3). The S. buxifolium chloroplast genome’s coding sequence (CDS), tRNA, and rRNA lengths were 79,413, 9,052, and 2,705 bp, respectively. These lengths accounted for 50.08%, 5.71%, and 1.71% of the genome’s overall length (Table S2).

To learn more about the phylogenetic relationship among the members of the Syzygium genus. We built phylogenetic trees using Maximum Likelihood (ML) and Bayesian Inference (BI) methods with the chloroplast genome sequences. The ML and BI approaches produced two trees with nearly identical topology. Both trees were supported with high bootstrap values. According to the phylogenetic tree, S. buxifolium and S. grijsii (Hance) Merr. & L. M. Perry formed a minor branch apart from S. cumini (L.) Skeels, S. forrestii Merr. & L. M. Perry, and S. fluviatile (Hemsl.) Merr. & L. M. Perry. The phylogenetic tree’s strong bootstrap scores for each branch node within the genus Syzygium ().

Figure 3. Phylogenetic tree generated using maximum likelihood (ML) Bayesian interference (BI) and based on the chloroplast genome. ML bootstrap (BS) and BI posterior probability (PP) values are represented by numbers at the nodes. The 17 Syzyglium were downloaded from GenBank, namely S. forrestii (MK102721.1) (Zhang et al. Citation2018), S. cumini (MT700494.1), S. fluviatile (OP778216.1), S. buxifolium (OR699087), S. grijsii (ON243764.1), S. jambos (L.) Alston (MT731620.1) (Sun et al. Citation2020), S. megacarpum Rathakr. & N. C. Nair (OP778217.1), S. aromaticum (L.) Merr. & L.M.Perry (NC_047249.1), S. rehderianum Merr. & L. M. Perry (ON256193.1), S. samarangense (Blume) Merr. & L. M. Perry (MW698694.1) (Wei et al. Citation2022), S. nervosum DC. (MW054699.1) (Li et al. Citation2021), S. malaccense (L.) Merr. & L. M. Perry (MT830744.1), S. polyanthum (Wight) Walp. (OQ355361.1), S. australe (J. C. Wendl. ex Link) B. Hyland (OP778215.1), S. acuminatissimum (Blume) DC. (MT975437.1) (Zeng et al. Citation2021), S. odoratum (Lour.) DC. (MZ929420.1) (Chen et al. Citation2022), S. tsoongii (Merr.) Merr. & L. M. Perry (OP778218.1), S. album Q. F. Zheng (OL802920.1), Eugenia reinwardtiana (NC_084367.1).

Figure 3. Phylogenetic tree generated using maximum likelihood (ML) Bayesian interference (BI) and based on the chloroplast genome. ML bootstrap (BS) and BI posterior probability (PP) values are represented by numbers at the nodes. The 17 Syzyglium were downloaded from GenBank, namely S. forrestii (MK102721.1) (Zhang et al. Citation2018), S. cumini (MT700494.1), S. fluviatile (OP778216.1), S. buxifolium (OR699087), S. grijsii (ON243764.1), S. jambos (L.) Alston (MT731620.1) (Sun et al. Citation2020), S. megacarpum Rathakr. & N. C. Nair (OP778217.1), S. aromaticum (L.) Merr. & L.M.Perry (NC_047249.1), S. rehderianum Merr. & L. M. Perry (ON256193.1), S. samarangense (Blume) Merr. & L. M. Perry (MW698694.1) (Wei et al. Citation2022), S. nervosum DC. (MW054699.1) (Li et al. Citation2021), S. malaccense (L.) Merr. & L. M. Perry (MT830744.1), S. polyanthum (Wight) Walp. (OQ355361.1), S. australe (J. C. Wendl. ex Link) B. Hyland (OP778215.1), S. acuminatissimum (Blume) DC. (MT975437.1) (Zeng et al. Citation2021), S. odoratum (Lour.) DC. (MZ929420.1) (Chen et al. Citation2022), S. tsoongii (Merr.) Merr. & L. M. Perry (OP778218.1), S. album Q. F. Zheng (OL802920.1), Eugenia reinwardtiana (NC_084367.1).

4. Discussion and conclusions

In this study, we obtained the first chloroplast genome of S. buxifolium and characterized its genomic features. We determined the phylogenetic relationship between S. buxifolium and other species of Syzygium. We found it was closest to S. grijsii. Our study offers vital data for species identification and evolutionary analysis within the Syzygium genus.

The chloroplast genome possesses a conservative quadripartite structure spanning a length of 158,581 bp and harboring 111 distinct genes. The genome size of S. buxifolium is within the range of chloroplast genomes in other Syzygium species, which varies from 159,352 to 160,327 bp (Table S1) (Zhang et al. Citation2018; Sun et al. Citation2020; Chen et al. Citation2022; Wei et al. Citation2022).

The phylogeny study demonstrated a close genetic relationship between S. buxifolium and S. grijsii. This finding was consistent with the conventional morphological classification (Chase et al. Citation2016). Although the chloroplast genomes of both S. buxifolium (158,581 bp) and S. grijsii 158,591 bp) encoded the same numbers and types of genes and differed by only ten bases across the genome. In addition, we observed variation in several regions (e.g. trnG-trnF, trnE-trnT, ycf3, and rps32-trnF) (data not shown). Variations in these highly variable regions could be used for species identification. Unfortunately, we could not collect samples of S. grijsii to validate these highly variable regions experimentally, and we will next collect samples of S. grijsii for PCR validation.

In conclusion, we sequenced and characterized the chloroplast genome of S. buxifolium. Through phylogenetic analysis, we resolved the phylogenetic uncertainty of S. buxifolium. Lastly, we identified several highly variable regions that can be used for molecular marker development.

Disclosure statement

No potential conflict of interest was reported by the authors.

Data availability statement

The complete chloroplast genome sequences of S. buxifolium in this study have been submitted to the NCBI database under the accession number OR699087. The associated BioProject, Bio-Sample, and SRA numbers are PRJNA1055315, SAMN38989340, and SRS19952658.

Additional information

Funding

This work was supported by the Natural Science Foundation of Hunan Province (2021JJ40520) and Provincial Special Project for Construction of Innovation Demonstration Area at Chenzhou City under National Sustainable Development Plan(2023sfq04).

References

  • Asaf S, Khan AL, Khan A, Al-Harrasi A. 2020. Unraveling the chloroplast genomes of two Prosopis Species to identify its genomic information, comparative analyses and phylogenetic relationship. Int J Mol Sci. 21(9):3280. doi:10.3390/ijms21093280.
  • Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 30(15):2114–2120. doi:10.1093/bioinformatics/btu170.
  • Chase MW, Christenhusz MJM, Fay MF, Byng JW, Judd WS, Soltis DE, Mabberley DJ, Sennikov AN, Soltis PS, Stevens PF, et al. 2016. An update of the angiosperm phylogeny group classification for the orders and families of flowering plants: APG IV. Bot J Linn Soc. 181(1):1–20. doi:10.1111/boj.12385.
  • Chen J-m, Huang X-d, Cai J-x, Fang X-l, Chen L. 2016. Comparative study on antioxidant active of dichloromethane fraction and ethyl acetate fraction from Syzygium Buxifolium leave. J Xichang College·Nat Sci Ed. 30(4):11–12.
  • Chen L-D, Wang H-F, Hou D-J. 2022. The complete plastome of Syzygium odoratum (Lour.) DC. 1928 (Myrtaceae). Mitochondrial DNA B Resour. 7(4):705–706. doi:10.1080/23802359.2022.2067015.
  • Daniell H, Lin CS, Yu M, Chang WJ. 2016. Chloroplast genomes: diversity, evolution, and applications in genetic engineering. Genome Biol. 17(1):134. doi:10.1186/s13059-016-1004-2.
  • Ding H, Sun X, Li C, Zhao H, Jia B. 2018. Advances in studies on chemical constituents and pharmacological effects of Syzygium buxifolium. Chin J Ethnomed Ethnopharm. 04):34–36. (
  • Huang X-d. 2003. The primary study on micromorphology of leaf, chemical component and Artimi al activity of Syzygium buxigolium Hook. Et Arn [D]. 8.
  • Huong LT, Hung NH, Linh NN, Pham TV, Dai DN, Hop NQ, Setzer WN, Son NT, Andlauer W, Brück WM. 2023. Essential oils of five Syzygium species growing wild in Vietnam: chemical compositions and antimicrobial and mosquito larvicidal potentials. Molecules. 28(22):7505. doi:10.3390/molecules28227505.
  • Jin JJ, Yu WB, Yang JB, Song Y, dePamphilis CW, Yi TS, Li DZ. 2020. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21(1):241. doi:10.1186/s13059-020-02154-5.
  • Katoh K, Rozewicki J, Yamada KD. 2019. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 20(4):1160–1166. doi:10.1093/bib/bbx108.
  • Keeling PJ. 2004. Diversity and evolutionary history of plastids and their hosts. Am J Bot. 91(10):1481–1493. doi:10.3732/ajb.91.10.1481.
  • Krumsiek J, Arnold R, Rattei T. 2007. Gepard: a rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics. 23(8):1026–1028. doi:10.1093/bioinformatics/btm039.
  • Lee E, Harris N, Gibson M, Chetty R, Lewis S. 2009. Apollo: a community resource for genome annotation editing. Bioinformatics. 25(14):1836–1837. doi:10.1093/bioinformatics/btp314.
  • Letunic I, Bork P. 2021. Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49(W1):W293–w296. doi:10.1093/nar/gkab301.
  • Li P, Guo W, Lei K, Ji L. 2021. Characterization of the complete chloroplast genome of Syzygium nervosum. Mitochondrial DNA B Resour. 6(3):1014–1015. doi:10.1080/23802359.2021.1894999.
  • Liu S, Ni Y, Li J, Zhang X, Yang H, Chen H, Liu C. 2023. CPGView: a package for visualizing detailed chloroplast genome structures. Mol Ecol Resour. 23(3):694–704. doi:10.1111/1755-0998.13729.
  • Ngai HL, Kong BL, Lau DT, Shaw PC. 2023. Differentiation of Lingxiaohua and Yangjinhua by chloroplast genome sequencing and DNA barcoding markers. Genome. 66(2):21–33. doi:10.1139/gen-2022-0063.
  • Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 32(1):268–274. doi:10.1093/molbev/msu300.
  • Palmer JD. 1991. Chapter 2 – Plastid chromosomes: structure and evolution. In The molecular biology of plastids, edited by Bogorad L, Vasil IK; p. 5–53. Academic Press.
  • Qin X, Luo J, Gao Z. 2002. Chinese Yao pharmacy. Beijing: Nationalities Publishing House.
  • Schoch CL, Ciufo S, Domrachev M, Hotton CL, Kannan S, Khovanskaya R, Leipe D, Mcveigh R, O’Neill K, Robbertse B, et al. 2020. NCBI Taxonomy: a comprehensive update on curation, resources and tools. Database. 2020 doi:10.1093/database/baaa062.
  • Shi L, Chen H, Jiang M, Wang L, Wu X, Huang L, Liu C. 2019. CPGAVAS2, an integrated plastome sequence annotator and analyzer. Nucleic Acids Res. 47(W1):W65–W73. doi:10.1093/nar/gkz345.
  • Song L, Zhang G, Xu G, Xie Z, Ling Y, Wang X, Xiao P, Ding X, Cao C, Li Y, et al. 1999. Chinese Materia Medican. vol. 5, pp. 653–654. Shanghai: Shanghai Science and Technology Press.
  • Sun Z, Huang Q, Feng C. 2020. Complete chloroplast genome sequence of the rose apple, Syzygium jambos (Myrtaceae). Mitochondrial DNA B Resour. 5(3):3460–3462. doi:10.1080/23802359.2020.1826000.
  • Wei X, Li L, Xu L, Zhang X, Zeng L, Xu J. 2022. Complete chloroplast genome sequence of Syzygium samarangense (Myrtaceae) and phylogenetic analysis. Mitochondrial DNA B Resour. 7(6):977–979. doi:10.1080/23802359.2022.2080022.
  • Wicke S, Schneeweiss GM, dePamphilis CW, Müller KF, Quandt D. 2011. The evolution of the plastid chromosome in land plants: gene content, gene order, gene function. Plant Mol Biol. 76(3-5):273–297. doi:10.1007/s11103-011-9762-4.
  • Zeng F, Deng Y, Liu X, Zhu X, Tan G. 2021. The complete chloroplast genome of Syzygium acuminatissimum. Mitochondrial DNA B Resour. 6(1):127–128. doi:10.1080/23802359.2020.1847615.
  • Zhang D, Gao F, Jakovlić I, Zou H, Zhang J, Li WX, Wang GT. 2020. PhyloSuite: An integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol Ecol Resour. 20(1):348–355. doi:10.1111/1755-0998.13096.
  • Zhang X, Wang J-H, Wang H, Zhao K-K, Zhu Z, Wang H-F. 2018. Complete plastome sequence of Syzygium forrestii Merr. et Perry (Myrtaceae): an endemic species in China. Mitochondrial DNA Part B. 4(1):126–127. doi:10.1080/23802359.2018.1536474.
  • Zhao G, Dian S, Chen R. 2006. Dictionary of Chinese herbal medicine. p. 1450–1452Shanghai: shanghai Science and Technology Press.
  • Zhou F, Liang P, Zhou Q, Qin Z. 1998. Chemical constituents of the stem and root of Syzygium buxifolium Hook. Et Arn. Zhongguo Zhong Yao Za Zhi. 23(3):164–165, 192. 192.