521
Views
0
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

Complete chloroplast genome sequence of Sphaerophysa salsula (Leguminosae)

, , , , , & ORCID Icon show all
Pages 399-401 | Received 19 Nov 2020, Accepted 16 Dec 2020, Published online: 08 Feb 2021

Abstract

Sphaerophysa salsula (Pall.) DC. is a perennial herbaceous plant belonging to the genus Sphaerophysa, Galegeae, Leguminosae, and is mainly distributed in dry areas in Central Asia and Northwest China. The complete chloroplast genome with a total size of 123,300 bp was reported in this study. Further annotation revealed the chloroplast genome contains 109 genes, including 76 protein coding genes, 29 tRNA genes, and four rRNA genes. A total of 107 simple sequence repeats (SSRs) from mononucleotide to hexa-nucleotide repeat motif were identified in the chloroplast genome. This information will be useful for study on the evolution and genetic diversity of Sphaerophysa salsula in the future.

Sphaerophysa salsula (Pall.) DC., is a highly nutritive and drought-tolerant perennial grass belonging to the genus Sphaerophysa (family Leguminosae), and mainly distributed in Central Asia and Northwest China. There are only two species in this genus worldwide, and only one species was found in China. Metabolites extracted from S. salsula whole herbs or seeds have many pharmacological activities including bactericidal (Ma et al. Citation2002), insecticidal (Li et al. Citation2012), antioxidative (Venkateswarlu et al. Citation2003), and anti-tumor effects (Wang and Ma Citation2009). In addition, S. salsula usually grows in dry and saline soils in the desert regions or salt pond in Northwest China, it also grows as forage in winter in hillsides and grasslands. Thus, it is an ecologically and economically important plant.

Most chloroplast genomes are characterized by a quadripartite structure, with two copies of an inverted repeat (IR) separating the large (LSC) and small (SSC) single copy regions. A number of previous studies have examined the phylogenetic distribution of different plastid genome rearrangements among legumes, including the loss of one copy of the IR (Palmer and Thompson Citation1982; Lavin et al. Citation1990; Jansen et al. Citation2008). The probably most dramatic example of the phylogenetic utility of a plastid genomic rearrangement among legumes is the loss of one copy of the IR by all members sampled from the tribes Carmichaelieae, Cicereae, Hedysareae, Trifolieae, Fabeae, Galegeae, and three genera of Millettieae (Lavin et al. Citation1990; Liston Citation1995; Jansen et al. Citation2008). The monophyly of this clade with the loss of one copy of the IR is known as the IR-lacking clade or IRLC (Wojciechowski et al. Citation2000). In this study, to obtain the new insight into the phylogeny of S. salsula, we assembled and annotated the accurate chloroplast genome from sequenced data of S. salsula with Illumina HiSeq platform.

The materials of S. salsula in this study were collected from Atushi City, Kizilsu Kirghiz Autonomous Prefecture, Xinjiang province of China (76°58.491′E, 40°63.411′N, 1662 m above sea level). The voucher specimen (TD-00637, Sphaerophysa salsula (Pall.) DC.) was stored in the herbarium of Tarim University. First, the total genomic DNA was extracted using CTAB method and sequenced using the Illumina HiSeq platform. Then, we removed the adaptors and reads with low quality from the raw data (SRR12875154), and the whole chloroplast genome was assembled using GetOrganelle (Jin et al. Citation2020). Finally, the chloroplast gene structures were annotated using CPGAVAS2 (Shi et al. Citation2019) and PGA (Qu et al. Citation2019). The complete chloroplast genome was 123,300 bp (MW122834), the average GC content was 34.09%. The chloroplast genomes encoded 109 functional genes, including 76 protein-coding genes (Supplementary Table S1), 29 tRNA genes, and four rRNA genes. A total of 107 simple sequence repeat (SSR) markers ranging from mononucleotide to hexa-nucleotide repeat motif were identified in S. salsula chloroplast genome.

In our study, to explore the phylogenetic relationship of S. salsula within Leguminosae, additional 33 species from Leguminosae were studied. With the Polygala japonica and Polygala tenuifolia as the outgroups, the phylogenetic trees were built from the whole protein-coding gene matrix by maximum-likelihood (ML) and Bayesian inference (BI) (). The ML tree was generated using IQ-TREE (Nguyen et al. Citation2015) based on the best model of TVM + F+R3 and 1000 bootstrap replicates, and BI analysis was performed in MrBayes-3.2.7. This result showed that the analyzed S. salsula was closer to the species of Lessertia frutescens, both of which belong to the tribe Galegeae. The chloroplast genome of S. salsula owns the same gene number (109) as that of Medicago truncatula (the model legume) with just one different gene, ycf68 in M. truncatula instead of ycf4 in S. salsula. In the phylogenetic tree (), the tribe Galegeae with S. salsula showed closely related to the tribe Hedysareae, but was distant from the tribe Trifolieae including M. truncatula.

Figure 1. Phylogenetic tree reconstructed by maximum-likelihood (ML) and Bayesian inference (BI) analysis based on the whole chloroplast protein-coding genes of these 36 species.

Figure 1. Phylogenetic tree reconstructed by maximum-likelihood (ML) and Bayesian inference (BI) analysis based on the whole chloroplast protein-coding genes of these 36 species.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The genome sequence data that support the findings of this study are openly available in GenBank of NCBI at https://www.ncbi.nlm.nih.gov/ under the accession no. MW122834. The associated ‘BioProject’, ‘SRA’, and ‘Bio-Sample’ numbers are PRJNA670257, SRR12875154, and SAMN16491006, respectively.

Additional information

Funding

This work was financially supported by the National Natural Sciences Foundation of China [31660069], Joint Foundation of Huazhong Agricultural University & Tarim University Research Cooperation [TDHNLH201701], The Innovative Team Building Plan for Key Areas of Xinjiang Production and Construction Corps [2018CB003], Xinjiang Production & Construction Corps Key Laboratory of Protection and Utilization of Biological Resources in Tarim Basin [BRZD2003].

References

  • Jansen RK, Wojciechowski MF, Sanniyasi E, Lee S-B, Daniell H. 2008. Complete plastid genome sequence of the chickpea (Cicer arietinum) and the phylogenetic distribution of rps12 and clpP intron losses among legumes (Leguminosae). Mol Phylogenet Evol. 48(3):1204–1217.
  • Jin J-J, Yu W-B, Yang J-B, Song Y, dePamphilis CW, Yi T-S, Li D-Z. 2020. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 21(1):241.
  • Lavin M, Doyle JJ, Palmer JD. 1990. Evolutionary significance of the loss of the chloroplast–DNA inverted repeat in the Leguminosae subfamily Papilionoidae. Evolution. 44(2):390–402.
  • Li H, Jiang ZL, Ma WB, Feng JT, Zhang X. 2012. Larvicidal activity of the botanical extracts from 96 plants against Culex pipiens pallens. Acta Agric Boreali-Occidentalis Sin. 21(10):156–164.
  • Liston A. 1995. Use of the polymerase chain reaction to survey for the loss of the inverted repeat in the legume chloroplast genome. In: Crisp M, Doyle JJ, editors. Advances in legume systematics 7: phylogeny. Kew: Royal Botanic Gardens; p. 31–40.
  • Ma ZJ, Hou BL, Wang JH, Li X. 2002. Isoflavans from Sphaerophysa salsula. Pharmazie. 57(1):75–76.
  • Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 32(1):268–274.
  • Palmer JD, Thompson WF. 1982. Chloroplast DNA rearrangements are more frequent when a large inverted repeat sequence is lost. Cell. 29(2):537–550.
  • Qu X-J, Moore MJ, Li D-Z, Yi T-S. 2019. PGA: a software package for rapid, accurate, and flexible batch annotation of plastomes. Plant Methods. 15(1):50.
  • Shi L, Chen H, Jiang M, Wang L, Wu X, Huang L, Liu C. 2019. CPGAVAS2, an integrated plastome sequence annotator and analyzer. Nucleic Acids Res. 47(W1):W65–W73.
  • Venkateswarlu S, Satyanarayana B, Sureshbabu CV, Subbaraju GV. 2003. Synthesis and antioxidant activity of 4-[2-(3,5-dimethoxyphenyl)ethenyl]-1,2-benzenediol, a metabolite of Sphaerophysa salsula (organic chemistry). Biosci Biotechnol Biochem. 67(11):2463–2466.
  • Wang D, Ma Z. 2009. Cytotoxic activity of cycloartane triterpenoids from Sphaerophysa salsula. Nat Prod Commun. 4(1):23–25.
  • Wojciechowski MF, Sanderson MJ, Steele KP, Liston A. 2000. Molecular phylogeny of the “temperate herbaceous tribes” of papilionoid legumes: a supertree approach. In: Herendeen P, Bruneau A, editors. Advances in legume systematics, part 9. Kew: Royal Botanic Garden; p. 277–298.