409
Views
2
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

The complete chloroplast genome of Chimonobambusa purpurea (Bambusatae)

, , , , &
Pages 691-692 | Received 11 Oct 2020, Accepted 24 Jan 2021, Published online: 10 Mar 2021

Abstract

Chimonobambusa purpurea is one of the important bamboo species in southwest of China. We studied the complete chloroplast (cp) genome of C. purpurea in this study. The cp genome of C. purpurea (GenBank accession: MW030500) was 139,574 bp in length, including a large single-copy (LSC) region of 83,171 bp, a small single-copy (SSC) region of 12,811 bp, and a pair of inverted repeated (IR) regions of 21,796 bp. And the genome contained 133 genes, including 86 protein-coding genes, 39 tRNA genes, and 8 rRNA genes. Based on 30 cp genomes, we used the phylogenetic analysis to build phylogenetic tree, indicating that C. purpurea is closely related to C. tumidissinoda.

Chimonobambusa purpurea is one of the important bamboo species in southwest of China. C. purpurea is mainly distributed in Shaanxi, Hubei, and Sichuan in China (Li and Stapleton Citation2006). The bamboo poles of C. purpurea can be used as raw materials for papermaking and the bamboo shoots is edible, which has potential commercial value. In this study, we have studied and characterized the chloroplast genome of C. purpurea, and provide genomic and genetic resources for further research.

Genomic DNA was extracted from fresh leaves of an individual of C. purpurea. The specimens of C. purpurea were collected from Chongzhou Country, Chengdu City, Sichuan Province, China (N30°45′27″, E103°25′14″) on 2 June 2020. The specimens were deposited in the herbarium room of Chengdu Academy of Agricultural and Forestry Sciences (http://www.cdnky.com/, Bao-Xin Wang and [email protected]) under the voucher number LW20200602-01. First, a whole-genome shotgun library with an insert size of 400 bp was prepared, and then the library was sequenced by the Illumina NovaSeq platform and thus generating 150 bp paired-end reads. Ten million high-quality reads were mapped to the published Bambusa emeiensis chloroplast genomes as references using mummer version 3.1 (Kurtz et al. Citation2004). Second, we using A5-miseq v20150522 (Coil et al. Citation2015) and SPAdes version 3.9.0 to assemble these reads into complete chloroplast genomes (Bankevich et al. Citation2012). The assembled chloroplast genome sequence was annotated using the online Geseq web server (https://chlorobox.mpimp-golm.mpg.de/geseq.html). Follow the default parameters.

The complete plastid genome sequence of C. purpurea (GenBank accession: MW030500) was 139,574 bp in length, including a large single-copy (LSC) region of 83,171 bp, a small single-copy (SSC) region of 12,811 bp, and a pair of inverted repeated (IR) regions of 21,796 bp. The complete chloroplast genome contained 133 genes, including 86 protein-coding genes, 39 tRNA genes, and 8 rRNA genes. The complete genome GC content was 38.90%, and the corresponding values of the LSC, SSC, and IR were 36.99%, 33.23%, and 44.23%, respectively.

The maximum-likelihood phylogenetic tree was constructed based on 30 complete chloroplast genomes of Bambusoideae species, and Phaenosperma globosum as outgroup. The sequences were aligned by MAFFT version 7.037 (Katoh and Standley Citation2013), and the phylogenetic tree constructed by IQ-TREE Multicore version 1.6.12 (Gao et al. Citation2018). The result indicated that C. purpurea has a close relationship with C. tumidissinoda ().

Figure 1. Maximum-likelihood phylogenetic analysis of 30 species of Bambusoideae and Phaenosperma globosum as outgroup based on plastid genome sequences by IQ-TREE multicore version 1.6.12 under GTR + R6 model for 5000 ultrafast bootstraps. Branch lengths (above) and bootstrap values (below) were indicated around nodes. GeneBank accession numbers of each species were listed in the tree.

Figure 1. Maximum-likelihood phylogenetic analysis of 30 species of Bambusoideae and Phaenosperma globosum as outgroup based on plastid genome sequences by IQ-TREE multicore version 1.6.12 under GTR + R6 model for 5000 ultrafast bootstraps. Branch lengths (above) and bootstrap values (below) were indicated around nodes. GeneBank accession numbers of each species were listed in the tree.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The genome sequence data that support the findings of this study are openly available in GenBank of NCBI at (https://www.ncbi.nlm.nih.gov/) under the accession no. MW030500. The associated BioProject, SRA, and BioSample numbers are PRJNA684786, SRR13258662, and SAMN17069327, respectively.

Additional information

Funding

This work was supported by the Chengdu Municipal Financial Science and Technology Project [2019-YF05-01649-SN].

References

  • Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, et al. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 19(5):455–477.
  • Coil D, Jospin G, Darling AE. 2015. A5-MiSeq: an updated pipeline to assemble microbial genomes from Illumina MiSeq data. Bioinformatics. 31 (4):587–589.
  • Gao FL, Shen JG, Liao FR, Cai W, Lin SP, Yang HK, Chen SL. 2018. The first complete genome sequence of narcissus latent virus from Narcissus. Arch Virol. 163(5):1383–1386.
  • Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30(4):772–780.
  • Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg SL. 2004. Versatile and open software for comparing large genomes. Genome Biol. 5(2):R12.
  • Li DZ, Stapleton C. 2006. Flora of China 22. Beijing, China: Science Press; p. 152–161.