582
Views
2
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

Characterization of the complete chloroplast genome of Pinus wangii (Pinaceae), an endangered and endemic species in China

ORCID Icon, & ORCID Icon
Pages 1195-1197 | Received 31 Jul 2018, Accepted 31 Aug 2018, Published online: 29 Oct 2018

Abstract

The complete cp genome of Piuns wangii was 117,014 bp in size, containing a large single copy (LSC, 64,061 bp) region and a small single copy (SSC, 52,007 bp) region separated by two reduced inverted repeats (IRs, 473 bp). A total of 107 genes were predicted, including 73 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. Most of the 51 simple sequence repeats (SSRs) were mononucleotides motifs of A/T types and were found to be located in non-coding regions. Phylogenetic analysis of Asian Pinus showed P. wangii clustered a strongly-supported clade with Pinus morrisonicola, Pinus sibirica, Pinus wallichiana, and Pinus pumila.

Pinus wangii Hu and W. C. Cheng is a five needle pine species, which belongs to the genus Pinus of the family Pinaceae. The species is sparsely distributed in limestone habitats of Xichou, Maguan, and Malipo counties of Yunnan Province (Fu et al. Citation1999). It is an important preferred afforestation tree species and wood sources for coffin, furniture, and potted landscapes (Xiang et al. Citation2017). In recent years, the natural habitat of P. wangii is severely damaged by over-exploitation, as a result, only five fragmented populations and 358 individuals are remaining on inaccessible cliffs (Zhou et al. Citation2012). It is the extremely small population species endemic to Yunnan province (Zhou et al. Citation2012) and ‘Endangered’ species in IUCN Red List (http://www.iucnredlist.org/). Meanwhile, it is listed as a national second-class protected plant in China (http://rep.iplant.cn/). It is urgent to take effective measures to protect this endangered and endemic species based on field investigation and scientific research (Zhang et al. Citation2015). In addition, its phylogenetic position is unclear in previous studies because of the lack of sampling. Here we characterized the complete chloroplast (cp) genome sequence of P. wangii based on the genome skimming sequencing data, detected the occurrence, type, and distribution of simple sequence repeats (SSRs) and constructed the phylogenetic tree based on the maximum-likelihood (ML) method.

The fresh leaves of P. wangii were collected from Kunming Botanical Garden. The voucher specimen (YMQ2018020) was deposited at Herbarium, Kunming Institute of Botany, CAS (KUN). The total DNA was extracted by TIANGEN Kit (Beijing, China). The genome skimming sequencing data was achieved on the Illumina Hiseq 2500 platform in Novogene Bioinformatics Technology Co., Ltd. (Beijing, China). The cp genome was assembled using CLC Genomics Workbench v.8.5.1 (CLC Bio, Aarhus, Denmark) with P. armandii Franch, as the reference (Accession Number KP412541) (Li et al. Citation2016). The annotation of the cp genome was conducted using Dual Organellar Genome Annotator (DOGMA) (Wyman et al. Citation2004) with manual adjustments for start and stop codons and intron/exon boundaries of protein-coding genes. The online tRNAscan-SE Search Service (Lowe and Chan Citation2016) (http://lowelab.ucsc.edu/tRNAscan-SE/) was used to further confirm tRNA genes. The complete cp genome sequence was submitted to the GenBank under Accession Number of MH167924. The circular map of the cp genome was drawn with the OGDRAW (Lohse et al. Citation2013; http://ogdraw.mpimp-golm.mpg.de/). The complete cp genome of P. wangii was 117,014 bp in length. It was the typical quadripartite structure, in which two inverted repeats (IRs, 473 bp) were separated by the large single-copy (LSC, 64,061 bp) and the small single-copy (SSC, 52,007 bp) regions. The cp genome had 107 genes, including 73 protein-coding genes (PCGs), 30 tRNA genes, and 4 rRNA genes. Most of genes occurred as single-copy, while three tRNAs (trnS-GCU, trnR-ACG and trnT-GGU) and one tRNA (TrnI-GAU) had two and three copies, respectively. In addition, two PCGs (rps12 and ycf3) had two introns each, six PCGs (atpF, petB, petD, rpl2, rpl16, and rpoC1) and six tRNA genes (trnA-UGC, trnG-UCC, trnI-GAU, trnK-UUU, trnL-UAA and trnV-UAC) contained one intron. The overall GC content of the cp genome was 38.8%, while that of LSC, SSC and IR regions was 38.0%, 39.7% and 37.6%, respectively.

We used MISA (MicroSatellite identification tool; http://pgrc.ipk-gatersleben.de/misa/) to identify and localize the perfect microsatellites. The unit sizes of mono-, di-, tri-, tetra-, penta-, hexa-nucleotide repeats were set to minimum number of repeats of 10, 5, 4, 3, 3, 3, respectively. We removed one inverted repeat region (IRA) in SSRs analysis and manually checked the detected repeats. The occurrence and distribution of different types of SSR in the cp genome were summarized in . A total of 51 SSRs were detected throughout the cp genome of P. wangii, with 37, six, five, two and one for mono-, di-, tetra-, penta-, and hexa-nucleotide repeats, respectively. The majority of the mono-nucleotides were A or T and all of di-nucleotides were AT or TA repeats, which was consistent with the A/T-richness in the complete cp genome (Xuan et al. Citation2013). Five SSRs were located in four PCGs (petD, rpl32, rpoB and ycf1 (×2)), four were observed in introns (atpF, rps12 and ycf3 (×2)), one was found in rRNA (rrn23) and one was identified in the boundary of IGS and PCG (accD) of the cp genome of P. wangii. Most of these SSRs were found in SSC region (28/51), followed by SSC region (23/51).

Table 1. Simple sequence repeats (SSRs) identified in the chloroplast genome of Pinus wangii.

For maximum likelihood analyses, the complete chloroplast genome of 21 Asian Pinus species and one Picea species as outgroup were downloaded from NCBI GenBank. The combined datasets based on plastid genomes of 23 species were aligned by MAFFT v7.307 (Katoh and Standley Citation2013). A maximum-likelihood (ML) tree was constructed in RAxML (Stamatakis Citation2014) with the GTR + G model, and a total of 1000 bootstrap replicates were performed. The phylogenetic results showed that P. wangii formed a strongly-supported clade with Pinus morrisonicola, Pinus sibirica, Pinus wallichiana and Pinus pumila ().

Figure 1. Phylogenetic tree inferred by Maximum Likelihood (ML) method based on the complete chloroplast genome of 22 species of Asian Pinus, bootstrap values (%) are shown on the branch.

Figure 1. Phylogenetic tree inferred by Maximum Likelihood (ML) method based on the complete chloroplast genome of 22 species of Asian Pinus, bootstrap values (%) are shown on the branch.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This study was supported by the National Natural Science Foundation of China under Grant no. 3150018; and Applied Basic Research Project of Yunnan under Grant no. 2017FB150.

References

  • Fu LG, Li N, Elias TS, Mill RR. 1999. Pinaceae. In: Wu ZY, Raven PH, editors. Flora of China. vol. 4. Beijing: Science Press, St. Louis: Missouri Botanical Garden Press; p. 11–52.
  • Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol E. 30:772–780.
  • Li ZH, Qian ZQ, Liu ZL, Deng TT, Zu YM, Zhao P, Zhao GF. 2016. The complete chloroplast genome of armand pine Pinus armandii, an endemic conifer tree species to China. Mitochondr DNA. 27:2635–2636.
  • Lohse M, Drechsel O, Kahlau S, Bock R. 2013. OrganellarGenomeDRAW—a suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets. Nucleic Acids Res. 41:W575–W581.
  • Lowe TM, Chan PP. 2016. tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 44:W54–W57.
  • Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 30:1312–1313.
  • Wyman SK, Jansen RK, Boore JL. 2004. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 20:3252–3255.
  • Xiang ZY, Zhang SS, Kang HM, Yang WZ. 2017. Effect of salicylic acid on high quality seedling forming and physiological index of Pinus wangii. Chinese Agric Sci Bull. 33:29–32.
  • Xuan Y, Lei G, Bo W, Su YJ, Wang T. 2013. The complete chloroplast genome sequence of Cephalotaxus oliveri (Cephalotaxaceae): evolutionary comparison of Cephalotaxus chloroplast DNAs and insights into the loss of inverted repeat copies in gymnosperms. Genome Biol E. 5:688–698.
  • Zhang ZY, Cui B, Mao JF, Pang XM, Li YY. 2015. Isolation and characterization of thirty-eight microsatellite loci for the Pinus wangii, an endangered species. Conserv Genet Resour. 7:397–398.
  • Zhou Y, Jiang H, Yang WZ, Zhang SS, Xiang ZY. 2012. Study on stock of Pinus wangii, an extremely small population species. J West China Forestry Sci. 41:80–83.