494
Views
0
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

A complete chloroplast genome of Keteleeria davidiana (Pinaceae) and its phylogenetic implications

, , , &
Pages 2074-2075 | Received 09 Feb 2021, Accepted 18 Mar 2021, Published online: 21 Jun 2021

Abstract

Keteleeria davidiana (Bertrand) Beissner 1891 (Pinaceae) is a rare tertiary relict plant endemic to China. However, since the main morphological characteristics used for identifying K. davidiana are variable, some taxonomic treatments within the species are still controversial. Here a complete chloroplast genome of K. davidiana representing a special genotype was assembled, which could provide more information for the taxonomic study of this species. The assembled genome was 117,642 bp in length with a large single-copy (LSC) region (74,825 bp), a small single-copy (SSC) region (40,247 bp), and two incomplete inverted repeats (IRs) regions (1285 bp each). In total, 118 genes were predicted, including 4 rRNAs, 34 tRNAs, and 80 protein-coding genes. The overall GC content of the assembled genome was 38.5%. Phylogenetic analysis showed that different accessions of K. davidiana formed a clade with relatively low support (bootstrap value = 71), which indicated a high level of sequences variation within the species.

Keteleeria davidiana (Bertrand) Beissner 1891 (Pinaceae) has long been treated as a rare tertiary relict plant endemic to China. However, since the main morphological characteristics used for identifying K. davidiana are variable, some taxonomic treatments within the species are still controversial. For example Cheng et al. (Citation1975) published K. pubescens W. C. Cheng & L. K. Fu 1975, which was reduced as either a variety (Silba Citation1990) or a subspecies (Silba Citation2008) of K. davidiana, subsequently. In the latest literature (The Biodiversity Committee of Chinese Academy of Sciences Citation2020), K. pubescens was no longer recognized but treated as a synonym of K. davidiana. All these make the classification of K. davidiana extremely complicated. To address this issue, a complete chloroplast genome of K. davidiana, representing the genotype of ‘K. pubescens’, was assembled by using the next-generation sequencing, which could provide more information for the taxonomic study of this species.

The material of K. davidiana used in this study was collected from Kunming Institute of Botany, Chinese Academy of Sciences (25°8′36.91′′N, 102°45′8.46′′E). The voucher specimen was deposited at the herbarium of Nanjing Forestry University (Contact: Xian-Rong Wang, [email protected]) under the voucher number NF20200927002. Total DNA was isolated from the fresh leaves for sequencing library construction. Then the library was paired-end sequenced on the Illumina NovaSeq 6000 platform (Illumina Inc., San Diego, CA) by Genepioneer Biotechnologies Co., Ltd. (Nanjing, China) with the standard Illumina re-sequencing protocols. Finally, 6 Gb clean reads (Phred scores > 20) were obtained. The clean reads were first mapped to the plant chloroplast genome database using Bowtie2 (Langmead and Salzberg Citation2012) software for chloroplast reads extraction. Then SPAdes assembler version 3.10.0 (Bankevich et al. Citation2012) combined with SSPACE (Boetzer et al. Citation2011) was used for genome assembly. After that, GapFiller version 1.11 (Boetzer and Pirovano Citation2012) was further applied for gap filling, followed by PGA (Qu et al. Citation2019) software for genome annotation. The assembled chloroplast genome of K. davidiana was 117,642 bp in length with a large single-copy (LSC) region (74,825 bp), a small single-copy (SSC) region (40,247 bp), and a pair of inverted repeats (IRs) regions (1285 bp each). Unlike most angiosperm, the IR regions in K. davidiana were incomplete. Finally, 118 genes were predicted, including four rRNAs, 34 tRNAs (33 unique species), and 80 protein-coding genes (78 unique species). The overall GC content of the assembled genome was 38.5%.

Maximum likelihood (ML) tree was predicted to reveal the taxonomic status of K. davidiana. In total, 19 chloroplast genome sequences were used for phylogenetic tree construction including 18 downloaded from NCBI (). PhyloSuite software (Zhang et al. Citation2020) was used for genic region extraction, and only genic regions were used for phylogenetic tree construction. The data matrix was aligned by MAFFT version 7.158 (Katoh and Standley Citation2013). Then RAxML-VI-HPC (Stamatakis Citation2006) was used for phylogenetic inference under the GTR-gamma model with a bootstrap test of 1000 replications. Finally, the ML tree was edited and visualized using Figtree software (http://tree.bio.ed.ac.uk/software/figtree/). As shown in , two accessions of K. davidiana formed a clade with relatively low support (bootstrap value = 71). There were 80 single nucleotide polymorphisms (SNPs) and 15 insertion/deletion (indels) variations between the genome assembled in this study and the earlier one (Wu et al. Citation2009). Most of these variations (63 SNPs and 14 indels) were at non-coding regions. The sequence identity between the two chloroplast genomes was 99.93%, which indicated a high level of sequences variation within the species.

Figure 1. Maximum-likelihood phylogenetic tree based on 19 complete plastid genomes from Pinaceae. Two species from Pinus are set as outgroup. Number at the node shows bootstrap value (1000 replicates) for each branch. Number after ‘|’ shows the accession number in GenBank for each taxon. The position of Keteleeria davidiana reported in this study is marked in bold.

Figure 1. Maximum-likelihood phylogenetic tree based on 19 complete plastid genomes from Pinaceae. Two species from Pinus are set as outgroup. Number at the node shows bootstrap value (1000 replicates) for each branch. Number after ‘|’ shows the accession number in GenBank for each taxon. The position of Keteleeria davidiana reported in this study is marked in bold.

Disclosure statement

No potential conflict of interest was reported by the authors.

Data availability statement

The raw sequence data supporting this study are deposited in the National Center for Biotechnology Information Short Read Archive under BioProject ID PRJNA700108 (accession number SRP305000). The assembled genome and its annotation are openly available in GenBank of NCBI at https://www.ncbi.nlm.nih.gov, reference number MW580774.

Additional information

Funding

This study was funded by China Postdoctoral Science Foundation under Grant [number 2019M651839]; and Innovation Fund for Young Scholars of Nanjing Forestry University under Grant [number. CX2019029].

References

  • Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, et al. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 19(5):455–477.
  • Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. 2011. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics. 27(4):578–579.
  • Boetzer M, Pirovano W. 2012. Toward almost closed genomes with GapFiller. Genome Biol. 13(6):R56.
  • Cheng WC, Fu LK, Cheng CY. 1975. Gymnospermae sinicae. Acta Phytotax Sin. 13:56–89.
  • Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30(4):772–780.
  • Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with Bowtie 2. Nat Methods. 9(4):357–359.
  • Qu XJ, Moore MJ, Li DZ, Yi TS. 2019. PGA: a software package for rapid, accurate, and flexible batch annotation of plastomes. Plant Methods. 15:50.
  • Silba J. 1990. A supplement to the international census of the Coniferae. II. Phytologia. 68:7–78.
  • Silba J. 2008. Keteleeria davidiana subsp. pubescens (W. C. Cheng & L. K. Fu) Silba. J Int Conifer Preserv Soc. 15:48.
  • Stamatakis A. 2006. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 22(21):2688–2690.
  • The Biodiversity Committee of Chinese Academy of Sciences. 2020. Catalogue of life China: 2020 annual checklist. Beijing, China: The Biodiversity Committee of Chinese Academy of Sciences.
  • Wu CS, Lai YT, Lin CP, Wang YN, Chaw SM. 2009. Evolution of reduced and compact chloroplast genomes (cpDNAs) in gnetophytes: selection toward a lower-cost strategy. Mol Phylogenet Evol. 52(1):115–124.
  • Zhang D, Gao F, Jakovlić I, Zou H, Zhang J, Li WX, Wang GT. 2020. PhyloSuite: an integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol Ecol Resour. 20(1):348–355.