425
Views
0
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

Characterization of the complete chloroplast genome of Betula pendula purple rain (betulaceae)

, , , , , & show all
Pages 281-284 | Received 28 Nov 2022, Accepted 30 Jan 2023, Published online: 20 Feb 2023

Abstract

Betula pendula purple rain is a variety of Betula pendula that is native to Europe and has important ornamental and economic value. In this study, we sequenced the complete chloroplast genome of B. pendula purple rain. This genome had a typical quadripartite structure with 160,552 bases, including a large single copy (LSC) region of 89,433 bases, a small single copy (SCC) region of 19,007 bases and two inverted repeat (IR) regions of 26,056 bases. The GC content of the chloroplast genome was 36% and contained 124 genes, including 79 protein-coding genes, 8 rRNA genes and 37 tRNA genes. The maximum likelihood phylogenetic analysis of reported chloroplast genomes showed that B. pendula purple rain was most closely related to Betula occidentalis and Betula platyphylla.

Introduction

B. pendula purple rain belongs to the family Betulaceae, which occurs from northern Xinjiang to the Altai Mountains, Europe and America (Li et al. Citation2009). B. pendula purple rain is a deciduous tree, up to 25 meters high, that grows in river beach, valley, foot of wet zone or sunny stone hillside areas located 500–2000 meters above sea level. Its leaves are purple, its trunk is grey–white, and its ornamental value is very high (Lin Citation2014). Chloroplasts are a kind of plastid that mainly exist in some cells of mesophylls and young stems of higher plants. Chloroplasts are also sites of photosynthesis in green plants, where pigments and proteins are synthesized (Xing and Liu Citation2008). The chloroplast genome of angiosperms consists of four major fragments: the small single copy region (SSC), the large single copy region (LSC), and two inverted repeat (IR) regions. Chloroplasts have the advantages of a relatively conserved structure, moderate base mutation rate and simple sequencing, and they have been widely used in the phylogenetic studies of various plant groups (Greiner et al. Citation2019). In this paper, for the first time, we report the complete chloroplast genome of B. pendula purple rain, which provides more detailed and complete data for the study of chloroplasts in Betulaceae.

Materials

The fresh leaves of B. pendula purple rain were collected from eight-year-old plants (N: 45°72′05900′, E: 126°63′4623″) from the Forest Genetics and Breeding Base of Northeast Forestry University on July 30, 2022 (). Professor Jiang Jing of Northeast Forestry University identified as B. pendula purple rain. The specimens were stored at Heilongjiang University of Chinese Medicine (http://yxy.hljucm.net/, Dr. Zhang, [email protected]) (registration number: HRB20220730001).

Figure 1. Photo taken at the Forest Genetics and Breeding Base of Northeast Forestry University on July 30, 2022. Eight-year-old B. pendula purple rain.

Figure 1. Photo taken at the Forest Genetics and Breeding Base of Northeast Forestry University on July 30, 2022. Eight-year-old B. pendula purple rain.

Methods

Total genomic DNA of B. pendula purple rain was extracted by the CTAB method and stored at Heilongjiang University of Chinese Medicine. Qualified DNA was used to construct a 150 bp paired-end library, which was sequenced by the Illumina NovaSeq6000 high-throughput sequencing platform. SOAPnuke software was used to identify and remove low-quality read fragments and missing reads. SPAdes software was used for assembly, and CPGAVAS2 software was used to annotate the complete chloroplast genome sequence of B. pendula purple rain (Bankevich et al. Citation2012; Shi et al. Citation2019). The chloroplast genome has been submitted to the National Genomics Data Center (NGDC:https://ngdc.cncb.ac.cn/) with the entry number (GSA:CRA008742) (CNCB-NGDC Members and Partners Citation2021).

Results

The chloroplast genome has a quadripartite structure with 160,552 bases, including a large single copy (LSC) region of 89,433 bases, a small single copy (SCC) region of 19,007 bases and two inverted repeat (IR) regions of 26,056 bases. The GC content of the chloroplast genome is 36% and contains 124 genes, including 79 protein-coding genes, 8 rRNA genes and 37 tRNA genes. Of these genes, ycf3 and clpP have two introns, whereas trnK-UUU, trnG-GCC, atpF, rpoC1, trnL-UAA, trnV-UAC, petB, petD, rpl16, rpl2, ndhB, trnI-GAU, trnA-UGC and ndhA have one intron.

A comparative chloroplast genome analysis of B. pendula purple rain and eight other species previously reported Betulaceae chloroplast genomes () showed that the chloroplast genome ranged in length from 160,263 to 161,123 bp, with a GC content of 36% to 36.4%. Together, these chloroplast genomes contain 72 common genes (Wang et al. Citation2019; Yoshida et al. Citation2020). These results suggest that the chloroplast genome of Betulaceae is relatively stable in length, structure and GC content and that the chloroplast genome is relatively conserved. To elucidate the phylogenetic relationship of B. pendula purple rain, the complete chloroplast genome sequences of 8 related Betula species were downloaded from NCBI, and the chloroplast genome sequences of 2 Alnus plants were selected as outgroups. Phylosuite software was used to extract common genes, MAFFT software (version: 7.313) was used to match multiple sequences of common genes, IQTREE software (version: v.1.6.8) was used to select the best fitting model (TVM + F + I), and the chloroplast genome evolutionary tree was obtained (Bootstraps: 1000). The chloroplast genome map of B. pendula purple rain. ().

Figure 2. The map contains four rings. From the center going outward, the first circle shows the forward and reverse repeats connected with red and green arcs respectively. The next circle shows the tandem repeats marked with short bars. The third circle shows the microsatellite sequences identified using MISA. The fourth circle is drawn using drawgenemap and shows the gene structure on the plastome. The genes were colored based on their functional categories.

Figure 2. The map contains four rings. From the center going outward, the first circle shows the forward and reverse repeats connected with red and green arcs respectively. The next circle shows the tandem repeats marked with short bars. The third circle shows the microsatellite sequences identified using MISA. The fourth circle is drawn using drawgenemap and shows the gene structure on the plastome. The genes were colored based on their functional categories.

Figure 3. Phylogenetic tree reconstruction of 11 samples using maximum likelihood based on complete chloroplast genome. Note: The sequences of Betula pendula Purple Rain was from NGDC.;The remaining sequence data were from NCBI; represents the species in this study.

Figure 3. Phylogenetic tree reconstruction of 11 samples using maximum likelihood based on complete chloroplast genome. Note: The sequences of Betula pendula Purple Rain was from NGDC.;The remaining sequence data were from NCBI; represents the species in this study.

Discussion and conclusion

In this study, we first report the complete chloroplast genome of B. pendula purple rain characterized by purple leaves. The maximum likelihood phylogenetic analysis of the reported chloroplast genomes showed that B. pendula purple rain was most closely related to B. occidentalis and B. platyphylla. B. pendula purple rain, B. occidentalis and B. platyphylla was sister to a monophyletic clade of B. utilis subsp albosinensis and B. chichibuensis at the interspecific level (BS = 100%). The chloroplast genome data of this plant provide more detailed and complete information for the study of evolutionary relationships within Betulaceae, which lays the foundation for plant identification and evolutionary genetic analysis.

Ethical approval

Dr. Meiqi Zhang obtained permission from Northeast Forestry University to collect plant samples. Due to the important ornamental value of B. pendula purple rain, Dr. Zhang collected the samples for further molecular research. Plant collection and experimental studies were conducted according to the Regulations of the People’s Republic of China on the Protection of New Plant Varieties issued by Northeast Forestry University.

Author contributions

ZMQ conceived the project. JJ and MW collected samples and performed research. GY, SXY, LWL and GYL analyzed data. ZMQ wrote the manuscript.JJ, MW, GY, SXY, LWL and GYL revised the manuscript. All authors read and approved the manuscript.

Acknowledgments

The authors are really grateful to the open raw genome data from public database.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

Data availability statement The genome sequence data that support the findings of this study are openly available in the National Genomics Data Center of NGDC at https://ngdc.cncb.ac.cn/ under the accession no. CRA008742. The associated BioProject, GSA, and Bio-Sample numbers are PRJCA012943, CRA008742, and SAMC976969, respectively.

Additional information

Funding

This study was supported by Heilongjiang Touyan Innovation Team Program [Grant Number: [2019] No. 5].

References

  • Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 19(5):455–477.
  • Greiner S, Golczyk H, Malinova I, Pellizzer T, Bock R, Brner T, Herrmann RG. 2019. Chloroplast nucleoids are highly dynamic in ploidy, number, and structure during angiosperm leaf development. Plant J. 2020;102(4):730–746.
  • Changhai L, Wei L, Dan Z,Zongyan S. 2009. Study on Introduction and Cultivation of Foreign Garden Trees. In: 2009 Annual Conference of China Botanical Garden and 50th Birthday of Guangxi Medicinal Botanical Garden, Nanning, Guangxi, China. pp. 244–250.
  • Lin L. 2014. Studies on the growth physiological characteristics and leaf color related gene expression of Betula chinensis Dr., Northeast Forestry University.
  • CNCB-NGDC Members and Partners. 2021. Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2021. Nucleic Acids Res. 49(D1):D18–D28.
  • Shi L, Chen H, Jiang M, Wang L, Wu X, Huang L, Liu C. 2019. CPGAVAS2, an integrated plastome sequence annotator and analyzer. Nucleic Acids Res. 47(W1):W65–W73.
  • Wang Y, Yang D, Zhang J. 2019. The complete chloroplast genome sequence of Betula alnoides. Mitochondrial DNA Part B. 4(1):1820–1821.
  • Xing SC, Liu CJ. 2008. Progress in chloroplast genome analysis. Progress in Biochemistry and Biophysics. 35(1):21–28.
  • Yoshida T, Igarashi Y, Hirao T. 2020. Characterization of the complete chloroplast genome of Betula chichibuensis (Betulaceae), a critically endangered limestone birch. Mitochondrial DNA B Resour. 5(3):2166–2167.