526
Views
3
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

The complete chloroplast genome of Androsace mariae

ORCID Icon, , , , , , & show all
Pages 376-377 | Received 16 Nov 2020, Accepted 13 Dec 2020, Published online: 08 Feb 2021

Abstract

The complete chloroplast genome of Androsace mariae was sequenced and assembled. It is a circular form genome of 151,958 bp in length, which was separated into four distinct regions, a large single-copy (LSC) of 83,292 bp, a small single-copy region (SSC) of 16,744 bp, two inverted repeats (IR) of 25,961 bp. After annotation, a total of 133 genes were predicted, of which, 87 were encoded proteins, 8 rRNA, and 37 tRNA. The evolutionary history, inferred using the neighbour-joining method, indicates that A. mariae was grouped within Primulaceae, and comprised a clade with other three species in Androsace, Androsace paxiana, Androsace laxa and Androsace bulleyana, with 100% bootstrap value.

Androsace mariae, belonging to Primulaceae, is an alpine perennial herbaceous plant with little red flowers. It is always found on the Qinghai–Tibet Plateau and adjacent highlands, it can survive at altitudes of 1800–4000 m (Delectis Florae Reipublicae Popularis Sinicae Agendae Academiae Sinicae Edita. Citation1989). Except Flora Reipublicae Popularis Sinicae, we have not found any other literatures of this plant. In this study, we reported its complete chloroplast (cp) genome.

Samples from Qilian mountains (36°34′37″N, 101°48′27″E) in Qinghai province were collected. Voucher specimen (GAUF20200531AMAR001) was deposited in the Herbarium, College of Forestry, Gansu Agricultural University. A sample was used for cp genome sequencing, in which the total DNA was extracted from fresh leaves and paired-end library was constructed. The sequencing was performed on the Illumina NovaSeq platform (Nanjing Jisihuiyuan biotechnology Co. Ltd). A total of 29,947,190 raw reads with 150-bp paired-end length were obtained. The complete cp genome was assembled with the de novo assembler SPAdes (Bankevich et al. Citation2012). Gene annotation was performed via prodigal v2.6.3 (Doug et al. Citation2010), hmmer v3.1b2 (hmmer.org, There is not yet any appropriate citable published paper that describes the HMMER3 software suite) and aragorn v1.2.38 (Dean and Bjorn Citation2004).

The cp genome of A. mariae (GenBank accession no. MT732944) has a typical quadripartite form of 151,958 bp in length, and composed of a large single-copy region (LSC, 83,292 bp), a small single-copy region (SSC, 16,744 bp), and two inverted repeats (IR, 25,961 bp). GC content of the genome is 37.27%. A total of 133 genes were predicted on this cp genome, of which, 87 were encoded proteins, 8 rRNA, and 37 tRNA.

Phylogenetic analysis was performed based on complete cp genomes of A. mariae and other 28 related species in Primulaceae, three species in Gentianaceae as outgroup. The sequences were aligned using MAFFT (Katoh et al. Citation2002) and trimAl was employed to remove ambiguously aligned sites (Capella-Gutierrez et al. Citation2009). The evolutionary history was inferred using the neighbour-joining method in MEGA7.0 (Kumar et al. Citation2016). Bootstrap (BS) values were calculated from 1000 replicate analysis (). As expected, A. mariae was grouped within Primulaceae, and comprised a clade with other three species in Androsace, Androsace paxiana, Androsace laxa, and Androsace bulleyana, with 100% BS value. The complete cp genome of A. mariae will be helpful for further studies on population genetics, taxonomy or resources protection.

Figure 1. NJ phylogenetic tree based on 32 species chloroplast genomes was constructed using MEGA7.0. Numbers on each node are bootstrap from 1000 replicate.

Figure 1. NJ phylogenetic tree based on 32 species chloroplast genomes was constructed using MEGA7.0. Numbers on each node are bootstrap from 1000 replicate.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

All Illumine reads supporting this research have been submitted to the NCBI Short Read Archive with BioProject ID: PRJNA678464 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA678464). The cp genome assembled and annotated is available in GenBank with accession number: MT732944 (https://www.ncbi.nlm.nih.gov/nuccore/MT732944.1).

Additional information

Funding

This work was supported by High-level talent foundation of Qinghai Nationalities University [grant No. 2017XJG03] and the National Natural Science Foundation of China [grant No. 31560215, 31760244].

References

  • Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, et al. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 19(5):455–477.
  • Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T. 2009. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 25(15):1972–1973.
  • Dean L, Bjorn C. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 32(1):11–16.
  • Delectis Florae Reipublicae Popularis Sinicae Agendae Academiae Sinicae Edita. 1989. Flora reipublicae popularis sinicae. Vol. 59(1). Beijing: Science Press; p. 193.
  • Doug H, Gwo-Liang C, Philip FL, Miriam LL, Frank WL, Hauser LJ. 2010. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 11:119.
  • Katoh K, Misawa K, Kuma K, Miyata T. 2002. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30(14):3059–3066.
  • Kumar S, Stecher G, Tamura K. 2016. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 33(7):1870–1874.