786
Views
1
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

Complete chloroplast genome sequence and phylogenetic analysis of Torenia fournieri

, &
Pages 2004-2006 | Received 03 Feb 2021, Accepted 31 May 2021, Published online: 14 Jun 2021

Abstract

Torenia fournieri belongs to the genus Torenia in the family Linderniaceae. The complete chloroplast genome of T. fournieri was sequenced and analyzed by Illumina sequencing in this study. The full length of the complete chloroplast genome is 153,938 bp, containing a pair of inverted repeat regions of 24,805 bp (IRa and IRb) separated by a large single copy region (LSC) of 85,498 bp and a small single copy region (SSC) of 18,830 bp. The T. fournieri chloroplast genome encodes 131 genes, comprising 87 protein-coding genes, 36 tRNA genes, 8 rRNA genes, without pseudogene. Phylogenetic analysis showed that T. fournieri was closely related to T. benthamiana and T. concolor within the genus Torenia in family Linderniaceae.

Torenia fournieri is one species of the genus Torenia (Linderniaceae), which distributes predominantly in Fujian, Guangdong, Guangxi, Taiwan, Yunan and Zhejiang provinces in China (Wu and Raven Citation1998). Torenia fournieri is not only an important tropical and subtropical ornamental plant, but also used as a model plant widely applied for flower research (Nishihara et al., Citation2018). Torenia fournieri used in this paper was planted in the Biological Garden of College of Life Sciences of Zhaoqing University (N23°6′, E112°30′, Zhaoqing, China), and the specimens (No: BGCLSZU001) were deposited in GDZQU herbarium of Zhaoqing University.

Firstly, the chloroplast genome DNA of T. fournieri was extracted from young leaves, and Covaris M220 (Covaris, Woburn, MA, USA) was used for breaking the DNA into about 300 bp fragments. Secondly, we constructed shotgun sequencing libraries according to the TruSeq™ DNA Sample Prep Kit for Illumina. Thirdly, whole genome sequencing was executed using the Illumina NovaSeq platform (Illumina, USA) (Genepioneer Biotechnologies Co. Ltd, Nanjing, China). Pair-end Illumina raw reads were cleaned from adaptors and barcodes and then quality filtered using Trimmomatic (Bolger et al. Citation2014). Then, reads were mapped to the chloroplast genome of the reference species (Genbank accession number: NC_045273.1), and the reads of nuclear and mitochondrial origins were excluded using Bowtie2 v2.2.4 (Langmead and Salzberg Citation2012). SPAdes 3.10.1 were used reconstruct the chloroplast genomes with de novo assembly method (Bankevich et al. Citation2012), and chloroplast contigs were concatenated into larger contigs using Sequencher 5.3.2 (Gene Codes Inc., Ann Arbor, MI, USA). A ‘genome walking’ technique, using the Unix ‘grep’ function, was used to find reads that could fill any gaps between contigs that did not assemble in the initial set of analyses (Souza et al. Citation2019). Jellyfish v.2.2.3 was used to correct misassembled contigs (Marcais and Kingsford Citation2011). Annotation of the chloroplast genomes were generated by CpGAVAS (Liu et al. Citation2012) and a circular representation was drawn with the online tool OGDRAW (Lohse et al. Citation2007). The complete chloroplast genome sequence has been submitted to Genbank with the accession number of MW307826.

The length of chloroplast genome sequence of T. fournieri is 153,938 bp, including two inverted repeat regions (IRa and IRb, each 24,805 bp) separated by a LSC (85,498 bp) region and a SSC (18,830 bp) region. The GC content of the overall chloroplast genome, IR regions, LSC, and SSC are 37.57, 43.45, 35.39 and 32.00%, respectively. The chloroplast genome contains 131 genes in total, including 87 protein-coding genes, 36 tRNAs, 8 rRNAs, without pseudogene.

The 15 whole genomes were used for phylogenetic tree analysis. First, we used MAFFT v7.427 (Katoh et al. Citation2005) -auto mode to align each sequence. The gaps in the alignment were removed using the program trimAl with ‘-nogaps’ v 1.4 (Capella-Gutierrez et al. Citation2009). Finally, Maximum-likelihood (ML) method was used to construct the phylogenetic tree with MEGA v7.0 (Kumar et al. Citation2016); the nucleotide substitution model used was the Kimura 2-parameter model with 1000 bootstrap replicates (). We found that T. fournieri was closely related to T. benthamiana and T. concolor within the genus Torenia in family Linderniaceae.

Figure 1. Phylogenetic tree reconstruction of 15 species based on sequences from whole chloroplast genomes. All the sequences were downloaded from NCBI Genbank.

Figure 1. Phylogenetic tree reconstruction of 15 species based on sequences from whole chloroplast genomes. All the sequences were downloaded from NCBI Genbank.

Author contributions

GC: Performed the experiments investigation, project administration, writing the original draft and data curation.

LW: Prepared the resources.

YW: Supervised the project and made revisions to the manuscript.

Disclosure statement

The authors declare no conflict of interest.

Data availability statement

The data that newly obtained at this study are available in the NCBI under accession number of MW307826 (https://www.ncbi.nlm.nih.gov/nuccore/MW307826).

Additional information

Funding

This study is funded by National Natural Science Foundation of China [81373901], National Survey on Chinese Materia Medica Resources of State Administration of Traditional Chinese Medicine [GZY-KJS-2018-284-001, GZY-KJS-2018-004, GZY-KJS-2019-004], and Special Subsidy Funds for National Public Health Service [CS (2016) No. 44].

References

  • Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, et al. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 19(5):455–477.
  • Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a fexible trimmer for Illumina sequence data. Bioinformatics. 30:2114–2120.
  • Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T. 2009. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 25(15):1972–1973.
  • Katoh K, Kuma K, Toh H, Miyata T. 2005. MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 33(2):511–518.
  • Kumar S, Stecher G, Tamura K. 2016. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol E. 33:1870–1874.
  • Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with Bowtie 2. Nat Methods. 9:357–359.
  • Liu C, Shi L, Zhu Y, Chen H, Zhang J, Lin X. 2012. CpGAVAS, an integrated web server for the annotation, visualization, analysis, and Genbank submission of completely sequenced chloroplast genome sequences. BMC Genomics. 13:715.
  • Lohse M, Drechsel O, Bock R. 2007. OrganellarGenomeDRAW (OGDRAW): a tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Curr. Genet. 52:267–274.
  • Marcais G, Kingsford C. 2011. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics. 27(6):764–770.
  • Nishihara M, Higuchi A, Watanabe A, Tasaki K. 2018. Application of the CRISPR/Cas9 system for modification of flower color in Torenia fournieri. BMC Plant Biol. 18:331.
  • Souza UJBd, Nunes R, Targueta CP, Diniz-Filho JAF, Telles MPC. 2019. The complete chloroplast genome of Stryphnodendron adstringens (Leguminosae – Caesalpinioideae): comparative analysis with related Mimosoid species. Sci Rep. 9:14206.
  • Wu Z, Raven P. 1998. Scrophulariaceae vol 18. Flora of China. Beijing (China): Science Press. p. 37–40.