761
Views
3
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

Characterization of the complete chloroplast genome of leek Allium porrum L. (Amaryllidaceae)

ORCID Icon, &
Pages 2602-2603 | Received 10 Jun 2019, Accepted 22 Jun 2019, Published online: 22 Jul 2019

Abstract

The complete chloroplast genome sequence of Allium porrum was determined by Illumina single-end sequencing. The complete plastid genome was 152,732 bp in length containing a large single copy (LSC) of 81,744 bp and a small single copy (SSC) of 17,910 bp, which were separated by a pair of 26,524 bp inverted repeats (IRs). A total of 133 genes were annotated, including 80 protein-coding genes, 38 tRNA genes, 8 rRNA genes, and 7 pseudogenes. The overall GC contents of the plastid genome were 36.7%. Unlike A. sativum and A. obliquum in the leek, plastome infA gene is absent, and rpl23 gene is a pseudogene due to a 4 bp deletion and the formation of a premature stop codon.

Leek (Allium porrum L., Amaryllidaceae) due to its taste and valuable dietary properties is a popular vegetable crop, the annual production of which is more than 2 million tons per year (http://www.fao.org/faostat/). In leek, the entire above-ground part is edible – the bleached false stem and green leaves, which are used in food both raw and cooked. At the same time, leek, like other Allium species, is a rich source of secondary metabolites (Soininen et al. Citation2014).

The chloroplast genome A. porrum (cultivar ‘Premier’; seed from Federal Scientific Center of Vegetable Crops, Russia) was amplified via long range PCR using 11 pairs of primers developed on the basis of the Allium cepa plastid genome (Filyushin, Beletsky, et al. Citation2018), sequencing was conducted using the Illumina HiSeq 1500 Sequencing System with single-end 220 bp reads. Spades v.3.8 was used to assemble the high-quality short reads into contigs (Bankevich et al. Citation2012). Contigs were assembled against the complete chloroplast genome A. cepa (NC_024813) and A. sativum (NC_031829) as a reference. Gaps were closed using assembly graph in Bandage (Wick et al. Citation2015), reads were then mapped against the resulting single contig to ensure the correctness of the finished assembly. The plastid genome of A. porrum was annotated by using the DOGMA program (http://dogma.ccbb.utexas.edu). The start and stop codons for the genes were identified and corrected manually. All pseudogenes were additionally verified by Sanger sequencing from flanking primers.

The assembled A. porrum plastid genome (Genbank accession no. MK820026) was 152,732 bp in length, showing a typical quadripartite structure including a pair of inverted repeats (IRs) of 26,524 bp separating one large single copy region (LSC) of 81,744 bp and one small single copy region (SSC) of 17,910 bp. GC contents of the genome were 36.7%. A total of 133 genes were identified that include 80 protein-coding genes, 38 tRNA genes, 8 rRNA genes, and 7 pseudogenes.

Most of the genes are single copy, whereas 17 genes present in double copies, including 6 protein-coding genes (rps19, rpl2, ycf2, ndhB, rps7), 8 tRNA genes (trnR-ACG, trnL-CAA, trnV-GAC, trnH-GUG, trnI-CAU, trnI-GAU, trnA-UGC, trnN-GUU) and all 4 rRNA genes in IRs (rrn4.5, rrn5, rrn16 and rrn23). Intron sequences are found in 17 genes, 15 of which contain a single intron (atpF, rpoC1, ndhA, trnK-UUU, trnG-GCC, trnL-UAA, trnV-UAC; 4 genes in IRs: rpl2, ndhB, trnI-GAU, trnA-UGC), while two (clpP and ycf3) have two introns.

Seven genes became pseudogenes due to internal stop codons (rps2, rps16 and ycf15, and rpl23 in IRs) or because of incomplete duplication in the IRB/SSC junction region (ycf1). Unlike the Allium sativum (Filyushin et al. Citation2016) and Allium obliquum plastomes (Filyushin, Mazur, et al. Citation2018), in the leek plastome infA gene is absent (additionally verified by Sanger sequencing).

On the ML tree, A. porrum clustered with other Allium species, the closest to it is garlic A. sativum, with which they belong to the section Allium ().

Figure 1. Phylogenetic tree inferred by maximum-likelihood using 80 protein-coding gene sequences from plastomes of Allium species (Allium altaicum (NC_040972), Allium cepa (NC_024813), Allium fistulosum (NC_040222), Allium obliquum (NC_037199), Allium porrum (MK820026), Allium sativum (NC_031829)) and Asparagus officinalis (NC_034777) as an outgroup. PhyML 3.1 was used for the sequence alignment and construction of the tree. Bootstrap support values based on 1000 replicates are displayed on each node.

Figure 1. Phylogenetic tree inferred by maximum-likelihood using 80 protein-coding gene sequences from plastomes of Allium species (Allium altaicum (NC_040972), Allium cepa (NC_024813), Allium fistulosum (NC_040222), Allium obliquum (NC_037199), Allium porrum (MK820026), Allium sativum (NC_031829)) and Asparagus officinalis (NC_034777) as an outgroup. PhyML 3.1 was used for the sequence alignment and construction of the tree. Bootstrap support values based on 1000 replicates are displayed on each node.

Disclosure statement

The authors report no conflicts of interest. The authors alone are responsible for the content and writing of this article.

Additional information

Funding

This work was supported by the grant of the President of Russian Federation for Young Scientists Мк-3350.2019.11.

References

  • Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, et al. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 19:455–477.
  • Filyushin Ма, Beletsky AV, Mazur AM, Kochieva EZ. 2016. The complete plastid genome sequence of garlic Allium sativum L. Mitochondr DNA B. 1:831–832.
  • Filyushin MA, Beletsky AV, Mazur AM, Kochieva EZ. 2018. Characterization of the complete plastid genome of lop-sided onion Allium obliquum L. (Amaryllidaceae). Mitochondr DNA B. 3:393–394.
  • Filyushin MA, Mazur AM, Shchennikova AV, Kochieva EZ. 2018. Comparative analysis of the complete plastomes of garlic Allium sativum and bulb onion Allium cepa. Vavilov J Genet Br. 2:524–530.
  • Soininen TH, Jukarainen N, Soininen P, Auriola SO, Julkunen-Tiitto R, Oleszek W, Stochmal A, Karjalainen RO, Vepsäläinen JJ. 2014. Metabolite profiling of leek (Allium porrum L) cultivars by (1) H NMR and HPLC-MS. Phytochem Anal. 25:220–228.
  • Wick RR, Schultz MB, Zobel J, Holt KE. 2015. Bandage: interactive visualisation of de novo genome assemblies. Bioinformatics. 31:3350–3352.