1,100
Views
18
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

The complete chloroplast genome of ornamental coffee tree, Coffea arabica L. (Rubiaceae)

ORCID Icon, ORCID Icon, & ORCID Icon
Pages 1059-1060 | Received 10 Jan 2019, Accepted 10 Feb 2019, Published online: 05 Mar 2019

Abstract

Coffea arabica L. taking 70% of world coffee production is also used as an ornamental species. One imported coffee tree from Indonesia near to thirty years ago of which leaf is wrinkled (named as IN1) was chosen to know its genetic background. Here, we presented complete chloroplast genome of Coffea arabica IN1 which is 155,277 bp long and has four subregions: 85,248 bp of large single copy (LSC) and 18,137 bp of small single copy (SSC) regions are separated by 25,946 bp of inverted repeat (IR) regions including 131 genes (86 protein-coding genes, eight rRNAs, and 37 tRNAs). The overall GC content of the chloroplast genome is 37.4% and those in the LSC, SSC, and IR regions are 35.4%, 31.3%, and 43.0%, respectively. In comparison to three available coffee chloroplast genomes, 84 bp insertion on IN1 chloroplast genome is found, which is big differences in comparison to other available coffee chloroplast genomes. Even though relatively low number of sequence variations on coffee chloroplast genomes, these results can be used as a corner stone for establishing molecular markers to identify its origin or cultivars.

Coffea arabica L., has been cultivated world-wide along with ‘Bean belt’ (Bentley and Baker Citation2000), occupying 70% of world coffee production (Lashermes et al. Citation1999; O'brien and Kinnaird Citation2003). Several species in Coffea genus are used as ornamental species because of good-shape leaves, e.g., Coffea horsfieldiana has been used as ornamental species in Indonesia (Wellman Citation1961). There is one Korean gardener who imported coffee trees from Indonesia around 30 years ago for selling. Its leaf is more wrinkled than other cultivated coffee trees. To accumulate genetic diversity of coffee chloroplast genomes, we chose this wrinkled-leaf coffee tree for completing a third coffee chloroplast genomes.

Total DNA of IN1 coffee tree (Y. Kim, IB-00587 in InfoBoss Cyber Herbarium (IN)) was extracted from fresh leaves using a DNeasy Plant Mini Kit (QIAGEN, Hilden, Germany). Genome sequencing was performed using HiSeq2000 at Macrogen Inc., Korea, de novo assembly and sequence confirmation were conducted by Velvet 1.2.10 (Zerbino and Birney Citation2008), SOAPGapCloser 1.12 (Zhao et al. Citation2011), BWA 0.7.17 (Li Citation2013), and SAMtools 1.9 (Li et al. Citation2009). Geneious R11 11.0.5 (Biomatters Ltd., Auckland, New Zealand) was used for chloroplast genome annotation based on Coffea arabica chloroplast genome (NC_008535; Samson et al. Citation2007).

Chloroplast genome of IN1 (Genbank accession is MK353212) is 155,277 bp and has four subregions: 85,248 bp of large single copy (LSC) and 18,137 bp of small single copy (SSC) regions are separated by 25,946 bp of inverted repeat (IR). It contains 131 genes (86 protein-coding genes, 8 rRNAs, and 37 tRNAs); 19 genes (8 protein-coding genes, 4 rRNAs, and 7 tRNAs) are duplicated in IR regions. The overall GC content is 37.4% and in the LSC, SSC, and IR regions are 35.4%, 31.3%, and 43.0%, respectively.

Based on alignment with three C. arabica chloroplast genomes (NC_008535, MK342634, MK353209, named as CA, CH3, and HP1, respectively; Samson et al. Citation2007; Park et al. The complete chloroplast genome of cold hardiness individual of Coffea arabica L. (Rubiaceae), accepted; Park et al. The complete chloroplast genome of high production individual tree of Coffea arabica L. (Rubiaceae), accepted), one big insertion (84 bp) is found only on IN1 chloroplast genome, which is a big difference in comparison to sequence variations found in the other coffee chloroplast genomes (Park et al. The complete chloroplast genome of cold hardiness individual of Coffea arabica L. (Rubiaceae), accepted; Park et al. The complete chloroplast genome of high production individual tree of Coffea arabica L. (Rubiaceae), accepted). One deletion is CH3-specific, two deletions and two single nucleotide polymorphisms (SNPs) are CA-specific, and one 2-bp-deletion is identified in HP1. One SNP presents three different bases at 111,618 bp of CA chloroplast genome (Park et al. The complete chloroplast genome of cold hardiness individual of Coffea arabica L. (Rubiaceae), accepted). All these variations are in intergenic or intron areas on chloroplast genomes.

Six Coffea and four chloroplast genomes of Rubiaceae were used as outgroup species for constructing neighbor joining (bootstrap repeat is 10,000) and maximum-likelihood (bootstrap repeat is 1,000) trees using MEGA X (Kumar et al. Citation2018) based on the alignment by MAFFT 7.388 (Katoh and Standley Citation2013). Phylogenetic trees show that IN1 is clustered with other Coffea arabica chloroplast genomes. Even though relatively low number of sequence variations are observed on coffee chloroplast genomes, these results can be used as a corner stone for establishing molecular markers to identify its origin or cultivars.

Figure 1. (A) Neighbor joining (bootstrap repeat is 10,000) and maximum likelihood (bootstrap repeat is 1,000) phylogenetic trees of five Coffea and four Rubiaceae complete chloroplast genomes: five Coffea arabica (MK353212, in this study, NC_008535, KY085909, MK342634, and MK353209), Coffea canephora (NC_030053), Mitragyna speciosa (NC_034698), Dunnia sinensis (NC_039965), Emmenopterys henryi (NC_036300), and Gynochthodes nanlingensis (NC_028614). Phylogenetic tree was drawn based on neighbor joining tree. The numbers above branches indicate bootstrap support values of neighbor joining and maximum likelihood phylogenetic trees, respectively.

Figure 1. (A) Neighbor joining (bootstrap repeat is 10,000) and maximum likelihood (bootstrap repeat is 1,000) phylogenetic trees of five Coffea and four Rubiaceae complete chloroplast genomes: five Coffea arabica (MK353212, in this study, NC_008535, KY085909, MK342634, and MK353209), Coffea canephora (NC_030053), Mitragyna speciosa (NC_034698), Dunnia sinensis (NC_039965), Emmenopterys henryi (NC_036300), and Gynochthodes nanlingensis (NC_028614). Phylogenetic tree was drawn based on neighbor joining tree. The numbers above branches indicate bootstrap support values of neighbor joining and maximum likelihood phylogenetic trees, respectively.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was supported by InfoBoss Research Grant [IBG-0011].

References

  • Bentley JW, Baker PS. 2000. The Colombian Coffee Growers' Federation: organised, successful smallholder farmers for. London: ODI.
  • Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30:772–780.
  • Kumar S, Stecher G, Li M, Knyaz C, Tamura K. 2018. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol Biol Evol. 35:1547–1549.
  • Lashermes P, Combes M-C, Robert J, Trouslot P, D'Hont A, Anthony F, Charrier A. 1999. Molecular characterisation and origin of the Coffea arabica L. genome. Mol Gen Genet. 261:259–266.
  • Li H. 2013. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Quantitative Biol. arXiv Preprint:1303.3997.
  • Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. 2009. The sequence alignment/map format and SAMtools. Bioinformatics. 25:2078–2079.
  • O'brien TG, Kinnaird MF. 2003. Caffeine and conservation. Bogor (Indonesia): American Association for the Advancement of Science.
  • Park J, Kim Y, Xi H, Nho M, Woo J, Seo Y. The complete chloroplast genome of cold hardiness individual of Coffea arabica L. (Rubiaceae). Mitochondrial DNA Part B (Accepted).
  • Park J, Kim Y, Xi H, Nho M, Woo J, Seo Y. The complete chloroplast genome of high production individual tree of Coffea arabica L. (Rubiaceae). Mitochondrial DNA Part B (Accepted).
  • Samson N, Bausher MG, Lee SB, Jansen RK, Daniell H. 2007. The complete nucleotide sequence of the coffee (Coffea arabica L.) chloroplast genome: organization and implications for biotechnology and phylogenetic relationships amongst angiosperms. Plant Biotechnol J. 5:339–353.
  • Wellman FL. 1961. Coffee: botany, cultivation, and utilization. London: L. Hill.
  • Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18:821–829.
  • Zhao Q-Y, Wang Y, Kong Y-M, Luo D, Li X, Hao P. 2011. Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study. BMC Bioinformatics. 12:S2.