528
Views
0
CrossRef citations to date
0
Altmetric
Mitogenome Announcement

Characterization of the Complete Chloroplast Genome of Hygroryza aristata (Retz.) Nees ex Wight & Arn. (Zizaniinae, Poaceae)

, , , , , ORCID Icon, , & show all
Pages 1949-1950 | Received 06 Apr 2021, Accepted 22 May 2021, Published online: 22 Jun 2021

Abstract

The complete chloroplast genome sequence of Hygroryza aristata was sequenced, assembled and published for the first time here. The chloroplast genome was 135,681 bp in length and comprised of a large single-copy (LSC, 81,532 bp) region and a small single-copy (SSC, 12,383 bp) region interspersed by two inverted repeats (IRs, 20,883 bp). Gene annotation resulted in the identification of 113 unique genes including 79 protein-coding genes, 30 transfer RNA (tRNA) genes, and four ribosomal RNA (rRNA) genes. In addition, 118 simple sequence repeats (SSRs) and 47 long repeats were identified. Phylogenetic analysis based on maximum likelihood analysis (ML) resolved the placement of H. aristata sister to a clade of Rhynchoryza subulata and Zizania.

Hygroryza aristata (Retz.) Nees ex Wight & Arn. (Zizaniinae, Poaceae) is a perennial, aquatic, stoloniferous grass that forms extensive floating mats in ponds and lakes. The species is distributed across Southern China and Southeast Asia including Bangladesh, Cambodia, India, Laos, Malaysia, Myanmar, Nepal, Pakistan, Sri Lanka, Thailand, and Vietnam (Morya et al. Citation2017). This species is planted in artificial waterscapes, used as food for poultry, and a source of chemical compounds like lignan and indole alkaloids which are used medicinally for anti-inflammatory and antioxidant properties (Chung et al. Citation2011). Herein, we characterized the complete chloroplast genome of H. aristata for the first time and performed phylogenetic analysis to resolve the relationship among close relatives, which will provide a genomic resource for studying Poaceae evolution and breeding markers for rice wild relatives.

Fresh leaves of H. aristata were sampled from greenhouse grown plants at the Institute of Botany of the Chinese Academy of Sciences in Beijing (Beijing, China; 39°54′20″ N, 116°25′29″ E) and stored for later use with accession code HA20190314C (Specimen Museum of Botany of the Chinese Academy of Sciences in Beijing, ZQ Wu, [email protected]). The total cellular DNA was extracted using the cetyltrimethyl ammonium bromide (CTAB) method (Doyle Citation1987) and purified with phenol extraction (Yang et al. Citation2014). PCR amplification and Sanger sequencing methods were employed to obtain the whole chloroplast genome of H. aristata. The entire chloroplast was sequenced in overlapping fragments from PCR products using the chloroplast primers from Wu et al. (Citation2009). The PCR amplicons were purified following Tang et al. (Citation2010) and directly sequenced on an ABI 3730 (Applied Biosystems, Foster City, CA, USA). The final Sanger sequences were trimmed and assembled with the ContigExpress program from the Vector NTI Suite 6.0 (Informax Inc., North Bethesda, MD). The final assembled chloroplast sequence was submitted to DOGMA v1.2 (Wyman et al. Citation2004) for annotation using Zizania aquatica as reference. The final annotation was submitted to GenBank and given the accession MW849262. Simple sequence repeats (SSRs) across the chloroplast genome were identified using MISA (Beier et al. Citation2017) searching for motif sizes from one to six nucleotide units with minimum thresholds set to 8, 5, 4, 3, 3, and 3 repeat units for mono-, di-, tri-, tetra-, penta-, and hexa-nucleotide SSRs, respectively. REPuter (Kurtz et al. Citation2001) was employed to detect long repeats across the chloroplast genome with default settings.

Similar to most typical angiosperms, the chloroplast genome of H. aristata was 135,681 bp in size and divided into a large single-copy (LSC, 81,532 bp) region and a small single-copy (SSC, 12,383 bp) region separated by a pair of inverted repeats (IRs, 20,883 bp). The overall GC content was 39.03%, and 37.17%, 33.44%, 44.32% in the LSC, SSC, and IRs, respectively. A total of 113 unique genes were identified (17 of which were duplicated in the IRs) consisting of 79 protein-coding genes, 30 transfer RNA (tRNA) genes, and four ribosomal RNA (rRNA) genes. From this, 16 genes were found to contain intron(s) including eight tRNA genes and eight protein coding genes. As for the SSR identification, there were 108 mono-nucleotide, 4 di-nucleotide, 2 tri-nucleotide, and 4 tetra-nucleotide repeats totaling 118 SSR loci. Additionally, 34 forward matches and 13 palindromic matches were recognized from the long repeat analyses.

To investigate the phylogenetic position of the H. aristata chloroplast, we constructed a phylogenetic tree based on sampling from 23 Oryzeae species chloroplast genome sequences published in NCBI, and Phyllostachys propinqua (Arundinariinae) chloroplast genome was selected as an outgroup. Alignment was accomplished in MAFFT v7 (Katoh et al. Citation2019). The best-fit substitution model for the data was Blosum62 + F + I + G4 as chosen in ModelFinder v1.6.8 (Kalyaanamoorthy et al. Citation2017). The maximum likelihood tree was obtained in IQ-Tree v1.6 (Nguyen et al. Citation2015) with 1,000 bootstrap replicates. As shown in , H. aristata was most closely related to a clade of Rhynchoryza subulata and Zizania.

Figure 1. A maximum likelihood tree indicating the position of H. aristata with other chloroplast sequences from Oryzeae species and Phyllostachys propinqua (Arundinariinae) as an outgroup. All nodes were fully supported (Bootstrap value = 100).

Figure 1. A maximum likelihood tree indicating the position of H. aristata with other chloroplast sequences from Oryzeae species and Phyllostachys propinqua (Arundinariinae) as an outgroup. All nodes were fully supported (Bootstrap value = 100).

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The data that support the findings of this study are openly available in GenBank at https://www.ncbi.nlm.nih.gov/genbank/, MW849262.

Additional information

Funding

This work was supported by the [National Natural Science Foundation of China] under Grant [31970244] and [Zhejiang Provincial Natural Science Foundation of China] under Grant [LY21C160001].

References

  • Beier S, Thiel T, Münch T, Scholz U, Mascher M. 2017. MISA-web: a web server for microsatellite prediction. Bioinformatics. 33(16):2583–2585.
  • Chung YM, Lan YH, Hwang T, Leu YL. 2011. Anti-inflammatory and antioxidant components from Hygroryza aristata. Molecules. 16(3):1917–1927.
  • Doyle JJ. 1987. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull. 19:11–15.
  • Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. 2017. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 14(6):587–589.
  • Katoh K, Rozewicki J, Yamada KD. 2019. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 20(4):1160–1166.
  • Kurtz S, Choudhuri JV, Ohlebusch E, Schleiermacher C, Stoye J, Giegerich R. 2001. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 29(22):4633–4642.
  • Morya GC, Vinita V, Mishra HS, Shakya S, Bahadur R, Yadav KN. 2017. Millets: the indigenous food grains. Int J Adv AYUSH. 6(1):447–452.
  • Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 32(1):268–274.
  • Tang L, Zou XH, Achoundong G, Potgieter C, Second G, Zhang DY, Ge S. 2010. Phylogeny and biogeography of the rice tribe (Oryzeae): evidence from combined analysis of 20 chloroplast fragments. Mol Phylogenet Evol. 54(1):266–277.
  • Wu FH, Kan DP, Lee SB, Daniell H, Lee YW, Lin CC, Lin NS, Lin CS. 2009. Complete nucleotide sequence of Dendrocalamus latiflorus and Bambusa oldhamii chloroplast genomes. Tree Physiol. 29(6):847–856.
  • Wyman SK, Jansen RK, Boore JL. 2004. Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 20(17):3252–3255.
  • Yang JB, Li DZ, Li HT. 2014. Highly effective sequencing whole chloroplast genomes of angiosperms by nine novel universal primer pairs. Mol Ecol Resour. 14(5):1024–1031.