1,412
Views
0
CrossRef citations to date
0
Altmetric
Research Paper

Whole human genome 5’-mC methylation analysis using long read nanopore sequencing

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Pages 1961-1975 | Received 22 Oct 2021, Accepted 29 Jun 2022, Published online: 20 Jul 2022
 

ABSTRACT

Methylation microarray and bisulphite sequencing are often used to study 5’-methylcytosine (5’-mC) modification of CpG dinucleotides in the human genome. Although both technologies produce trustworthy results, the evaluation of the methylation status of CpG sites suffers from the potential side effects of DNA modification by bisulphite and/or the ambiguity of mapping short reads in repetitive and highly homologous genomic regions, respectively. Nanopore sequencing is an attractive alternative for the study of 5’-mC since it allows sequencing of native DNA molecules, whereas the long reads produced by this technology help to increase the resolution of those genomic regions. In this work, we show that nanopore sequencing with 10X coverage depth, using DNA from a human cell line, produces 5’-mC methylation frequencies consistent with those obtained by 450k microarray, digital restriction enzyme analysis of methylation, and reduced representation bisulphite sequencing. High correlation between methylation frequencies obtained by nanopore sequencing and the other methodologies was also noticeable in either low or high GC content regions, including CpG islands and transcription start sites. We also showed that a minimum of five reads per CpG yields strong correlations (>0.89) in replicate nanopore sequencing runs and an almost uniform linearity of the methylation frequency variation between zero and one. Furthermore, nanopore sequencing was able to correctly display methylation frequency patterns based on genomic annotations of CpG regions. These results demonstrate that nanopore sequencing is a fast, robust, and reliable approach to the study of 5’-mC in the human genome with low coverage depth.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability

The datasets presented in this study are available from the European Nucleotide Archive (ENA) at EMBL-EBI under the accession number PRJEB45078 (https://www.ebi.ac.uk/ena/browser/view/PRJEB45078).

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/15592294.2022.2097473

Additional information

Funding

This work is a result of the GenomePT project (POCI-01-0145-FEDER-022184), supported by COMPETE 2020 – Operational Programme for Competitiveness and Internationalisation (POCI), Lisboa Portugal Regional Operational Programme (Lisboa2020), Algarve Portugal Regional Operational Programme (CRESC Algarve2020), under the PORTUGAL 2020 Partnership Agreement, through the European Regional Development Fund (ERDF), and by Fundação para a Ciência e a Tecnologia (FCT). This work was also supported by Fundos FEDER through the Programa Operacional Factores de Competitividade – COMPETE and by Fundos Nacionais through the FCT within the scope of the project UID/BIM/00009/2019 (Centre for Toxicogenomics and Human Health-ToxOmics).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.