714
Views
1
CrossRef citations to date
0
Altmetric
Research Paper

In silico structural analysis of sequences containing 5-hydroxymethylcytosine reveals its potential as binding regulator for development, ageing and cancer-related transcription factors

ORCID Icon, &
Pages 503-518 | Received 25 Mar 2020, Accepted 24 Jul 2020, Published online: 02 Sep 2020
 

ABSTRACT

The presence of 5-hydroxymethyl cytosine in DNA has been previously associated with ageing. Using in silico analysis of normal liver samples we presently observed that in 5-hydroxymethyl cytosine sequences, DNA methylation is dependent on the co-presence of G-quadruplexes and palindromes. This association exhibits discrete patterns depending on G-quadruplex and palindrome densities. DNase-Seq data show that 5-hydroxymethyl cytosine sequences are common among liver nucleosomes (p < 2.2x10−16) and threefold more frequent than nucleosome sequences. Nucleosomes lacking palindromes and potential G-quadruplexes are rare in vivo (1%) and nucleosome occupancy potential decreases with increasing G-quadruplexes. Palindrome distribution is similar to that previously reported in nucleosomes. In low and mixed complexity sequences 5-hydroxymethyl cytosine is frequently located next to three elements: G-quadruplexes or imperfect G-quadruplexes with CpGs, or unstable hairpin loops (TCCCAY6TGGGA) mostly located in antisense strands or finally A-/T-rich segments near these motifs. The high frequencies and selective distribution of pentamer sequences (including TCCCA, TGGGA) probably indicate the positive contribution of 5-hydroxymethyl cytosine to stabilize the formation of structures unstable in the absence of this cytosine modification. Common motifs identified in all total 5-hydroxymethyl cytosine-containing sequences exhibit high homology to recognition sites of several transcription factor families: homeobox, factors involved in growth, mortality/ageing, cancer, neuronal function, vision, and reproduction. We conclude that cytosine hydroxymethylation could play a role in the recognition of sequences with G-quadruplexes/palindromes by forming epigenetically regulated DNA ‘springs’ and governing expansions or compressions recognized by different transcription factors or stabilizing nucleosomes. The balance of these epigenetic elements is lost in hepatocellular carcinoma.

Disclosure statement

The authors report no conflict of interest.

Supplementary material

Supplemental data for this article can be accessed here.

Additional information

Funding

This work is not part of a funded project.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.