3,086
Views
0
CrossRef citations to date
0
Altmetric
Review

DNA methylation episignatures: insight into copy number variation

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Pages 1373-1388 | Received 17 Aug 2022, Accepted 23 Nov 2022, Published online: 20 Dec 2022

Abstract

In this review we discuss epigenetic disorders that result from aberrations in genes linked to epigenetic regulation. We describe current testing methods for the detection of copy number variants (CNVs) in Mendelian disorders, dosage sensitivity, reciprocal phenotypes and the challenges of test selection and overlapping clinical features in genetic diagnosis. We discuss aberrations of DNA methylation and propose a role for episignatures as a novel clinical testing method in CNV disorders. Finally, we postulate that episignature mapping in CNV disorders may provide novel insights into the molecular mechanisms of disease and unlock key findings of the genome-wide impact on disease gene networks.

Epigenetic disorders & the epigenetic machinery

Many genes are directly or indirectly linked to epigenetic regulation, particularly those of the epigenetic machinery involved in histone modification, chromatin remodeling and DNA methylation. Epigenetics is the field of genetics involving the study of reversible heritable changes in gene function that occur without changes to the underlying DNA sequence. The epigenetic machinery involves genes encoding for proteins that contribute to the regulation of gene expression at the transcriptional and post-transcriptional level [Citation1]. Components of the epigenetic machinery can be classified as writers, erasers, readers and remodelers. Writers function to modify a region of interest of the genome based on the cell type, as well as on the developmental and metabolic stage of the cell, and they mark individual regions for use or disuse depending on the chromatin state [Citation2]. Writer enzymes include DNA methyltransferases that catalyze the addition of a methyl group to genomic DNA [Citation3]. Eraser genes encode for proteins that remove epigenetic marks and include deacetylase or demethylase enzymes. Reader proteins of the epigenetic machinery recognize the specific epigenetic markers in a locus and may act to recruit other enzymes to this region. The last group are the chromatin remodelers. This class of enzymes uses the energy of ATP to move nucleosomes along the DNA to make regions of chromatin more accessible. The other function of remodelers is to facilitate processes related to protein–DNA interactions [Citation4]. It is important to note that proteins involved in the epigenetic machinery often interact as part of macromolecular structures with complex interdependence and interactivity. The correct functioning of the epigenetic machinery, and thus transcriptional regulation in all processes at the cellular level, particularly during early development, are critical and essential steps in the developmental processes of the body [Citation5]. Therefore, aberrations in epigenetic machinery genes can impair cell function, and development, and ultimately result in epigenetic disorders [Citation2].

In the last decade many new genes associated with the epigenetic machinery have been identified [Citation6]. Bjornsson described the role of a substantial number of these genes in association with 44 Mendelian disorders and their most common overlapping features including neurodevelopmental phenotypes such as intellectual disability (ID), developmental delay and limb/nail abnormalities [Citation7]. These overlapping features create challenges in obtaining a diagnosis with current testing methods [Citation7]. Investigation of the epigenetic machinery and its associated genes is vital to understanding the pathogenesis of these conditions and their phenotypes [Citation7].

Phenotypes reflect the biological information stored in an individual’s genotype and are the sum of all observable characteristics in the genes and the influence of the environment [Citation8]. Over the past decades, this phenotype–genotype association has been demonstrated repeatedly at the molecular level. The relevant biological information can be stored in DNA, RNA or amino acid sequences and the subsequent proteins produced [Citation9]. These proteins regulate the development of an organism from a fertilized egg to an adult individual and ensure proper general cell functioning in the human body. The quality and quantity (dosage) of proteins produced can impact the function of a cell [Citation10]. While sharing the common DNA sequence, different cell types with specific functions and phenotypes contribute to the overall function of a given organism. This is related to the intercellular differences in gene expression, as well as proper temporal and spatial control of this gene expression during development. Therefore, control of gene expression influences cellular differentiation and development and the resulting observed phenotype [Citation10]. Gene expression can be influenced by epigenetic processes including DNA methylation and histone modification. These epigenetic modifications add an important regulatory layer to the dosage requirements for cell-type-specific gene expression, as cells must achieve a balance between the activities of the opposing epigenetic processes, for example, writers and erasers [Citation2].

One type of structural aberration that can impact gene dosage are copy number variants (CNVs), such as deletions or duplications. CNVs involve deletion or duplication of more than 1 kb of genetic material [Citation11] and can be responsible for phenotypic variation and disease susceptibility [Citation12]. CNVs can result in a change in the (total) amount of protein produced (dosage) and, where genes with epigenetic functions are involved, can therefore also systematically impact the balance of epigenetic modifications.

CNVs & dosage sensitivity

Gene dosage sensitivity has been shown to be a major determinant of the pathogenicity of CNVs [Citation13] and previous studies have described a gene balance hypothesis whereby “a stoichiometric balance is maintained among all of the complex gene products in a pathway” [Citation14]. Therefore, dosage effects are the result of the linear relationship between gene copy number and protein product [Citation13], and any deletion or duplication in a gene across the genome that is dosage sensitive could result in phenotypic effects. Multiple ongoing large-scale projects, such as ClinGen Dosage Sensitivity Map (www.clinicalgenome.org) and gnomAD [Citation15,Citation16], aim to accumulate evidence for the dosage sensitivity of the genes in the human genome in order to aid in the clinical assessment of variants. Analogous to this, some genes can be deleted, even in the homozygous state, without apparent phenotypic consequences [Citation13,Citation17]. These genes are described as dosage-insensitive genes. Haploinsufficiency is the result of one copy of a gene being inactivated or deleted and the remaining single gene not producing enough protein to maintain ‘normal’ function [Citation18]. In contrast, triplosensitivity arises when additional copies of a gene increase the protein product and produce a phenotype [Citation19].

Haploinsufficiency of the NSD1 gene (the result of microdeletions or intragenic sequence-level variants) is reported in Sotos syndrome with clinical features including generalized overgrowth, macrocephaly and ID [Citation20]. Alternatively, microduplications of NSD1 have been reported in individuals with a reciprocal Sotos phenotype, presenting with short stature, microcephaly, failure to thrive, seizures and ID [Citation21,Citation22]. It is speculated that gene dosage effects are responsible for the opposing phenotypes observed [Citation21,Citation22]. Similar reciprocal phenotypes have been reported in CNVs at the 1q21.1 [Citation23] and 16p11.2 regions [Citation24–26]. A study by Brunetti-Pierri et al. described dosage effects of the 1q21.1 region associated with head circumference, with deletions resulting in microcephaly and duplications macrocephaly [Citation23]. Deletions of the 16p11.2 region are associated with obesity and macrocephaly [Citation27], whereas duplication of the same region results in low body weight and microcephaly [Citation25]. Dosage of the 7q11.23 region has been shown to affect transcriptional pathways leading to symmetrically opposite expression alterations between deletions and duplications, based on induced pluripotent stem cell modeling [Citation28]. The authors proposed that the symmetrically dysregulated genes within the region may contribute to the opposing disease phenotypes observed in 7q11.23 deletion carriers (Williams–Beuren syndrome) and 7q11.23 duplication carriers (chromosome 7q11.23 duplication syndrome) [Citation28]. Further evidence for the opposing dosage-dependent alterations of these CNVs were reported in a study describing symmetrical dose-dependent DNA-methylation alterations in 7q11.23 deletions and duplications [Citation29]. Deletions of the 7q11.23 region associated with Williams–Beuren syndrome increased DNA methylation, in comparison to the decreased DNA methylation observed in duplication carriers [Citation29].

We postulate that opposing methylation effects may occur in many of the genomic regions where symmetrical deletion and duplications are commonly observed; however, future studies will be required to investigate this further.

Challenges of CNV interpretation

Fragment sizes of deletion and duplication events are quite variable. CNVs can involve a single gene or may consist of multiple genes and/or their regulatory elements. Clinical manifestations may therefore be the result of the effects of disruptions to multiple genes, termed contiguous gene disorders (e.g., 22q11.2 deletion syndrome) or alternatively much of the observed phenotype can be ascribed to the effects of a single causative gene contained within the region (e.g., the NSD1 gene in Sotos syndrome). Overall, these CNVs result in modified or complete absence of protein that may lead to a change in phenotypic presentation [Citation30]. Challenges arise when there is an absence of functional evidence for a CNV or no reports of similar aberrations in the control population databases. Therefore, many CNVs outside of the common genomic disorder regions are classified as variants of uncertain significance (VUS). Further challenges include CNVs that are thought to be pathogenic but have been inherited from apparently unaffected parents [Citation31] and individuals with seemingly identical deletions or duplications that have variable phenotypic features [Citation32]. Reduced penetrance (presence or absence of a phenotype) and variable expressivity (degree and spectrum of observed phenotypes) add an additional layer of difficulty when interpreting and classifying CNVs, as well as in counseling families in the clinic.

Challenges of CNV detection

The detection of CNVs can be carried out using several methods and each method has its own advantages and disadvantages. Clinically relevant CNVs can range from nucleotide-level insertion deletions (indels) to the gain or loss of entire chromosomes [Citation33]. Multiple techniques have been developed and are utilized routinely in clinical laboratories to identify CNVs, as not all CNVs can be detected by the same method. The selection of the method of testing is a combination of national testing guidelines and clinical presentation; however, phenotypic overlap often complicates optimal decision-making regarding the most appropriate first-line choice of a particular diagnostic test [Citation34]. In many instances several of these tests must be performed in a single patient to obtain a diagnosis.

Cytogenetic analysis

Classical cytogenetic methods, including karyotype and fluorescent in situ hybridization (FISH), can detect large CNVs (>3–5 Mb) and microscopic structural features of each chromosome. Karyotyping is the process of pairing and ordering the chromosomes and comparing the banding pattern to identify structural variants. Karyotypes are made with standard staining procedures that show the characteristic structural features (bands) of each chromosome. There are several staining procedures that can be performed to karyotype: G-banding (GTG banding) involves the use of trypsin-Giemsa staining, R-banding (RHG banding) produces the reverse banding pattern from G-banding and uses Giemsa, whereas Q-banding (QFQ banding) uses quinacrine. Additionally, FISH can be performed using fluorescent probes (e.g., spectral karyotyping, multicolour-FISH or targeted FISH) [Citation35,Citation36]. Karyotyping depends heavily on the quality of the sample [Citation37]. If the patient is suspected to have a specific aberration, targeted FISH can be performed with the indicated probes mapping to the region. An advantage of karyotyping is that it is possible to detect apparently balanced translocations and mosaic profiles [Citation38]. It is worth noting, however, that these methods are limited in detection of smaller aberrations (<3 Mb) [Citation35] and are currently primarily used for detection of whole chromosome aneuploidies or balanced rearrangements.

Chromosomal microarray

Chromosomal microarray (CMA) is the first-tier clinical test for individuals with developmental delay, ID and/or congenital anomalies [Citation39]. One type of CMA used to detect chromosomal imbalances is comparative genomic hybridization (CGH). This technique is performed by cohybridizing the fluorescently labeled DNA of a patient with that of a reference control, each labeled with a different color [Citation40]. CGH is used to identify differences in the number of DNA copies of parts of the genome, that is, CNVs [Citation41]. Another type of CMA is the SNP array. This array is more sensitive than CGH, and the DNA probes that are used derive from regions in the genome that show the differences between individuals at a single base pair site thus enabling the detection of polymorphic alleles [Citation42]. The resolution of CGH or SNP arrays differs and depends on probe coverage and location as well as the platform used to perform the array. For example, the Illumina (CA, USA) CytoSNP platform includes more than 850,000 probes enhanced at regions of clinical significance. The resolution of the array-CGH or SNP array is greater than that achieved by the classical cytogenetic technologies and can detect aberrations as small as 10–40 kb [Citation43]; however, CMA cannot detect balanced translocations, inversions, insertions, nucleotide-level changes or low mosaic profiles [Citation37].

Multiplex ligation-dependent probe amplification

Multiplex ligation-dependent probe amplification (MLPA) has been utilized in the detection of CNVs for over two decades. This method allows for quantitative assessment of multiple targeted DNA sequences in a single reaction [Citation44]. Initially, MLPA has been utilized in the assessment of deletions of the 22q11.2 region in DiGeorge/Velocardiofacial syndrome [Citation45]. More recently, many additional kits have become available to detect CNVs in other regions including those associated with Williams–Beuren or Sotos syndromes (MRC Holland, Amsterdam, The Netherlands). MLPA is highly sensitive and can be used to detect CNVs <40 kb, including atypical deletions and duplications which are not routinely detected by classical cytogenetic methods [Citation46]. Similar to FISH, MLPA is also limited to assessment of specific genomic regions. However, MLPA has a rapid turnaround time and is relatively inexpensive compared with other methods [Citation44].

Optical genome mapping

More recently, there has been an emergence of techniques involving optical genome mapping (OGM) that are designed to detect both balanced and unbalanced structural variants in a single platform, including CNVs larger than 500 bp, providing a higher sensitivity than the standard methods (CMA, karyotype, FISH) [Citation47]. In addition, OGM can identify broken, missing, rearranged or extra chromosomes. OGM can also detect structural variants with a low allele fraction [Citation48]. The technique involves labeling ultra-long DNA molecules with a 6 bp general binding motif that creates a pattern that spans the whole genome and is unique to each individual sample. Labeled DNA is then detected through electrophoresis. Every sample has a unique optical genome map, which is aligned to a reference genome, and the differences between those are automatically identified by the software [Citation49]. Limitations of OGM are mostly the result of gaps in the human reference genome and inability to detect breakpoints within large repetitive sequences such as pericentromeric regions and p-arms of the acrocentric chromosomes. The recent mapping of the genome gaps may help to reduce these limitations in the future [Citation50].

Next-generation sequencing & Sanger sequencing

To detect smaller CNVs (1–10 kb) and indels (<1 kb), next-generation sequencing (NGS) can be performed, either as targeted panels or whole exome or genome sequencing (WES and WGS respectively) [Citation51]. Indels can typically be identified by NGS with good sensitivity, but the specificity tends to be low [Citation11]. There is also biased detection of deletions over insertions due to difficulty aligning to the reference sequence. Techniques using read-pair, read-depth, split-read or assembly-based methods have been described to identify indels and CNVs from NGS [Citation52]. Another sequencing method that is widely used is Sanger sequencing. In comparison with NGS, Sanger sequencing only sequences a single DNA fragment at a time. Sanger is mainly used to target smaller genomic regions in a large cohort, for the validation of results from studies performed with NGS or to sequence variable regions [Citation53].

Whole exome sequencing & whole genome sequencing

WES is a DNA sequencing strategy that utilizes NGS techniques to provide a survey of base substitutions and focuses on the exons of 22,000 coding genes, with the output comprising less than 2% of the entire genome. However, a significant proportion of pathogenic variants are located in coding sequences [Citation54,Citation55]. Complexities of interpretation of WES are highlighted by the following: 1) pathogenic variants associated with the phenotype, 2) VUS and 3) secondary findings including pathogenic variants unrelated to the indication for testing.

WGS is a comprehensive method for analyzing entire nonrepetitive genomes including analysis of intronic regions. However, there is limited understanding of the impact of variants, including CNVs, outside of protein coding regions and therefore many of the variants detected in these regions are classified as VUS. Recent studies have shown that CNVs over 50 bp can routinely be detected from targeted NGS gene panels through bioinformatic assessment of the sequencing read depth [Citation56]. Nevertheless, like any clinical test, WES and WGS are not without challenges and limitations and cannot detect large CNVs particularly across repetitive regions. In addition, WES/WGS cannot identify the genomic location of structural variants or low-level mosaicism [Citation51]. Recently, the American College of Medical Genetics and Genomics recommended that WES or WGS be considered as a first- or second-tier test in pediatric patients with congenital anomalies or intellectual disabilities [Citation57].

Long-read sequencing

Long-read sequencing, sometimes referred to as third-generation sequencing, is a method designed to detect DNA sequences up to 140kb, 2–10× coverage depending on the method and platform. Examples of long-range sequencing approaches include ones developed by Pacific Biosciences (PacBio) and Oxford Nanopore Technologies [Citation58]. Nanopore sequencing is a method for long-read cDNA and RNA sequencing and can generally detect up to 10–50 kb. This method involves direct nucleotide assessment without active DNA synthesis. A protein nanopore is stabilized in an electrically resistant polymer membrane and the single stranded DNA passes through this protein nanopore [Citation59]. PacBio is a real-time sequencing method that can detect up to 10–50 kb reads. This method captures sequence data during the replication process and adds a sequence by synthesis strategy that captures a single DNA molecule. PacBio sequencing involves fluorescent light emission that can detect the single nucleotides that are added by the DNA polymerase. Similar to OGM, long-read sequencing utilizes ultra-long high molecular weight DNA, which can be challenging as fresh material or intact cells are required [Citation60]. Long-read sequencing technologies are well suited for genomic, transcriptomic and epigenetic studies [Citation61].

All the methods used in current diagnostic tests involve interrogation of the underlying DNA sequence at the base pair level (e.g., NGS, MLPA or Sanger sequencing) or at the chromosome level (e.g., CMA, FISH or karyotype). Recent studies have reported DNA methylation episignatures as an emerging diagnostic tool that does not directly assess the underlying DNA sequence [Citation62]. Episignatures instead assess the epigenetic consequences of DNA variants [Citation63]. provides a summary of the above-described methods.

Table 1. Overview of the different techniques used to detect copy number variants.

Episignatures

An episignature is defined as a recurring and reproducible DNA methylation pattern associated with a common genetic or environmental etiology in a specific disorder population [Citation64]. Episignatures are highly sensitive and specific biomarkers that not only detect aberrant DNA methylation but can also provide novel insights into the pathogenesis of genetic conditions as well as highlighting potential targets for future therapies [Citation65]. Episignature testing has shown diagnostic utility in resolving VUS [Citation63,Citation64,Citation66], and detection of an episignature can be used as strong functional evidence for variant reclassification from VUS to likely pathogenic. Alternatively, it should be noted that the absence of a detectable episignature, while suggestive, does not absolutely rule out pathogenicity.

A portion of episignatures that have been identified to date involve genes directly or indirectly associated with the epigenetic machinery that were also described by Bjornsson [Citation7]. Examples include, but are not limited to, the epigenetic writers EZH2 linked to Weaver syndrome [Citation64] and DNMT3A in Tatton–Brown–Rahman syndrome [Citation66], the eraser gene KDM5C in Claes–Jensen syndrome [Citation67], the reader PHF6 in Börjeson–Forssman–Lehmann syndrome [Citation64] and the remodelers SRCAP in Floating-Harbor syndrome [Citation68] and ATRX in alpha-thalassemia mental retardation syndrome [Citation69].

The Illumina microarray (Illumina, CA, USA) is the primary platform used to profile DNA methylation episignatures. This microarray technology provides a genome-wide, cost-effective, and high throughput test that is easily introduced in a laboratory setting with short technical turnaround times. Through bioinformatic applications to the data produced from these microarrays, the DNA methylation profiles can be assessed. EpiSign™ (London Health Sciences Centre Research Inc and Bekim Sadikovic, London, Canada) is a genome-wide DNA methylation diagnostic test that can detect more than 60 rare genetic disorders associated with more than 80 genes [Citation62], with the majority involved in epigenetic regulation. Disorders associated with episignatures that are currently screened with EpiSign V3 test are shown in . In addition, EpiSign can also detect several imprinting and repeat expansion disorders including Beckwith–Wiedemann syndrome, Silver-Russell syndrome [Citation70] and fragile-X syndrome [Citation62]. Depending on sample size, effect size, type of trait (continuous, categorical or binary) and distribution of the trait, different methodological approaches can (and should) be applied to study association and predictive power of DNA methylation profiles. For example, as described above, EpiSign is based on a support vector machine algorithm. However, we and other studies optimized their analyses using, among others, elastic net, random forest or other tree-based machine learning approaches [Citation71,Citation72]. In most studies optimization of these models is rather empirical, and the performance of multiple models mostly is evaluated post hoc [Citation73]. The EpiSign workflow to detect novel DNA methylation episignatures is quite unique. The first phase within this workflow focuses on the detection of a disease-associated episignature alone. The second phase of episignature discovery comprises additional training against all other previously detected and validated episignatures, including cycles of optimization if necessary. This approach guarantees the detection of robust episignatures, reflected by both a high selectivity and sensitivity [Citation62].

Figure 1. Current episignatures detectable by EpiSign™ V3.

There are four different categories in this figure: reader, eraser, writer and remodeler proteins. On the outside of the figure there are open circles that denote recessive inheritance and closed circles that denote dominant inheritance. In addition, the disorder phenotypes are shown in different colors, dysmorphic features (green), growth retardation (blue) and intellectual disability (orange).

This figure is adapted with permission from [Citation7], © Bjornsson.

Figure 1. Current episignatures detectable by EpiSign™ V3. There are four different categories in this figure: reader, eraser, writer and remodeler proteins. On the outside of the figure there are open circles that denote recessive inheritance and closed circles that denote dominant inheritance. In addition, the disorder phenotypes are shown in different colors, dysmorphic features (green), growth retardation (blue) and intellectual disability (orange).This figure is adapted with permission from [Citation7], © Bjornsson.

Considering these findings and the previously described DNA methylation profiles seen in patients with deletions and duplications of 7q11.23, 22q11.2 and 22q13.3 [Citation29,Citation65,Citation74], we propose that other CNVs may also exhibit unique diagnostic methylation episignatures.

Episignatures associated with CNVs

DNA methylation arrays, such as the Illumina (CA, USA) Infinium EPIC array utilized in diagnostic episignature testing, have permitted the identification of unique episignatures associated with sequence-level variants in many genes. These discoveries have provided novel insights into the pathogenesis of Mendelian disorders particularly in those genes associated with the epigenetic machinery. However, to date, the utility of assessing DNA methylation profiles that may be the result of larger CNVs has not been widely discussed [Citation75].

Differential methylation in regions associated with some of the most common CNV genomic disorders have been identified. Strong et al. described the role of DNA methylation in the pathogenesis of Williams–Beuren syndrome (7q11.23 deletion) and 7q11.23 duplication syndrome [Citation29]. This CNV region is 1.5 Mb in size and contains approximately 25 genes, of which six have a reported role in epigenetic function. This study concluded that epigenetic mechanisms likely contribute to the complex neurological phenotypes observed in these patients. Another recent study assessed atypical deletions in the 7q11.23 region in order to identify the critical region and genes responsible for the neurodevelopment of patients with Williams–Beuren syndrome [Citation76]. The study concluded that three genes may play an important role: BAZ1B, FZD9 and STX1A [Citation76]. Interestingly, BAZ1B is a tyrosine-protein kinase with an epigenetic regulatory function, as it plays a central role in chromatin remodeling [Citation77]. It is plausible that this gene may be responsible for the aberrant DNA methylation observed in the Strong et al. study; however, further work would be required to assess the DNA methylation patterns in patients with gene-level variants in BAZ1B to confirm this hypothesis.

The interrogation of deletions of different sizes at the 22q13.3 region was also carried out in individuals with Phelan McDermid syndrome [Citation65]. Assessment of DNA methylation profiles in this cohort of individuals identified a critical region responsible for the observed aberrant DNA methylation. This region contained the candidate gene BRD1, a bromodomain-containing protein involved in histone modification. Haploinsufficiency of BRD1 has since been shown to result in abnormal brain development and function due to dysregulation of histone acetylation in mouse models [Citation78]. Investigation of single gene deletions of BRD1 in humans have not been performed but could shed light on the mechanisms responsible for the phenotypic differences observed in Phelan McDermid syndrome individuals.

Differential methylation has also been reported in the 22q11.2 region in DiGeorge/velocardiofacial syndrome [Citation74]. This study also assessed variable sized deletions and identified an episignature in 43 individuals with typical deletions, and two deletions that were more proximal showed the same methylation profile (). It is possible that these proximal deletions disrupt enhancer or regulatory regions for genes contained within the typical deletion. Alternatively, deletions may affect the 3D chromatin organization due to disruption of topologically associated domains (TADs) in this region. Differential gene expression has been observed in patients with deletion of 16p11.2 as a consequence of TAD disruption [Citation79]. This suggests that remodeling of chromatin spatial organization plays a key role in the resultant phenotypes. In addition, a DNA methylation episignature was described in a cohort of individuals with autism spectrum disorder and deletions of 16p11.2 [Citation80]. This supports the idea that aberrant DNA methylation patterns may be seen in disorders where TAD disruptions occur.

Figure 2. 22q11.2 deletion syndrome demonstrating that episignatures can be mapped in copy number variant disorders and utilized in clinical testing.

(A) Deletions of the 22q11.2 region: those in red all share a common episignature including the two most proximal deletions. The four deletions represented by the blue horizontal bars do not share the common episignature. (B) Heatmap confirms that 22q11.2 deletion subjects (orange) can be differentiated from age and sex matched controls (blue) using selected CpG probes identified as the episignature. (C) Multidimensional scaling plot confirming the clustering of affected subjects from controls. (D) Support vector machine classifier model. This model was trained using the selected probes for the 22q11.2 episignature, 75% of controls and 75% of other neurodevelopmental disorders samples (blue). The remaining 25% of controls and other disorder samples were used for testing (grey). Plot demonstrates the specificity of the 22q11.2 episignature in differentiating affected cases from other neurodevelopmental disorders with mapped episignatures on the EpiSign™ clinical classifier.

22q11.2DS: 22q11.2 deletion syndrome; ADCADN: Cerebellar ataxia deafness and narcolepsy syndrome; AUTS18: Susceptibility to autism 18; BEFAHRS: Beck-Fahrner syndrome: BFLS: Borjeson–Forssman–Lehmann syndrome; BISS: Blepharophimosis intellectual disability SMARCA2 syndrome; CdLS: Cornelia de Lange syndrome; CHARGE: CHARGE syndrome; Chr16p11.2del: Chromosome 16p11.2 deletion syndrome; CSS: Coffi–Siris syndrome; CSS4: Coffin-Siris syndrome 4; CSS9: Coffin–Siris syndrome 9; Down: Down syndrome; Dup7: 7q11.23 duplication syndrome; DYT28: Dystonia 28; EEOC: Epileptic encephalopathy-childhood onset; FLHS: Floating Harbour syndrome; GTPTS: Genitopatellar syndrome; HMA: Hunter McAlpine craniosynostosis syndrome; HVDAS: Helsmoortel–van der Aa syndrome; ICF: Immunodeficiency-centromeric instability-facial anomalies syndrome; IDDSELD: Intellectual developmental disorder with seizures and language delay; Kabuki: Kabuki syndrome; KDVS: Koolen-De Vries syndrome; Kleefstra: Kleefstra syndrome; LLS: Luscan-Lumish syndrome; MKHK: Menke-Hennekam syndrome; MLASA2: Myopathy lactic acidosis and sideroblastic anemia 2; MRD23: Intellectual developmental disorder 23; MRD51: Intellectual developmental disorder 51; MRX93: Intellectual developmental disorder X-linked 93; MRX97: Intellectual developmental disorder X-linked 97; MRXSA: Intellectual developmental disorder X-linked syndromic Armfield type; MRXSCH: Intellectual developmental disorder X-linked syndromic Christianson type; MRXSCJ: Intellectual developmental disorder X-linked syndromic Claes-Jensen type; MRXSN: Intellectual developmental disorder X-linked syndromic Nascimento type; MRXSSR: Intellectual developmental disorder X-linked syndromic Snyder–Robinson type; PHMDS: Phelan–McDermid syndrome; PRC2: PRC2 complex (Weaver and Cohen-Gibson) syndrome; RENS1: Renpenning syndrome; RMNS: Rahman syndrome; RSTS: Rubinstein–Taybi syndrome; SBBYSS: Ohdo syndrome; Sotos: Sotos syndrome; TBRS: Tatton–Brown–Rahman syndrome; WDSTS: Wiedemann–Steiner syndrome; WHS: Wolf-Hirschhorn syndrome; Williams: Williams syndrome.

This figure is adapted with permission from [Citation74].

Figure 2. 22q11.2 deletion syndrome demonstrating that episignatures can be mapped in copy number variant disorders and utilized in clinical testing. (A) Deletions of the 22q11.2 region: those in red all share a common episignature including the two most proximal deletions. The four deletions represented by the blue horizontal bars do not share the common episignature. (B) Heatmap confirms that 22q11.2 deletion subjects (orange) can be differentiated from age and sex matched controls (blue) using selected CpG probes identified as the episignature. (C) Multidimensional scaling plot confirming the clustering of affected subjects from controls. (D) Support vector machine classifier model. This model was trained using the selected probes for the 22q11.2 episignature, 75% of controls and 75% of other neurodevelopmental disorders samples (blue). The remaining 25% of controls and other disorder samples were used for testing (grey). Plot demonstrates the specificity of the 22q11.2 episignature in differentiating affected cases from other neurodevelopmental disorders with mapped episignatures on the EpiSign™ clinical classifier.22q11.2DS: 22q11.2 deletion syndrome; ADCADN: Cerebellar ataxia deafness and narcolepsy syndrome; AUTS18: Susceptibility to autism 18; BEFAHRS: Beck-Fahrner syndrome: BFLS: Borjeson–Forssman–Lehmann syndrome; BISS: Blepharophimosis intellectual disability SMARCA2 syndrome; CdLS: Cornelia de Lange syndrome; CHARGE: CHARGE syndrome; Chr16p11.2del: Chromosome 16p11.2 deletion syndrome; CSS: Coffi–Siris syndrome; CSS4: Coffin-Siris syndrome 4; CSS9: Coffin–Siris syndrome 9; Down: Down syndrome; Dup7: 7q11.23 duplication syndrome; DYT28: Dystonia 28; EEOC: Epileptic encephalopathy-childhood onset; FLHS: Floating Harbour syndrome; GTPTS: Genitopatellar syndrome; HMA: Hunter McAlpine craniosynostosis syndrome; HVDAS: Helsmoortel–van der Aa syndrome; ICF: Immunodeficiency-centromeric instability-facial anomalies syndrome; IDDSELD: Intellectual developmental disorder with seizures and language delay; Kabuki: Kabuki syndrome; KDVS: Koolen-De Vries syndrome; Kleefstra: Kleefstra syndrome; LLS: Luscan-Lumish syndrome; MKHK: Menke-Hennekam syndrome; MLASA2: Myopathy lactic acidosis and sideroblastic anemia 2; MRD23: Intellectual developmental disorder 23; MRD51: Intellectual developmental disorder 51; MRX93: Intellectual developmental disorder X-linked 93; MRX97: Intellectual developmental disorder X-linked 97; MRXSA: Intellectual developmental disorder X-linked syndromic Armfield type; MRXSCH: Intellectual developmental disorder X-linked syndromic Christianson type; MRXSCJ: Intellectual developmental disorder X-linked syndromic Claes-Jensen type; MRXSN: Intellectual developmental disorder X-linked syndromic Nascimento type; MRXSSR: Intellectual developmental disorder X-linked syndromic Snyder–Robinson type; PHMDS: Phelan–McDermid syndrome; PRC2: PRC2 complex (Weaver and Cohen-Gibson) syndrome; RENS1: Renpenning syndrome; RMNS: Rahman syndrome; RSTS: Rubinstein–Taybi syndrome; SBBYSS: Ohdo syndrome; Sotos: Sotos syndrome; TBRS: Tatton–Brown–Rahman syndrome; WDSTS: Wiedemann–Steiner syndrome; WHS: Wolf-Hirschhorn syndrome; Williams: Williams syndrome.This figure is adapted with permission from [Citation74].

The capability of episignatures to differentiate, for example, the 22q11.2 deletion syndrome from other neurodevelopmental disorders, including other CNV disorders such as Williams–Beuren syndrome, is shown in D. This sensitivity and specificity of episignatures for the diagnosis of several CNV disorders has been previously described [Citation62]. Taken together, these studies demonstrate the clinical utility of assessing CNVs of differing sizes to uncover novel candidate genes or critical regions that may be responsible for the observed episignature and resultant phenotypes.

Another approach to investigating episignatures in CNVs would be a ‘gene-level up’ method. This method would identify genes with reported roles in epigenetic regulation and assess the DNA methylation profiles of individuals with variants in that specific gene alongside CNVs overlapping the gene of interest. This approach was utilized in mapping a genome-wide DNA methylation episignature in SETD1B-related syndrome [Citation81]. In this study the authors described patients with small gene-level variants and large CNVs in SETD1B, a gene that encodes a SET domain containing protein that is part of a histone methyltransferase complex. This complex primarily targets gene promoters and is highly correlated with gene expression [Citation82]. All patients presented with a similar phenotype that included ID, language delay, epilepsy and behavioral problems such as autism spectrum disorder and anxiety. They also had dysmorphia including full cheeks, full lower lip, macroglossia and tapering fingers. The phenotypes reported are in line with the common features of the Mendelian disorders of the epigenetic machinery described by Fahrner and Bjornsson [Citation2].

Similarly, a genome-wide DNA methylation episignature was reported in individuals with Sotos syndrome [Citation83]. Profiles from individuals with microdeletions of the 5q35 region were assessed alongside gene-level variants in NSD1 for episignature mapping. NSD1 is a histone methyltransferase that can both negatively and positively influence transcription [Citation84]. An episignature was also described for duplications of the same 5q35 region involving NSD1 associated with Hunter–McAlpine syndrome (HMA) [Citation64]. This episignature was able to differentiate HMA from 41 other Mendelian neurodevelopmental disorders. The signature of the 5q35 deletion (Sotos syndrome) included solely hypomethylated loci in contrast to the 5q35 duplication (HMA) which was represented by hypermethylation of the same loci. This mirror effect is most likely the consequence of an opposite dosage of similar genetic loci, that is, the loss of function versus the gain of function [Citation64]. Genes involved in epigenetic regulation are also found in other common CNV regions associated with neurodevelopmental disorders where no episignature has been reported to date. HNRNPU is a gene that codes for a DNA- and RNA-binding protein that links specific DNA elements and plays a role in chromatin organization [Citation85]. It is located within the 1q43q44 deletion syndrome region and has been implicated as the main candidate for the epilepsy component of the disorder’s phenotype [Citation86]. In addition, gene-level variants in HNRNPU detected by WES are described in patients with neurodevelopmental phenotypes [Citation87]. HNRNPU has also been shown to regulate TAD boundaries and TAD interactions and therefore may play a role in chromatin reorganization [Citation85]. Similarly, haploinsufficiency of the histone deacetylase protein HDAC4 has been implicated as the candidate gene in 2q37 deletion syndrome [Citation88]. Individuals with 2q37 deletion syndrome also exhibit phenotypes in line with Mendelian disorders of the epigenetic machinery [Citation2], such as ID and brachydactyly [Citation88].

We hypothesize that disruption of chromatin regulation in regions affected by CNVs, through effects on 3D organization and/or the presence of genes with epigenetic regulatory functions, may result in aberrant DNA methylation profiles. This may be especially true in neurodevelopmental disorders with phenotypes overlapping those previously described by Fahrner and Bjornsson [Citation2]. These DNA methylation episignatures are sensitive and specific biomarkers that could be used for clinical diagnosis [Citation62]. This DNA methylation profile testing approach could alleviate the burden of choosing the appropriate CNV detection method (e.g., WES or CMA) as more CNV disorder episignatures are defined. Taken together, this approach could aid in the refinement of critical regions and genes and further our understanding of the pathogenesis of many genomic disorders that may have implications in the development of targeted therapies in the future.

The potential advantages of episignature mapping in other common CNV disorders was discussed in a recent study by Rooney and Sadikovic [Citation75]. Several of the common CNV regions, associated with neurodevelopmental phenotypes, contain genes with links to the epigenetic machinery (). This strongly suggests the potential for epigenetic consequences of CNVs in these regions. To date, the majority of these CNV regions do not have mapped episignatures. Of note, chromosome region 1p36 contains the CHD5 gene which is a component of the SWI/SNF chromatin remodeling complex, of which pathogenic variants in several other genes of this complex have been reported to result in aberrant DNA methylation profiles [Citation91]. In contrast, several regions contain genes that, at present, have no reported direct links to the epigenetic machinery. One such region is the 15q11.2 nonimprinted region. It also needs to be acknowledged that the spectrum of episignature disorders is expanding beyond genes directly involved in chromatin and epigenetic regulation. Further work would be required to confirm if changes in the DNA profiles of these 15q11.2 deletion carriers can be detected. Given the evidence of the disruption of 3D chromatin organization via TADs reported in the 16p11.2 deletion region, and effects of TAD disruption on gene expression [Citation79], it’s plausible that deletions of 15q11.2 may still result in an episignature despite an absence of epigenetic machinery genes in this region. Future studies assessing DNA methylation in 15q11.2 deletion carriers would be necessary. An additional challenge in mapping episignatures in CNV disorders may arise in those associated with nonrecurrent breakpoints. Should the episignature be driven by a single gene contained within the region, similar to Sotos syndrome [Citation83], then overlapping genomic coordinates would be pertinent to episignature mapping. Similarly, if changes to the DNA methylation profiles are driven by TADs then differently sized CNVs with differing breakpoints could result in different outcomes of TAD disruption. Again, further work will be required to assess CNV disorders in order to elucidate this further.

Table 2. Epigenetic machinery genes located in common copy number variant regions.

Utility of episignatures

Assessment of differentially methylated regions (DMRs) identified by DNA methylation arrays can provide information about genes affected by the disrupted DNA methylation elsewhere in the genome that may be contributing to the clinical phenotype observed in patients. Biological pathways downstream of the BAF complex were identified through assessment of DMRs in Coffin–Siris and Nicolaides–Baraitser syndromes associated with variants in ARID1B, SMARCB1, SMARCA4 and SMARCA2 genes [Citation91]. These disorders were shown to have significant overlap in DNA methylation profiles. In addition, gene-set enrichment analyses were performed and showed anatomical and system development as the most significant returned terms [Citation91]. Both syndromes are associated with ID and limb abnormalities and represent a disease spectrum. Of note, these episignatures were also identified in patients with deletions of 6q25 containing the ARID1B gene. This provided novel insights into the pathogenesis of 6q25 deletion syndrome. This is in line with a previous study that assessed a deletion involving ARID1B as the only coding gene in the region and suggested ARID1B as the candidate gene for the disorder [Citation92]. DMRs from a DNA methylation episignature were also assessed in 22q11.2 deletion syndrome and identified hypomethylation of several genes that may contribute to the phenotype [Citation74]. These genes included HOXA2, associated with cleft palate and hearing impairment, and IRF8, known to be involved with immunodeficiency 32B.

HOX genes are also implicated as drivers of embryonic bone, tissue and organ development through gene-set enrichment analysis performed on the DNA methylation episignature in Bohring–Opitz syndrome [Citation93]. The most prominent human phenotype terms identified in the study were related to bone abnormalities of the limbs and limb joints in line with the phenotypes of previously reported epigenetic disorders. Therefore, DNA methylation episignatures may provide novel insights into the mechanisms of genetic disorders and the gene pathways affected. Assessment of genome-wide DNA methylation effects has the potential to unlock a state-of-the-art analysis of gene networks. These insights may provide novel targets for therapy in neurodevelopmental disorders.

Conclusion

Episignature analysis may also have potential utility in investigating reduced penetrance and variable expressivity conditions. To date, episignature mapping has focused on utilizing affected individuals with shared genomic variants to identify a common aberrant DNA methylation profile. It is well documented that several of the common CNV disorders show reduced penetrance and variable expressivity, including 22q11.2 and 16p11.2 deletion syndromes [Citation31,Citation88]. It is postulated that haploinsufficiency alone cannot explain the phenotypic variation observed in these conditions [Citation94]. Epigenetic analysis may help to further elucidate the intrafamilial differences in these families. Further work assessing large cohorts of phenotypically affected and unaffected individuals that carry the same CNV would be required.

Additionally, the detection of an episignature can act as a surrogate for functional studies and help resolve VUS, as has been shown in several studies [Citation63,Citation93,Citation95]. The resolution of VUS minimizes challenges in genetic counseling and provides valuable information for patients and their families.

Future perspective

Bjornsson described 44 genes with specific functions (eraser, writer, remodeler and readers) that are part of the epigenetic machinery and include common overlapping clinical features [Citation7]. This overlap presents challenges to genetic testing using current methodologies, as clinicians must decide between single nucleotide or larger CNV-based techniques. EpiSign™, a recently developed clinical assay utilizing episignatures, has proven utility in differentiating between neurodevelopmental disorders including those associated with CNVs, sequence level variants, repeat expansion disorders and imprinting conditions. Thus, EpiSign™ provides a single platform for the detection of the functional consequences of DNA aberrations, to aid in genetic diagnosis and in resolving VUS. Episignatures have been detected in more genes than initially described by Bjornsson [Citation7] as well as in disorders associated with CNVs, such as DiGeorge/Velocardiofacial syndrome [Citation74]. Therefore, it is necessary to continue to research DNA methylation patterns in all disorder genes and CNVs that could expand the current diagnostic testing platforms and provide novel insights into the pathogenesis of these disorders.

Executive summary
  • The epigenetic machinery involves genes encoding proteins that contribute to the regulation of gene expression at the transcriptional and post-transcriptional level.

  • In the last decade many new genes associated with the epigenetic machinery have been identified and associated with human disorders.

  • Copy number variants (CNVs), such as deletions or duplications, are one type of genetic aberration that has the potential to impact gene dosage.

  • Gene dosage sensitivity has been shown to be a major determinant of the pathogenicity of CNVs.

  • The detection of CNVs can be carried out using several methods and each method has its own advantages and disadvantages. All the methods used in current diagnostic tests involve interrogation of the underlying DNA sequence at the base pair level (e.g., next-generation sequencing, multiplex ligation-dependent probe amplification or Sanger sequencing) or at the chromosome level (e.g., chromosomal microarray, fluorescent in situ hybridization or karyotype).

  • An episignature is defined as a recurring and reproducible DNA methylation pattern associated with a common genetic or environmental etiology in a disorder-specific population. Most episignatures that have been mapped to date involve genes directly or indirectly associated with the epigenetic machinery and overlap with those described by Bjornsson.

  • Assessment of differentially methylated regions identified by whole genome DNA methylation analysis can provide invaluable information on additional genes that may be affected elsewhere in the genome that could be contributing to the clinical phenotype observed in patients.

  • The detection of an episignature can act as a surrogate for functional studies and help resolve variants of uncertain significance.

  • It is necessary to continue to research DNA methylation patterns in all disorder genes and CNVs that could expand the current diagnostic testing platforms and provide novel insights into the pathogenesis of these disorders.

Author contributions

L van der Laan, K Rooney, MMAM Mannens, P Henneman and B Sadikovic contributed to the initial study conception and design. L van der Laan, K Rooney and TMA Trooster designed the figures and tables. The manuscript was written by L van der Laan and K Rooney, and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Financial & competing interests disclosure

The authors have no relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript. This includes employment, consultancies, honoraria, stock ownership or options, expert testimony, grants or patents received or pending, or royalties.

No writing assistance was utilized in the production of this manuscript.

References

  • Cacabelos R , TelladoI , CacabelosP. The epigenetic machinery in the life cycle and pharmacoepigenetics. In: Pharmacoepigenetics.Elsevier, Amsterdam, The Netherlands, 1–100 (2019).
  • Fahrner JA , BjornssonHT. Mendelian disorders of the epigenetic machinery: tipping the balance of chromatin states. Annu. Rev. Genomics Hum. Genet.15, 269–293 (2014).
  • Lyko F . The DNA methyltransferase family: a versatile toolkit for epigenetic regulation. Nat. Rev. Genet.19(2), 81–92 (2018).
  • Hargreaves DC , CrabtreeGR. ATP-dependent chromatin remodeling: genetics, genomics and mechanisms. Cell Res.21(3), 396–420 (2011).
  • Poh WJ , WeeCPP , GaoZ. DNA methyltransferase activity assays: advances and challenges. Theranostics6(3), 369–391 (2016).
  • Gibson WT , HoodRL , ZhanSHet al. Mutations in EZH2 cause weaver syndrome. Am. J. Hum. Genet.90(1), 110–118 (2012).
  • Bjornsson HT . The Mendelian disorders of the epigenetic machinery. Genome Res.25(10), 1473–1481 (2015).
  • Churchill FB . William Johannsen and the genotype concept. J. Hist. Biol.7(1), 5–30 (1974).
  • Ahnert SE . Structural properties of genotype–phenotype maps. JR Soc. Interface14(132), 20170275 (2017).
  • Gibney ER , NolanCM. Epigenetics and gene expression. Heredity (Edinb.)105(1), 4–13 (2010).
  • Abel HJ , DuncavageEJ. Detection of structural DNA variation from next generation sequencing data: a review of informatic approaches. Cancer Genet.206(12), 432–440 (2013).
  • Zhang Y , HaraksinghR , GrubertFet al. Child development and structural variation in the human genome. Child Dev.84(1), 34–48 (2013).
  • Rice AM , MclysaghtA. Dosage-sensitive genes in evolution and disease. BMC Biol.15(1), 78 (2017).
  • Yamasaki M , MakinoT , KhorSSet al. Sensitivity to gene dosage and gene expression affects genes with copy number variants observed among neuropsychiatric diseases. BMC Med. Genomics13(1), 55 (2020).
  • Collins RL , BrandH , KarczewskiKJet al. A structural variation reference for medical and population genetics Aggregation Database Production Team*, Genome Aggregation Database Consortium*. Nature581, 444 doi:10.1038/s41586-020-2287-82020) ( Online).
  • Karczewski KJ , FrancioliLC , TiaoGet al. The mutational constraint spectrum quantified from variation in 141,456 humans, Genome Aggregation Database Consortium. Nature581, 434,doi:10.1038/s41586-020-2308-72020) ( Online).
  • Zarrei M , MacDonaldJR , MericoD , SchererSW. A copy number variation map of the human genome. Nat. Rev. Genet.16(3), 172–183 (2015).
  • Huang N , LeeI , MarcotteEM , HurlesME. Characterising and predicting haploinsufficiency in the human genome. PLOS Genet.6(10), 1–11 (2010).
  • Davoli T , XuAW , MengwasserKEet al. Cumulative haploinsufficiency and triplosensitivity drive aneuploidy patterns and shape the cancer genome. Cell155(4), 948 (2013).
  • Tatton-Brown K , ColeTR , RahmanNet al. Sotos Syndrome. In: Gene Reviews.Seattle, WA (2004).
  • Zhang H , LuX , BeasleyJet al. Reversed clinical phenotype due to a microduplication of Sotos syndrome region detected by array CGH: microcephaly, developmental delay and delayed bone age. Am. J. Med. Genet. A155(6), 1374–1378 (2011).
  • Dikow N , MaasB , GasparHet al. The phenotypic spectrum of duplication 5q35.2–q35.3 encompassing NSD1: is it really a reversed sotos syndrome? Am. J. Med. Genet. A 161(9), 2158–2166 (2013).
  • Brunetti-Pierri N , BergJS , ScagliaFet al. Recurrent reciprocal 1q21.1 deletions and duplications associated with microcephaly or macrocephaly and developmental and behavioral abnormalities. Nat. Genet.40(12), 1466–1471 (2008).
  • Lebon S , ChenQ , Martin-BrevetSet al. Defining the effect of the 16p11.2 duplication on cognition, behavior, and medical comorbidities. JAMA Psychiatry73(1), 20–30 (2016). doi:10.1001/jamapsychiatry.2015.2123
  • Jacquemont S , ReymondA , ZuffereyFet al. Mirror extreme BMI phenotypes associated with gene dosage at the chromosome 16p11.2 locus. Nature478(7367), 97–102 (2011).
  • Leblanc JJ , NelsonCA. Deletion and duplication of 16p11.2 are associated with opposing effects on visual evoked potential amplitude. Mol. Autism7(1), 30 (2016).
  • Chung WK , RobertsTP , SherrEH , SnyderLAG , SpiroJE. 16p11.2 deletion syndrome. Curr. Opin. Genet. Dev.68, 49–56 (2021).
  • Adamo A , AtashpazS , GermainPLet al. 7q11.23 dosage-dependent dysregulation in human pluripotent stem cells affects transcriptional programs in disease-relevant lineages. Nat. Genet.47(2), 132–141 (2015).
  • Strong E , ButcherDT , SinghaniaRet al. Symmetrical dose-dependent DNA-methylation profiles in children with deletion or duplication of 7q11.23. Am. J. Hum. Genet.97(2), 216–227 (2015).
  • Nussinov R , TsaiCJ , JangH. Protein ensembles link genotype to phenotype. PLOS Comput. Biol.15(6), e1006648 (2019).
  • Smajlagić D , LavrichenkoK , BerlandSet al. Population prevalence and inheritance pattern of recurrent CNVs associated with neurodevelopmental disorders in 12,252 newborns and their parents. Eur. J. Hum. Genet.29(1), 205–215 (2021).
  • Nowakowska B . Clinical interpretation of copy number variants in the human genome. J. Appl. Genet.58(4), 449–457 (2017).
  • Kerkhof J , SqueoGM , McConkeyHet al. DNA methylation episignature testing improves molecular diagnosis of Mendelian chromatinopathies. Genet. Med.24(1), 51–60 (2022).
  • Vidal S , XiolC , Pascual-alonsoA , O’callaghanM , PinedaM , ArmstrongJ. Genetic landscape of Rett syndrome spectrum: improvements and challenges. Int. J. Mol. Sci.20(16), 3925 (2019).
  • Wan TSK , HuiEKC , NgMHL. Chromosome recognition. Methods Mol. Biol.1541, 67–74 (2017).
  • Imataka G , ArisakaO. Chromosome analysis using spectral karyotyping (SKY). Cell Biochem. Biophys.62(1), 13–17 (2012).
  • Lichtenbelt KD , KnoersNVAM , Schuring-BlomGH. From karyotyping to array-CGH in prenatal diagnosis. Cytogenet. Genome Res.135(3–4), 241–250 (2011).
  • Vance GH . Cytogenetics/cytogenomics. In: Rodak’s Hematology.Elsevier, Amsterdam, The Netherlands, 525–539 (2020).
  • Miller DT , AdamMP , AradhyaSet al. Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies. Am. J. Hum. Genet.86(5), 749–764 (2010).
  • Y atsenko SA , ShawCA , OuZet al. Microarray-based comparative genomic hybridization using sex-matched reference DNA provides greater sensitivity for detection of sex chromosome imbalances than array-comparative genomic hybridization with sex-mismatched reference DNA. J. Mol. Diagn.11(3), 226–237 (2009).
  • Cheung SW , BiW. Novel applications of array comparative genomic hybridization in molecular diagnostics. Expert Rev. Mol. Diagn.18(6), 531–542 (2018).
  • Levy B , BurnsideRD. Are all chromosome microarrays the same? What clinicians need to know. Prenat. Diagn.39(3), 157–164 (2019).
  • Carter NP . Methods and strategies for analyzing copy number variation using DNA microarrays. Nat. Genet.39(Suppl. 7), S16–S21 (2007).
  • Vorstman JAS , JalaliGR , RappaportEF , HackerAM , ScottC , EmanuelBS. MLPA: a rapid, reliable, and sensitive method for detection and analysis of abnormalities of 22q. Hum. Mutat.27(8), 814–821 (2006).
  • Fernández L , LapunzinaP , ArjonaDet al. Comparative study of three diagnostic approaches (FISH, STRs and MLPA) in 30 patients with 22q11.2 deletion syndrome. Clin. Genet.68(4), 373–378 (2005).
  • Sørensen KM , AgergaardP , OlesenCet al. Detecting 22q11.2 deletions by use of multiplex ligation-dependent probe amplification on DNA from neonatal dried blood spot samples. J. Mol. Diagn.12(2), 147–151 (2010).
  • Anwar Iqbal M , BroeckelU , LevyBet al. Multi-site technical performance and concordance of optical genome mapping: constitutional postnatal study for SV, CNV, and repeat array analysis. medRxiv doi:10.1101/2021.12.27.21268432 ( Online) ( Epub ahead of print).
  • Cope H , BarseghyanH , BhattacharyaSet al. Detection of a mosaic CDKL5 deletion and inversion by optical genome mapping ends an exhaustive diagnostic odyssey. Mol. Genet. Genomic Med.9(7), e1665 (2021).
  • Sahajpal NS , BarseghyanH , KolheR , HastieA , ChaubeyA. Optical genome mapping as a next-generation cytogenomic tool for detection of structural and copy number variations for prenatal genomic analyses. Genes (Basel)12(3), 398 (2021).
  • N urk S , KorenS , RhieAet al. The complete sequence of a human genome. Science376(6588), 44–53 (2022).
  • Jelin AC , VoraN. Whole exome sequencing: applications in prenatal genetics. Obstet. Gynecol. Clin. North Am.45(1), 69–81 (2018).
  • Pirooznia M , GoesF , ZandiPP. Whole-genome CNV analysis: advances in computational approaches. Front. Genet.6(APR), 138 (2015).
  • Garrido-Cardenas JA , Garcia-MarotoF , Alvarez-BermejoJA , Manzano-AgugliaroF. DNA sequencing sensors: an overview. Sensors (Switzerland)17(3), 588 (2017).
  • Majewski J , SchwartzentruberJ , LalondeE , MontpetitA , JabadoN. What can exome sequencing do for you?J. Med. Genet.48(9), 580–589 (2011).
  • Liang WS , StephensonK , AdkinsJet al. Whole exome library construction for next generation sequencing. In: Methods in Molecular Biology.Humana Press Inc., NJ, USA, 163–174 (2018).
  • Kerkhof J , SchenkelLC , ReillyJet al. Clinical validation of copy number variant detection from targeted next-generation sequencing panels. J. Mol. Diagn.19(6), 905–920 (2017).
  • M anickam K , McClainMR , DemmerLAet al. Exome and genome sequencing for pediatric patients with congenital anomalies or intellectual disability: an evidence-based clinical guideline of the American College of Medical Genetics and Genomics (ACMG). Genet. Med.23(11), 2029–2037 (2021).
  • Brown CG , ClarkeJ. Nanopore development at Oxford Nanopore. Nat. Biotechnol.34(8), 810–811 (2016).
  • Kono N , ArakawaK. Nanopore sequencing: review of potential applications in functional genomics. Dev. Growth Differ.61(5), 316–326 (2019).
  • Xiao T , ZhouW. The third generation sequencing: the advanced approach to genetic diseases. Transl. Pediatr.9(2), 163–173 (2020).
  • Charnaud S , MunroJE , SemenecLet al. PacBio long-read amplicon sequencing enables scalable high-resolution population allele typing of the complex CYP2D6 locus. Commun. Biol.5(1), 168 (2022).
  • L evy MA , McConkeyH , KerkhofJet al. Novel diagnostic DNA methylation episignatures expand and refine the epigenetic landscapes of Mendelian disorders. HGG Adv.3(1), 100075 (2021).
  • Sadikovic B , LeviMA , KerkhofJet al. Clinical epigenomics: genome-wide DNA methylation analysis for the diagnosis of Mendelian disorders. Genet. Med.3(1), 1–10 (2021).
  • Aref-Eshghi E , KerkhofJ , PedroVPet al. Evaluation of DNA methylation episignatures for diagnosis and phenotype correlations in 42 Mendelian neurodevelopmental disorders. Am. J. Hum. Genet.106(3), 356–370 (2020).
  • Schenkel LC , Aref-EshghiE , RooneyKet al. DNA methylation epi-signature is associated with two molecularly and phenotypically distinct clinical subtypes of Phelan-McDermid syndrome. Cin. Epigenetics13(1), 1–17 (2021). doi:10.1186/s13148-020-00990-7
  • Aref-Eshghi E , BendEG , ColaiacovoSet al. Diagnostic utility of genome-wide DNA methylation testing in genetically unsolved individuals with suspected hereditary conditions. Am. J. Hum. Genet.104(4), 685–700 doi:10.1016/j.ajhg.2019.03.008 (2019) ( Online).
  • Schenkel LC , Aref-EshghiE , SkinnerCet al. Peripheral blood epi-signature of Claes–Jensen syndrome enables sensitive and specific identification of patients and healthy carriers with pathogenic mutations in KDM5C. Clin.Epigenetics.10(1), 1–11 (2018).
  • Hood RL , SchenkelLC , NikkelSMet al. The defining DNA methylation signature of Floating–Harbor Syndrome. Sci. Rep.6, 1–9 (2016) ( Online).
  • S chenkel LC , KernohanKD , McBrideAet al. Identification of epigenetic signature associated with alpha thalassemia/mental retardation X-linked syndrome. Epigenetics Chromatin10(1), 1–11 (2017).
  • Aref-Eshghi E , SchenkelLC , LinHet al. Clinical validation of a genome-wide DNA methylation assay for molecular diagnosis of imprinting disorders. J. Mol. Diagn.19(6), 848–856 doi:10.1016/j.jmoldx.2017.07.002 (2017). ( Online).
  • Joustra VW , LiYim AYF , de BruynJRet al. Peripheral blood DNA methylation profiles do not predict endoscopic post-operative recurrence in Crohn’s disease patients. Int. J. Mol. Sci.23(18), 10467 (2022).
  • Naue J , HoefslootHCJ , MookORFet al. Chronological age prediction based on DNA methylation: massive parallel sequencing and random forest regression. Forensic Sci. Int. Genet.31, 19–28 (2017).
  • Olson RS , UrbanowiczRJ , AndrewsPC , LavenderNA , KiddLC , MooreJH. Automating biomedical data science through tree-based pipeline optimization. (2016). http://arxiv.org/abs/1601.07925
  • Rooney K , LevyMA , HaghshenasSet al. Identification of a DNA methylation episignature in the 22q11.2 deletion syndrome. Int. J. Mol. Sci.22(16), 8611 (2021).
  • Rooney K , SadikovicB. DNA methylation episignatures in neurodevelopmental disorders associated with large structural copy number variants: clinical implications. Int. J. Mol. Sci.23(14), 7862 (2022). www.mdpi.com/1422-0067/23/14/7862
  • Zhou J , ZhengY , LiangGet al. Atypical deletion of Williams–Beuren syndrome reveals the mechanism of neurodevelopmental disorders. BMC Med. Genomics15(1), 79 (2022).
  • Xiao A , LiH , ShechterDet al. WSTF regulates the H2A.X DNA damage response via a novel tyrosine kinase activity. Nature457(7225), 57–62 (2009).
  • Paternoster V , EdhagerAV , QvistPet al. Inactivation of the schizophrenia-associated BRD1 gene in brain causes failure-to-thrive, seizure susceptibility and abnormal histone H3 acetylation and N-tail clipping. Mol. Neurobiol.58(9), 4495–4505 (2021).
  • L oviglio MN , LeleuM , MännikKet al. Chromosomal contacts connect loci associated with autism, BMI and head circumference phenotypes. Mol. Psychiatry22(6), 836–849 (2017).
  • Siu MT , ButcherDT , TurinskyALet al. Functional DNA methylation signatures for autism spectrum disorder genomic risk loci: 16p11.2 deletions and CHD8 variants. Cin. Epigenetics11(1), 1–19 (2019).
  • Krzyzewska IM , MaasSM , HennemanPet al. A genome-wide DNA methylation signature for SETD1B-related syndrome. Clin. Epigenetics11(1), 156 (2019).
  • Barski A , CuddapahS , CuiKet al. High-resolution profiling of histone methylations in the human genome. Cell129(4), 823–837 (2007).
  • Choufani S , CytrynbaumC , ChungBHYet al. NSD1 mutations generate a genome-wide DNA methylation signature. Nat. Commun.6, 10207 (2015).
  • Brennan K , ZhengH , FahrnerJAet al. NSD1 mutations deregulate transcription and DNA methylation of bivalent developmental genes in Sotos syndrome. Hum. Mol. Genet.31(13), 2164–2184 (2022).
  • Zhang L , SongD , ZhuB , WangX. The role of nuclear matrix protein HNRNPU in maintaining the architecture of 3D genome. Semin. Cell Dev. Biol.90, 161–167 (2019).
  • Depienne C , NavaC , KerenBet al. Genetic and phenotypic dissection of 1q43q44 microdeletion syndrome and neurodevelopmental phenotypes associated with mutations in ZBTB18 and HNRNPU. Hum. Genet.136(4), 463–479 (2017).
  • Zhu X , PetrovskiS , XiePet al. Whole-exome sequencing in undiagnosed genetic diseases: interpreting 119 trios. Genet. Med.17(10), 774–781 (2015).
  • Le TN , WilliamsSR , AlaimoJT , ElseaSH. Genotype and phenotype correlation in 103 individuals with 2q37 deletion syndrome reveals incomplete penetrance and supports HDAC4 as the primary genetic contributor. Am. J. Med. Genet. A179(5), 782–791 (2019).
  • Holland P , WildhagenM , IstreM , ReiakvamOM , DahlJA , SøraasA. Cri du chat syndrome patients have DNA methylation changes in genes linked to symptoms of the disease. Clin. Epigenetics14(1), 128 (2022).
  • Goodman SJ , CytrynbaumC , ChungBH-Yet al. EHMT1 pathogenic variants and 9q34.3 microdeletions share altered DNA methylation patterns in patients with Kleefstra syndrome. J.Transl. Genet. Genom.4, 144–158 (2020).
  • Aref-Eshghi E , BendEG , HoodRLet al. BAFopathies’ DNA methylation epi-signatures demonstrate diagnostic utility and functional continuum of Coffin–Siris and Nicolaides–Baraitser syndromes. Nat. Commun.9(1), 4885 (2018).
  • Ronzoni L , TagliaferriF , TucciA , BaccarinM , EspositoS , MilaniD. Interstitial 6q25 microdeletion syndrome: ARID1B is the key gene. Am. J. Med. Genet. A170(5), 1257–1261 (2016).
  • Awamleh Z , Chater-DiehlE , ChoufaniSet al. DNA methylation signature associated with Bohring–Opitz syndrome: a new tool for functional classification of variants in ASXL genes. Eur. J. Hum. Genet.30(6), 695–702 (2022).
  • Du Q , dela Morena MT , van OersNSC. The genetics and epigenetics of 22q11.2 deletion syndrome. Front. Genet.10, 1365 (2020).
  • K erkhof J , SqueoGM , McConkeyHet al. DNA methylation episignature testing improves molecular diagnosis of Mendelian chromatinopathies. Genet. Med.24(1), 51–60 (2021).