1,230
Views
0
CrossRef citations to date
0
Altmetric
Editorial

Epigenetic Epidemiology: Is there Cause for Optimism?

Pages 683-685 | Published online: 28 Aug 2015

Modulation of gene expression is influenced by environmental exposures, genetic variation and stochastic processes. Epigenetic biomarkers that signal the outcome of their combined impact could be important early predictors of susceptibility to chronic noncommunicable diseases. Can epigenetic epidemiology shed light on causality and mechanism?

Common complex traits arise over time and are the end result of lifestyle choices and a combination of environmental exposures during periods of developmental vulnerability and epigenetic plasticity in the presence of a combination of susceptible genetic variants. Genome-wide association studies to identify the genetic contributions to complex traits have identified many high probability allelic associations, but almost invariably each with a small effect, explaining only a modest percentage of the heritability of the trait. Notably, many estimates of heritability are inflated and do not adequately take into account-shared environment and other familial factors.

It is not new to suggest that tissue-specific epigenetic modulation plays a major role in the development of complex traits, yet few have managed to support this using a robust experimental approach. Whereas the genome remains constant, gene expression is labile, demonstrates cell type and developmental specificity, age and sex differences, and is variably responsive to changes in the environment.

What are the relationships between genetic variation and different environments that influence epigenetic responses? How do we begin to understand causal relationships that influence gene expression? From an experimental perspective, it is challenging to design studies that can address these questions, yet present a feasible and affordable approach. Large international and interdisciplinary studies are beginning to provide a good framework to better understand gene expression regulation in humans. Though open to criticism, their value should not be underestimated. They include ENCODE that examined 147 cell lines to search for functional elements in the genome [Citation1]; the Epigenomics Roadmap [Citation2] that examined 127 tissue biopsies from embryos and adults, both diseased and healthy, with a focus on identifying functional regulatory elements; the International Human Epigenome Consortium [Citation3]; and the Genome Tissue Expression project (GTEx) that explores the relationship between genetic variation and tissue-specific gene expression and alternate splicing by sequencing whole genomes and examining gene expression in over 20,000 tissue samples from 900 individuals [Citation4]. These data are beginning to give insight into cell specific, developmental stage, sex, age and ethnicity influenced, and stochastic gene expression differences at an individual and population level. The layered annotations in genome browsers highlight the importance of integrating transcriptome data (mRNA, miRNA and lncRNA) with DNA methylation, histone modification and chromatin interaction data to more fully understand function.

Epigenome-wide association studies (EWASs) to identify regulatory mechanisms and to predict their causal link to common diseases [Citation5] currently make use of a crude proxy by examining percentage methylation differences at ~2% of CpG sites in the genome, sites selected in part for their accessibility using available detection methods (viz. HumanMethylation450k BeadChip Array [Citation6]). Increasingly, it is accepted that individual CpG sites may not provide as much information as multiple CpG sites (differentially methylated regions). Several excellent reviews highlight the limitations and challenges of using archival samples from longitudinal cohort studies for epigenetic epidemiology [Citation7–10]. These DNA samples usually originate from blood, saliva or buccal cells and essentially many challenges remain as we grapple to understand which tissues and cell types would be most appropriate to investigate, what types of epigenetic changes we should be looking for, what effect sizes would be likely to have physiological impact (and therefore guide the statistical assessment of sample size for the studies), and whether we are looking at all the epigenetic mechanisms that matter.

Cellular heterogeneity in peripheral blood is recognized as a confounder in epigenetic disease and trait association studies [Citation11,Citation12]. Although bioinformatics algorithms have been developed to infer the cellular mix, this is not ideal [Citation13]. Two examples of successful epigenetic studies are highlighted, one for an environmental exposure and the other for a common trait.

Locus-specific associations of environmentally linked epigenetic changes have been described for in utero exposure to tobacco and adult smoking [Citation14,Citation15]. The AHRR locus methylation profile has been described as an epigenetic marker for tobacco exposure. Interestingly, it shows a degree of ethnic specificity when compared between European and southeast Asian populations, is sensitive to the cellular composition of the blood and is likely influenced by sex and age [Citation16]. It provides an opportunity to study objectively tested tobacco exposure with smoking-related outcomes, such as cancer and cardiovascular disease.

A recent systematic review assessed our current knowledge about the link between epigenetics and obesity [Citation17] and pointed out that some epigenetic markers for childhood obesity, in this case DNA methylation at the RXRA locus, could be detected at birth, providing opportunities for early intervention [Citation18]. What is important from an intervention perspective is that epigenetic markers such as DNA methylation and histone modifications are malleable and some of them can be altered in adult life through lifestyle changes, to result in a more favorable antiobesogenic profile in specific tissues [Citation19].

Epigenetic studies in humans remain limited for obvious practical and ethical considerations. Is there cause for optimism in unraveling complex epigenetic mechanisms for late onset chronic conditions with noncommunicable elements? Could this lead to improved prevention, prediction and treatment? Like epidemiological studies, epigenetic studies will be fraught with bias, confounding, reverse causation and bidirectional influences. However, using longitudinal cohorts and epidemiological approaches to epigenetics, is likely to yield robust findings that can be replicated [Citation20]. Statistical and modeling approaches to data analyses need to develop further to align with the complexity and ever changing nature of living organisms. The signal-to-noise ratio in genome-wide approaches to data generation and discovery remains a significant impediment to understanding biological processes, with abundant false-positive and false-negative signals. Good advice urges an approach for an integrated functional study design to begin to understand molecular mechanisms in diseases, including epigenetic modulation [Citation7]. Statistical deconvolution of tissues with mixed cell types would make legacy DNA sample collections more valuable, but their tissues of origin may still not be appropriate for the phenotype under investigation. The relatively recent and welcome focus on biobanking of specimens for future research will not only greatly enhance the ability to do epigenetic epidemiology, but underscores the need for sustainable infrastructure and funding to plan for future research. What is the subject of much debate is whose responsibility this should be, the individual researchers, the academic institutions, research funders, government bodies or other stakeholders.

It is already feasible to identify environmental causes of specific cancers by studying the mutation profiles in somatic tumor tissue [Citation21]. For example, CC to TT transitions at pyrimidine dimers are signatures of ultraviolet light exposure in nonmelanoma skin cancer, aflatoxin exposure is associated with a TP53 codon 249 mutation in hepatocellular carcinoma and G>T transversions at methylated CpG sites are characteristic of exposure to tobacco. We are beginning to detect epigenetic signals for exposure (e.g., smoking) and as markers for phenotype (e.g., obesity). When robust epigenetic and gene-expression biomarkers for risk are developed for diseases of lifestyle, more effective intervention strategies could be developed.

Conclusion

As new studies are planned and others are enriched with waves of data and biological specimen collection, it would be prudent to collect and prepare samples for future epigenetic studies. However, it is not a simple task to choose the appropriate approach and it will be an expensive addition to grant applications. Although we still lack adequate knowledge to fully inform our decisions, it is clear that cell type-specific studies are important, as tissues include many different cell types in different ratios and resulting data are therefore difficult to interpret. Fortunately, technologies are developing to assist data generation from small samples, even to the level of individual cells. Now that we have created an expectation that epigenetic signals are amenable to therapeutic manipulation, it is important to manage the expectations of the health professionals and of the patients during this important discovery phase.

Future perspective

Understanding the mechanisms that regulate gene expression and function provides an opportunity for targeted therapeutic intervention. However, its impact on reducing the health burden of common noncommunicable diseases will only be apparent when we develop a deeper understanding of the complex interrelationships between environment, behavior, physiological response and genetic predisposition. With increasing knowledge comes optimism.

Disclaimer

The views expressed do not necessarily represent the official views of the National Research Foundation of South Africa. The content is solely the view of the author and does not necessarily represent the official views of the NIH.

Financial & competing interests disclosure

M Ramsay is supported by the South African Research Chairs Initiative of the Department of Science and Technology and National Research Foundation (NRF) of South Africa. The author has no other relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript apart from those disclosed.

No writing assistance was utilized in the production of this manuscript.

Additional information

Funding

M Ramsay is supported by the South African Research Chairs Initiative of the Department of Science and Technology and National Research Foundation (NRF) of South Africa. The author has no other relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript apart from those disclosed.No writing assistance was utilized in the production of this manuscript.

References

  • Consortium EP . An integrated encyclopedia of DNA elements in the human genome . Nature489 ( 7414 ), 57 – 74 ( 2012 ).
  • Epigenomics roadmap . www.nature.com/epigenomeroadmap .
  • Time for the epigenome . Nature493 ( 7281 ), 587 ( 2010 ).
  • Genome tissue expression project . www.gtexportal.org/ .
  • Rakyan VK , DownTA , BaldingDJ , BeckS . Epigenome-wide association studies for common human diseases . Nat. Rev. Genet.12 ( 8 ), 529 – 541 ( 2011 ).
  • Sandoval J , HeynH , MoranSet al. Validation of a DNA methylation microarray for 450,000 CPG sites in the human genome . Epigenetics6 ( 6 ), 692 – 702 ( 2014 ).
  • Mill J , HeijmansBT . From promises to practical strategies in epigenetic epidemiology . Nat. Rev. Genet.14 ( 8 ), 585 – 594 ( 2013 ).
  • Heyn H , EstellerM . DNA methylation profiling in the clinic: applications and challenges . Nat. Rev. Genet.13 ( 10 ), 679 – 692 ( 2012 ).
  • Ng JW , BarrettLM , WongA , KuhD , SmithGD , ReltonCL . The role of longitudinal cohort studies in epigenetic epidemiology: challenges and opportunities . Genome Biol.13 ( 6 ), 246 ( 2012 ).
  • Bakulski KM , FallinMD . Epigenetic epidemiology: promises for public health research . Environ. Mol. Mutat.55 ( 3 ), 171 – 183 ( 2014 ).
  • Wu HC , Delgado-CruzataL , FlomJDet al. Global methylation profiles in DNA from different blood cell types . Epigenetics6 ( 1 ), 76 – 85 ( 2011 ).
  • Reinius LE , AcevedoN , JoerinkMet al. Differential DNA methylation in purified human blood cells: implications for cell lineage and studies on disease susceptibility . PLoS ONE7 ( 7 ), e41361 ( 2012 ).
  • Houseman EA , AccomandoWP , KoestlerDCet al. DNA methylation arrays as surrogate measures of cell mixture distribution . BMC Bioinformatics13 , 86 ( 2012 ).
  • Joubert BR , HabergSE , NilsenRMet al. 450k epigenome-wide scan identifies differential DNA methylation in newborns related to maternal smoking during pregnancy . Environ. Health Perspect.120 ( 10 ), 1425 – 1431 ( 2012 ).
  • Breitling LP , YangR , KornB , BurwinkelB , BrennerH . Tobacco-smoking-related differential DNA methylation: 27k discovery and replication . Am. J. Hum. Genet.88 ( 4 ), 450 – 457 ( 2011 ).
  • Elliott HR , TillinT , McardleWLet al. Differences in smoking associated DNA methylation patterns in South Asians and Europeans . Clin. Epigenet.6 ( 1 ), 4 ( 2014 ).
  • Van Dijk SJ , MolloyPL , VarinliH , MorrisonJL , MuhlhauslerBS . Members of epiSCOPE . Epigenetics and human obesity . Int. J. Obes.39 ( 1 ), 85 – 97 ( 2015 ).
  • Godfrey KM , SheppardA , GluckmanPDet al. Epigenetic gene promoter methylation at birth is associated with child’s later adiposity . Diabetes60 ( 5 ), 1528 – 1534 ( 2011 ).
  • Ge K . Epigenetic regulation of adipogenesis by histone methylation . Biochim. Biophys. Acta1819 ( 7 ), 727 – 732 ( 2012 ).
  • Relton CL , Davey SmithG . Is epidemiology ready for epigenetics?Int. J. Epidemiol.41 ( 1 ), 5 – 9 ( 2012 ).
  • Wild CP , ScalbertA , HercegZ . Measuring the exposome: a powerful basis for evaluating environmental exposures and cancer risk . Environ. Mol. Mutat.54 ( 7 ), 480 – 499 ( 2013 ).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.