Original Research

A common genetic variation in CEBPE and acute lymphoblastic leukemia: a meta-analysis of the available evidence

Pages 2443-2451 | Published online: 07 Sep 2015

Abstract

Acute lymphoblastic leukemia (ALL) has been studied intensively for decades, but the details of its etiology and underlying mechanisms have yet to be fully elucidated. It is now generally acknowledged that genetic factors contribute greatly to the development of this disease. The gene encoding CCAAT/enhancer-binding protein ε (CEBPE), and in particular its rs2239633 single nucleotide polymorphism (SNP), has been implicated in the development of leukemia. The association between rs2239633 and risk of ALL has been studied repeatedly, but the results remain inconsistent. Therefore, a meta-analysis was performed in this study to obtain a more precise estimate of that relationship. A comprehensive literature search of the PubMed electronic database was conducted, and relevant studies published up to February 20, 2015 were selected for analysis. The references of the retrieved articles were also screened. The extracted data were analyzed statistically, and pooled odds ratios with 95% confidence intervals were calculated using Review Manager (version 5.2) to estimate the strength of the association. Eleven studies were ultimately included in the meta-analysis. The pooled analyses revealed that rs2239633 was associated with an increased risk of childhood ALL in Caucasians under all contrast models (P<0.01). However, this SNP did not affect the risk of ALL in adulthood among Caucasians, or in childhood among East Asians. In conclusion, these findings indicate that the CEBPE rs2239633 SNP could be considered a good marker of pediatric ALL risk in Caucasians, but not in East Asians; it is not a good marker of adult ALL risk in Caucasians.

Background

Acute lymphoblastic leukemia (ALL) is the most common malignancy in childhood, accounting for 30% of all childhood cancers.Citation1 The etiology of childhood ALL is mostly unknown, although infections in the first years of life and some environmental factors, such as ionizing radiation and parental alcohol and tobacco use, are thought to play a causative role.Citation2

ALL is known to result from an accumulation of mutations in tumor suppressor genes and oncogenes, and from genetic alterations affecting several chromosomes.Citation2 Genetic variation may therefore be an important factor in its development. This possibility has prompted several genome-wide association studies aimed at identifying common single nucleotide polymorphisms (SNPs) that affect susceptibility to childhood ALL.Citation2–Citation6 One of the most widely investigated SNPs identified in these studies is the common rs2239633 SNP of CEBPE, the gene encoding CCAAT/enhancer-binding protein ε, which plays an important role in the regulation of myelopoiesis.Citation7 Papaemmanuil et alCitation4 also found that CEBPE is a suppressor of myeloid leukemogenesis, and may be involved in B-cell precursor ALL development. The populations studied in the aforementioned genome-wide association studies were limited, and the many subsequent association studies conducted to verify the effects of rs2239633 on susceptibility to childhood ALL in different populations have yielded varying results.Citation8–Citation13

A meta-analysis was performed in the present study to review and compare all relevant studies, enabling more accurate conclusions to be drawn. In addition, since the genetic background of ALL differs between the adult and childhood forms, two studies have examined whether this SNP is a risk factor for adult ALL.Citation14,Citation15 Those two studies were also analyzed here to examine the differential roles of rs2239633 in children and adults.

Materials and methods

Literature search strategy

Two of the authors (Zhang and Du) independently searched the PubMed database. The process was supervised by a third author (Zhai), and any disagreement was resolved by consensus. The search was designed to identify all studies matching the following search terms: (14q11.2 OR CEBPE OR rs2239633) AND (leukemia). The language of publication was restricted to English. The reference lists of retrieved articles were screened manually. The literature search was last updated on February 20, 2015. This study was judged exempt from ethical review.

Selection criteria

All studies with the aim of investigating the association between presence of the rs2239633 SNP and ALL risk were potentially included. The following inclusion criteria were applied:

  1. Published articles on human genetics (full texts or abstracts) without racial restriction.

  2. If articles contained more than one geographic or other clinical characteristic subgroup, each subgroup was considered separately.

  3. If multiple studies were derived from the same population, only the study with the largest sample was analyzed to avoid overlapping data.

  4. Presentation of sufficient data on the distribution of rs2239633 in cases and in control groups or the data necessary to calculate those distributions. If the data were deficient or presented in an inappropriate form, the original authors were contacted and asked for the raw data.

Extracted information

A standard data-collection procedure was used, in line with the aforementioned inclusion criteria. Two of the authors (Zhang and Du) independently extracted the variables from the individual eligible studies in duplicate and made the characteristics compatible in a pooled database. Any encountered disagreements were resolved by discussion in order to reach a consensus. The following information was extracted from all eligible studies: name of the first author, year of publication, ethnicities of the study population, age and number of cases and controls, and main finding. The distribution of the genotypes among the controls was also tested for conformity to Hardy–Weinberg equilibrium (HWE).

Statistical analysis

The meta-analysis examined the overall association for allele contrasts, homozygote contrasts, and recessive and dominant models. The presence of heterogeneity was assessed using Cochran’s Q statistic (with a significance cutoff of P<0.1) and quantified using the I² statistic, which increases with the degree of heterogeneity; I² ranges from 0% to 100%, with a value of 0% indicating homogeneity and a value above 50% indicating substantial heterogeneity.Citation16
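The heterogeneity measures described above can be sketched as follows. This is a minimal illustration, not the Review Manager implementation: it assumes inverse-variance weights on the log odds ratio scale, and the function name `heterogeneity` and its inputs are hypothetical.

```python
def heterogeneity(log_ors, variances):
    """Return (Q, degrees of freedom, I^2 in percent) for a set of studies.

    log_ors   -- per-study log odds ratios (assumed inputs)
    variances -- per-study variances of the log odds ratios
    """
    # Inverse-variance weights (an assumption of this sketch)
    weights = [1.0 / v for v in variances]
    pooled = sum(w * y for w, y in zip(weights, log_ors)) / sum(weights)

    # Cochran's Q: weighted squared deviations from the pooled estimate
    q = sum(w * (y - pooled) ** 2 for w, y in zip(weights, log_ors))
    df = len(log_ors) - 1

    # I^2 = (Q - df) / Q, truncated at zero and expressed as a percentage
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0
    return q, df, i2
```

Under this definition, an I² of 0% corresponds to Q being no larger than its degrees of freedom, which is why identical study estimates give I²=0%.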

The raw genotype distribution data were used to calculate the study-specific odds ratio (OR) and 95% confidence interval (CI). The overall pooled OR and 95% CI were estimated using the Mantel–Haenszel method with a fixed-effects model when no significant clinical or statistical heterogeneity was present; when substantial heterogeneity was present, a random-effects model was used. The significance of the pooled OR was determined by a Z test. The cutoff for statistical significance was set at P<0.05.
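As a rough sketch of these calculations, the code below builds a 2×2 table from genotype counts under each contrast model, computes a study-specific OR with a 95% CI, and pools tables with the Mantel–Haenszel estimator. The genotype labels (AA, Aa, aa, with A as the assumed risk allele), the function names, and the use of the Woolf (log-scale) interval are assumptions made for illustration; the CI of the pooled OR is omitted.

```python
import math

def contrast_table(case, control, model):
    """Collapse (AA, Aa, aa) genotype counts into a 2x2 table (a, b, c, d).

    a, b -- exposed and unexposed counts among cases
    c, d -- exposed and unexposed counts among controls
    'A' is the assumed risk allele (hypothetical labeling).
    """
    def split(genotypes):
        n_AA, n_Aa, n_aa = genotypes
        if model == "dominant":      # AA + Aa versus aa
            return n_AA + n_Aa, n_aa
        if model == "recessive":     # AA versus Aa + aa
            return n_AA, n_Aa + n_aa
        if model == "homozygote":    # AA versus aa
            return n_AA, n_aa
        if model == "allele":        # A alleles versus a alleles
            return 2 * n_AA + n_Aa, 2 * n_aa + n_Aa
        raise ValueError(f"unknown model: {model}")

    a, b = split(case)
    c, d = split(control)
    return a, b, c, d

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Study-specific OR with a Woolf 95% CI (all cells must be non-zero)."""
    odds_ratio = (a * d) / (b * c)
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper

def mantel_haenszel_or(tables):
    """Fixed-effects Mantel-Haenszel pooled OR over a list of 2x2 tables."""
    numerator = sum(a * d / (a + b + c + d) for a, b, c, d in tables)
    denominator = sum(b * c / (a + b + c + d) for a, b, c, d in tables)
    return numerator / denominator
```

For example, `mantel_haenszel_or([contrast_table(case, control, "dominant") for case, control in studies])` would pool a hypothetical list `studies` of per-study genotype counts under the dominant model.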

Potential publication bias was estimated by constructing funnel plots. If most of the data appear at the top of a funnel plot and are distributed roughly symmetrically, this suggests the absence of obvious publication bias, and vice versa.Citation13 There was no need to construct funnel plots when there were too few analyzed studies (ie, n<5). All of these statistical analyses were performed using Review Manager (version 5.2) (Cochrane, London, UK).
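The coordinates plotted in such a funnel plot can be sketched as below, assuming the usual layout in which each study is placed at its log OR against the standard error of the log OR. The function name `funnel_points` and the 2×2 table layout (a, b, c, d) are hypothetical; plotting itself is left out.

```python
import math

def funnel_points(tables):
    """Return (log OR, SE of log OR) for each 2x2 table (a, b, c, d).

    a, b -- exposed and unexposed cases; c, d -- exposed and unexposed controls.
    All cells are assumed to be non-zero.
    """
    points = []
    for a, b, c, d in tables:
        log_or = math.log((a * d) / (b * c))
        se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
        points.append((log_or, se))
    return points
```

Large studies have small standard errors and so sit near the top of an inverted SE axis; a roughly symmetric scatter of the smaller studies around the pooled estimate is what the text treats as evidence against obvious publication bias.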

The conformity of the rs2239633 SNP to HWE was tested among the controls in each individual study via the chi-square test or Fisher’s exact test using SPSS software (SPSS Inc., Chicago, IL, USA). Sensitivity analyses were performed by excluding those studies for which the data deviated from HWE (P<0.05), omitting the study with the highest weight, or changing the model.
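A chi-square test of HWE among controls can be sketched as follows. This is an illustrative calculation, not the SPSS procedure used in the study; the function name and the genotype labels (AA, Aa, aa) are assumptions, and the test uses one degree of freedom.

```python
import math

def hwe_chi_square(n_AA, n_Aa, n_aa):
    """Chi-square goodness-of-fit test for Hardy-Weinberg equilibrium.

    Returns (chi-square statistic, P value with 1 degree of freedom).
    Both alleles must be present, otherwise an expected count is zero.
    """
    n = n_AA + n_Aa + n_aa
    p = (2 * n_AA + n_Aa) / (2 * n)   # frequency of allele A
    q = 1 - p                         # frequency of allele a

    observed = (n_AA, n_Aa, n_aa)
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))

    # Survival function of the chi-square distribution with 1 df
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value
```

A study whose controls give a P value below 0.05 would be flagged as deviating from HWE and could be dropped in the sensitivity analysis.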

Results

Literature search and characteristics of the retrieved studies

Figure 1 presents a flow diagram summarizing the process of study selection. The final search generated 65 potentially relevant publications. After reading the titles and abstracts, 44 articles were found to be irrelevant to this study in that the research did not cover the association between rs2239633 and susceptibility to ALL, and were thus excluded from the analysis. Of the remaining 21 publications, 3 were case reports, 2 were reviews, and 1 was a conference proceedings; these were therefore also excluded. Finally, 15 publications were identified, in which the effect of rs2239633 on the risk of adult or childhood ALL was evaluated. Of these, four did not supply enough data, two were about adults,Citation14,Citation15 and nine were about children.Citation4,Citation8–Citation13,Citation17,Citation18 The 11 studies that provided enough data were conducted in populations of different ethnicities: eight involved Caucasians,Citation4,Citation10,Citation11,Citation13–Citation15,Citation17,Citation18 two involved East Asians,Citation9,Citation12 and one involved multiple races.Citation8 The basic data for every eligible study were extracted and are listed in Table 1. Although the authors of some of the publications were contacted to obtain the raw genotype data, some of the data could not be obtained.
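The counts in the selection process can be checked with a few lines of arithmetic. The variable names are illustrative; all numbers are taken from the paragraph above.

```python
# Counts reported in the study selection paragraph
retrieved = 65
irrelevant = 44
remaining = retrieved - irrelevant                      # 21 publications

excluded_by_type = {"case reports": 3, "reviews": 2, "conference proceedings": 1}
identified = remaining - sum(excluded_by_type.values())  # 15 publications

insufficient_data = 4
included = identified - insufficient_data                # 11 studies

# Breakdown of the included studies by age group and by ethnicity
by_age = {"adult": 2, "childhood": 9}
by_ethnicity = {"Caucasian": 8, "East Asian": 2, "multiple races": 1}

assert remaining == 21
assert identified == 15
assert included == sum(by_age.values()) == sum(by_ethnicity.values()) == 11
```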

Figure 1 Flow chart of study selection process.

Abbreviation: ALL, acute lymphoblastic leukemia.

Table 1 Study characteristics

Meta-analysis

rs2239633 SNP and ALL risk in childhood

As stated, nine papers studied the association between rs2239633 and risk of ALL in childhood and provided data about genotype distribution. The data from these studies were pooled and subjected to a meta-analysis. Overall, this SNP was found to be significantly associated with risk of ALL in childhood under the dominant model (Figure 2A, P<0.00001, OR=1.27, 95% CI=1.15–1.39), recessive model (Figure 2B, P<0.00001, OR=1.23, 95% CI=1.14–1.33), and allele contrast (Figure 2C, P<0.00001, OR=1.18, 95% CI=1.12–1.24). However, heterogeneity was found in the recessive model (Figure 2B, I²=56%) and allele contrast (Figure 2C, I²=57%), indicating the need for a subgroup analysis. Therefore, a subgroup meta-analysis by race was conducted. Among the included studies, the subjects in six were Caucasians,Citation4,Citation10,Citation11,Citation13,Citation17,Citation18 those in two were East Asians,Citation9,Citation12 and those in the remaining study were a mixed ethnic population.Citation8 The subgroup analysis revealed that under the three contrast models, rs2239633 was significantly associated with childhood ALL in Caucasians (Figure 2A, P<0.00001, OR=1.31, 95% CI=1.18–1.46 for dominant model; Figure 2B, P<0.00001, OR=1.32, 95% CI=1.21–1.44 for recessive model; Figure 2C, P<0.00001, OR=1.23, 95% CI=1.16–1.30 for allele contrast), but not in East Asians (Figure 2). The degree of heterogeneity in both the Caucasian and East Asian subgroups was relatively small (Figure 2, I²=28%, 29%, and 36% for Caucasians under the dominant, recessive, and allele contrast models, respectively; I²=0% for East Asians under the dominant, recessive, and allele contrast models).

Figure 2 Meta-analysis of the association between rs2239633 and childhood ALL risk.

Notes: (A) Under the dominant model; (B) under the recessive model; (C) under the allele contrast model. A superscript “a” indicates another population studied in the same article.
Abbreviations: ALL, acute lymphoblastic leukemia; CI, confidence interval; M–H, Mantel–Haenszel method.

rs2239633 and ALL risk in adults

Two of the studies investigated the relationship between the rs2239633 SNP and risk of ALL in adults, and provided the genotype data.Citation14,Citation15 The subjects in both of those studies were Caucasians. After pooling these data, further analysis revealed that this SNP was not associated with risk of ALL in adults under any model of comparison (Figure 3A, P=0.18, OR=1.21, 95% CI=0.91–1.61 for dominant model; Figure 3B, P=0.79, OR=1.03, 95% CI=0.81–1.32 for recessive model; Figure 3C, P=0.35, OR=1.08, 95% CI=0.92–1.26 for allele contrast). There was no heterogeneity in any of the three contrast models (Figure 3, I²=0%).

Figure 3 Meta-analysis of the association between rs2239633 and adult ALL risk.

Notes: (A) Under the dominant model; (B) under the recessive model; (C) under the allele contrast model.
Abbreviations: ALL, acute lymphoblastic leukemia; CI, confidence interval; M–H, Mantel–Haenszel method.

Publication bias and sensitivity analysis

Publication bias was assessed for the studies with pediatric subjects. Overall, no obvious publication bias was found under the recessive model, and the funnel plot exhibited good symmetry under the dominant model if the study of Vijayakrishnan et alCitation12 was excluded. The result did not change after excluding this article. The symmetry of the funnel plot was poor under the allele contrast model, which suggested obvious publication bias (Figure 4).

Figure 4 Funnel plots for assessing publication bias in the studies of childhood ALL.

Notes: (A) Under the dominant model; (B) under the recessive model; (C) under the allele contrast model.
Abbreviation: SE, standard error.

The stability of the results involving Caucasians was evaluated by performing a sensitivity analysis. Neither omitting Prasad et al’sCitation13 study (highest weight) nor changing the model (random-effects model) made a significant difference to the pooled effects of the three contrast models, suggesting that the meta-analysis findings were highly stable.
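The two sensitivity checks described here, dropping the highest-weight study and switching from a fixed-effects to a random-effects model, can be sketched as below. This is an illustration under stated assumptions rather than the Review Manager procedure: it pools log odds ratios with inverse-variance weights instead of Mantel–Haenszel weights, uses the DerSimonian–Laird estimate of between-study variance, and all function names are hypothetical.

```python
def pool_fixed(log_ors, variances):
    """Inverse-variance fixed-effects pooled log OR."""
    weights = [1.0 / v for v in variances]
    return sum(w * y for w, y in zip(weights, log_ors)) / sum(weights)

def pool_random(log_ors, variances):
    """DerSimonian-Laird random-effects pooled log OR."""
    weights = [1.0 / v for v in variances]
    fixed = pool_fixed(log_ors, variances)
    q = sum(w * (y - fixed) ** 2 for w, y in zip(weights, log_ors))
    c = sum(weights) - sum(w * w for w in weights) / sum(weights)
    # Between-study variance, truncated at zero
    tau2 = max(0.0, (q - (len(log_ors) - 1)) / c) if c > 0 else 0.0
    adjusted = [1.0 / (v + tau2) for v in variances]
    return sum(w * y for w, y in zip(adjusted, log_ors)) / sum(adjusted)

def drop_highest_weight(log_ors, variances):
    """Remove the study with the smallest variance (largest weight)."""
    heaviest = min(range(len(variances)), key=lambda k: variances[k])
    kept_ors = [y for k, y in enumerate(log_ors) if k != heaviest]
    kept_vars = [v for k, v in enumerate(variances) if k != heaviest]
    return kept_ors, kept_vars
```

If the pooled estimate changes little between `pool_fixed` and `pool_random`, and between the full data and the output of `drop_highest_weight`, the result would be considered stable in the sense used above.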

Discussion

CEBPE is involved in terminal differentiation and functional maturation of neutrophils and macrophages, and is critical for transcription of most granule proteins localized to specific and gelatinase granules as well as for azurophil granule proteins expressed in the late promyelocyte stage.Citation19,Citation20 It has also been reported that CEBPE is targeted by recurrent immunoglobulin heavy-chain (located on chromosome band 14q32) chromosomal translocations in B-cell precursor ALL, and that mutations in CEBPE lead to neutrophil-specific granule deficiency.Citation9,Citation19,Citation20 All of these findings suggest that CEBPE plays a role in the development of ALL. The studies conducted thus far on the relationship between rs2239633 and risk of ALL have yielded a relatively small amount of data and have produced conflicting results. Those data were evaluated in the present meta-analysis. The primary findings of this investigation are the increased OR of childhood ALL susceptibility associated with presence of the rs2239633 SNP in a fixed-effects model, and the lack of an association between this SNP and risk of ALL in adults.

Subgroup analysis for childhood ALL according to race revealed that this SNP impacted Caucasians, and not Asians. This finding was extremely robust, remaining significant irrespective of the different model assumptions (fixed-effects versus random-effects models), genetic models, and additional sensitivity analysis. This disparity between races may reflect the existence of a true population-specific disease variant, but may also be attributable to differences in genomic structure at these loci between populations. Wang et alCitation9 determined the linkage disequilibrium (LD) patterns at the locus of rs2239633 in European and Chinese populations from the HapMap database, and found that rs2239633 is located in different LD blocks in the two populations. Therefore, we speculate that the pathogenic locus might not be rs2239633, but rather other SNPs in the LD block. Other factors contributing to the differential results for the two populations include differences in allele frequency and differences in genetic and environmental backgrounds that interact with the variants.Citation12

It has been proposed that genetics plays a role in adult cancer susceptibility, although this has been difficult to evaluate because of confounders, such as environmental factors, the immune system, and exposure to pathogens. To the best of our knowledge, no predisposing genetic marker has been proven for adult ALL. No association between rs2239633 and risk of adult ALL was found in this study. Although a weak association may have gone undetected because of the relatively small sample, these data suggest that the genetic impact of polymorphisms at these loci is at least weaker for adult ALL than it is for pediatric ALL, and they highlight genetic differences between ALL occurring in adults versus children.Citation15

Based on a meta-analysis, Wang et alCitation21 concluded that rs2239633 confers an increased risk of childhood ALL, especially among Caucasians and Hispanics. The main findings of the present meta-analysis are consistent with that conclusion. In addition, our study evaluated the adult ALL risk associated with rs2239633 and used three different genetic models, providing a more comprehensive evaluation of the impact of rs2239633 on ALL. We also believe that the article of Wang et alCitation21 contained some data errors. The data of Chokkalingam et alCitation22 and Walsh et alCitation23 that were used in the meta-analysis performed by Wang et alCitation21, in particular the OR and 95% CI values, were calculated using logistic regression. Because OR and 95% CI values generally differ before and after adjustment, we speculate that Wang et alCitation21 did not use the raw data from the studies of Chokkalingam et alCitation22 and Walsh et al,Citation23 thus potentially introducing bias. It is not possible to derive raw data from adjusted data. Before completing our study, we attempted to contact the authors of those two articles to obtain the raw genotype data,Citation22,Citation23 but were unsuccessful. Therefore, these two articles were excluded from the analysis, which we believe was the most appropriate action.

It is worth noting that the results did not differ among the three contrast models, although the funnel plot obtained under the allele contrast model exhibited poor symmetry and a clear outlier appeared in the plot obtained under the dominant model, indicating that the choice of genetic model had little impact on the results.

Our study suggests that genotype testing may be valuable, although only a statistical association was found. Based on genotype testing, susceptible individuals could take the necessary protective measures, which is particularly important for those with long-term exposure to environmental pollutants.

This study had some limitations. First, the effect of gene–gene and gene–environment interactions was not addressed in the meta-analysis due to lack of data. Second, the meta-analysis was based on unadjusted estimates, and confounding factors could not be controlled for because most studies did not provide these data. Third, regarding publication bias, good symmetry was observed for the funnel plot produced under the recessive genetic model, but not for those produced under the dominant model and allele contrast. Finally, the conclusions regarding risk of ALL among adults and Asian children are based on relatively few cases, and so their statistical power is limited.

Conclusion

In summary, the findings of the present meta-analysis support a positive association between CEBPE rs2239633 and childhood risk of ALL in Caucasians, but there was no such association for risk of ALL in Caucasian adults or East Asian children. Further large-scale studies assessing gene–gene and gene–environment interactions are required.

Acknowledgments

This work was supported by the Natural Science Foundation of Shaanxi Province (number 2015JM8415) and the Fundamental Research Funds for the Central Universities of China (number 2011jdhz55).

Disclosure

The authors report no conflicts of interest in this work.

References

  1. Karathanasis NV, Choumerianou DM, Kalmanti M. Gene polymorphisms in childhood ALL. Pediatr Blood Cancer. 2009;52(3):318–323. PMID 18989891
  2. Han S, Lee KM, Park SK. Genome-wide association study of childhood acute lymphoblastic leukemia in Korea. Leuk Res. 2010;34(10):1271–1274. PMID 20189245
  3. Borst L, Wesolowska A, Joshi T. Genome-wide analysis of cytogenetic aberrations in ETV6/RUNX1-positive childhood acute lymphoblastic leukemia. Br J Haematol. 2012;157(4):476–482. PMID 22404039
  4. Papaemmanuil E, Hosking FJ, Vijayakrishnan J. Loci on 7p12.2, 10q21.2 and 14q11.2 are associated with risk of childhood acute lymphoblastic leukemia. Nat Genet. 2009;41(9):1006–1010. PMID 19684604
  5. Trevino LR, Yang W, French D. Germline genomic variants associated with childhood acute lymphoblastic leukemia. Nat Genet. 2009;41(9):1001–1005. PMID 19684603
  6. Yang JJ, Cheng C, Yang W. Genome-wide interrogation of germline genetic variation associated with treatment response in childhood acute lymphoblastic leukemia. JAMA. 2009;301(4):393–403. PMID 19176441
  7. Akasaka T, Balasas T, Russell LJ. Five members of the CEBP transcription factor family are targeted by recurrent IGH translocations in B-cell precursor acute lymphoblastic leukemia (BCP-ALL). Blood. 2007;109(8):3451–3461. PMID 17170124
  8. Emerenciano M, Barbosa TC, Lopes BA. ARID5B polymorphism confers an increased risk to acquire specific MLL rearrangements in early childhood leukemia. BMC Cancer. 2014;14:127. PMID 24564228
  9. Wang Y, Chen J, Li J. Association of three polymorphisms in ARID5B, IKZF1 and CEBPE with the risk of childhood acute lymphoblastic leukemia in a Chinese population. Gene. 2013;524(2):203–207. PMID 23608171
  10. Ross JA, Linabery AM, Blommer CN. Genetic variants modify susceptibility to leukemia in infants: a Children’s Oncology Group report. Pediatr Blood Cancer. 2013;60(1):31–34. PMID 22422485
  11. Pastorczak A, Gorniak P, Sherborne A. Role of 657del5 NBN mutation and 7p12.2 (IKZF1), 9p21 (CDKN2A), 10q21.2 (ARID5B) and 14q11.2 (CEBPE) variation and risk of childhood ALL in the Polish population. Leuk Res. 2011;35(11):1534–1536. PMID 21889209
  12. Vijayakrishnan J, Sherborne AL, Sawangpanich R, Hongeng S, Houlston RS, Pakakasama S. Variation at 7p12.2 and 10q21.2 influences childhood acute lymphoblastic leukemia risk in the Thai population and may contribute to racial differences in leukemia incidence. Leuk Lymphoma. 2010;51(10):1870–1874. PMID 20919861
  13. Prasad RB, Hosking FJ, Vijayakrishnan J. Verification of the susceptibility loci on 7p12.2, 10q21.2, and 14q11.2 in precursor B-cell acute lymphoblastic leukemia of childhood. Blood. 2010;115(9):1765–1767. PMID 20042726
  14. Burmeister T, Bartels G, Groger D. Germline variants in IKZF1, ARID5B, and CEBPE as risk factors for adult-onset acute lymphoblastic leukemia: an analysis from the GMALL study group. Haematologica. 2014;99(2):e23–e25. PMID 24497567
  15. Peyrouze P, Guihard S, Grardel N. Genetic polymorphisms in ARID5B, CEBPE, IKZF1 and CDKN2A in relation with risk of acute lymphoblastic leukaemia in adults: a Group for Research on Adult Acute Lymphoblastic Leukaemia (GRAALL) study. Br J Haematol. 2012;159(5):599–602. PMID 23016962
  16. Wu Z, Lou Y, Qiu X. Association of cholesteryl ester transfer protein (CETP) gene polymorphism, high density lipoprotein cholesterol and risk of coronary artery disease: a meta-analysis using a Mendelian randomization approach. BMC Med Genet. 2014;15(1):118. PMID 25366166
  17. Lautner-Csorba O, Gezsi A, Semsei AF. Candidate gene association study in pediatric acute lymphoblastic leukemia evaluated by Bayesian network based Bayesian multilevel analysis of relevance. BMC Med Genomics. 2012;5:42. PMID 23021489
  18. Orsi L, Rudant J, Bonaventure A. Genetic polymorphisms and childhood acute lymphoblastic leukemia: GWAS of the ESCALE study (SFCE). Leukemia. 2012;26(12):2561–2564. PMID 22660188
  19. Akagi T, Thoennissen NH, George A. In vivo deficiency of both C/EBPbeta and C/EBPepsilon results in highly defective myeloid differentiation and lack of cytokine response. PLoS One. 2010;5(11):e15419. PMID 21072215
  20. Gombart AF, Shiohara M, Kwok SH, Agematsu K, Komiyama A, Koeffler HP. Neutrophil-specific granule deficiency: homozygous recessive inheritance of a frameshift mutation in the gene encoding transcription factor CCAAT/enhancer binding protein – epsilon. Blood. 2001;97(9):2561–2567. PMID 11313242
  21. Wang C, Chen J, Sun H, Sun L, Liu Y. CEBPE polymorphism confers an increased risk of childhood acute lymphoblastic leukemia: a meta-analysis of 11 case-control studies with 5,639 cases and 10,036 controls. Ann Hematol. 2015;94(2):181–185. PMID 25195121
  22. Chokkalingam AP, Hsu LI, Metayer C. Genetic variants in ARID5B and CEBPE are childhood ALL susceptibility loci in Hispanics. Cancer Causes Control. 2013;24(10):1789–1795. PMID 23836053
  23. Walsh KM, Chokkalingam AP, Hsu LI. Associations between genome-wide native American ancestry, known risk alleles and B-cell ALL risk in Hispanic children. Leukemia. 2013;27(12):2416–2419. PMID 23615557
  24. Enciso-Mora V, Hosking FJ, Sheridan E. Common genetic variation contributes significantly to the risk of childhood B-cell precursor acute lymphoblastic leukemia. Leukemia. 2012;26(10):2212–2215. PMID 22456626
  25. Migliorini G, Fiege B, Hosking FJ. Variation at 10p12.2 and 10p14 influences risk of childhood B-cell acute lymphoblastic leukemia and phenotype. Blood. 2013;122(19):3298–3307. PMID 23996088