1,104
Views
42
CrossRef citations to date
0
Altmetric
Research Paper

Detection of promoter methylation of tumor suppressor genes in serum DNA of breast cancer cases and benign breast disease controls

, , , , &
Pages 1258-1267 | Received 18 Jun 2012, Accepted 13 Sep 2012, Published online: 17 Sep 2012

Abstract

Tumors are capable of shedding DNA into the blood stream. This shed DNA may be recovered from serum or plasma. The objective of this study was to evaluate whether pyrosequencing promoter DNA in a panel of 12 breast cancer-related genes (APC, BRCA1, CCND2, CDH1, ESR1, GSTP1, HIN1, P16, RARβ, RASSF1, SFRP1 and TWIST) to measure the degree of methylation would lead to a useful serum-based marker of breast cancer. Serum was obtained from women who were about to undergo a breast biopsy or mastectomy at three hospitals from 1977 to 1987 in Grand Rapids, MI USA. We compared the methylation status of 12 genes in serum DNA obtained from three groups of postmenopausal women (mean age at blood collection: 63.0 y; SD 9.9; range 35–91): breast cancer cases with lymph node-positive disease (n = 241); breast cancer cases with lymph node-negative disease (n = 63); and benign breast disease control subjects (n = 234). Overall, median levels of promoter methylation were low, typically below 5%, for all genes in all study groups. For all genes, median levels of methylation were higher (by 3.3 to 47.6%) in lymph node-positive breast cancer cases than in the controls. Comparing mean methylation level between lymph-node positive cases and controls, the most statistically significant findings, after adjustment of the false-positive rate (q-value), were for TWIST (p = 0.04), SFRP1 (p = 0.16), ESR1 (p = 0.17), P16 (p = 0.19) and APC (p = 0.19). For two of these four genes (TWIST, P16), the median methylation level was also highest in lymph-node positive cases, intermediate in lymph node-negative cases and lowest in the controls. The percent of study subjects with mean methylation scores ≥ 5% was higher among lymph node-positive cases than controls for ten genes, and significantly higher for HIN1 and TWIST (22.0 vs. 12.2%, p = 0.04 and 37.9 vs. 24.5%, p = 0.004, respectively). Despite relatively consistent variation in methylation patterns among groups, these modest differences did not provide sufficient ability to distinguish between cases and controls in a clinical setting.

Introduction

Aberrant DNA methylation is a common feature of malignant cells and, thus, the methylation status of key cell cycle regulation genes is often proposed as a reliable and sensitive early marker for breast cancer.Citation1,Citation2 Classic tumor suppressor genes and other regulatory genes are frequently silenced by methylation at their promoter regions in cancerous breast tissue.Citation3,Citation4

Promising estimates for sensitivity and specificity for the detection of cancerous breast tissue have been observed using promoter methylation of a small panel of common cancer-related genes to discriminate between normal and diseased tissue. For example, Fackler and colleaguesCitation5 reported a sensitivity for detection of breast cancer of 85% and a specificity of 89% based on a panel of four genes (CCND2, HIN1, RASSF1 and TWIST) in 18 normal mammoplasty and 21 tumor specimens. Similarly, Esteller and colleaguesCitation6 reported a sensitivity of 73% using a different panel of four genes (GSTP1, BRCA1, CDH1 and P16) based on 45 breast cancer tumors. In another report by FacklerCitation7 with five candidate genes evaluated, one or more methylated genes were detected in 100% of invasive lobular cancer and 95% of lobular in situ tumors (total of 103 cases). Collectively, these data indicate that the promoter methylation status of even a relatively small number of genes is sufficient to distinguish breast cancer tissue from normal tissue.

Other data indicate that higher levels of DNA, as much as six times higher on average, can be detected in the blood of breast cancer cases than in the blood of healthy subjects.Citation8 Furthermore, DNA from plasma or serum from breast cancer patients has been suggested to be of tumor origin because in several small studies the promoter methylation patterns in serum or plasma DNA have shown high concordance to those found in corresponding tumors.Citation9 However, the proportion of free circulating DNA that is from the tumor has been reported to vary widely, from less than 0.5% to 90%.Citation10,Citation11 This variability and other technical issues have led some researchers to question the feasibility of developing a circulating DNA marker for early breast cancer.Citation12,Citation13 A number of small studies have directly assessed the ability of promoter methylation status, typically in one to four genes, in serum or plasma DNA to discriminate between breast cancer cases and controls.Citation14 For example, Hoque and colleaguesCitation15 reported a sensitivity of 62% and specificity of 85% with four candidate genes (APC, GSTP1, RASSF1A and RARβ2) using quantitative methylation-specific PCR (QMSP) in a study of 47 breast cancer cases and 38 control subjects. By contrast, a recently published larger study by Brooks and colleaguesCitation16 was not successful in distinguishing between 50 breast cancer cases and 300 cancer-free controls based on the methylation status via QMSP of same four genes that were studied by Hoque and colleagues.Citation15 Based on a re-analysis of a subset of specimens using quantitative pyrosequencing, Brooks and colleaguesCitation16 attributed their inability to detect differences in promoter methylation status between cases and controls to a high prevalence of false-positives using QMSP methods on serum DNA. In contrast to QMSP, pyrosequencing allows for the interrogation of the methylation status of multiple CpG sites, and has an internal control to test for complete bisulfite treatment.

Thus, the objective of the present study was to evaluate whether an expanded panel of 12 tumor suppressor genes (APC, BRCA1, CDH1, CCND2, ESR1, GSTP1, HIN1, p16, RASSF1, RARβ, SFRP1 and TWIST1) frequently found to be methylated in breast cancers but not in normal breast tissue, when combined with the quantitative pyrosequencing methods to assess promoter methylation status, would lead to a sufficiently sensitive and specific serum-based methylation marker. We analyzed promoter methylation in DNA from serum collected at the time of diagnosis from 325 breast cancer cases and 249 benign breast disease controls who donated blood from 1977 to 1987 to the Mayo Serum Bank, a resource established to identify new early markers of breast cancer. We examined promoter methylation in DNA from serum from lymph-node positive and lymph-node negative breast cancer cases separately based on reports that the methylation pattern of tumor suppressor genes may be associated with the clinical characteristics of the tumor.Citation17

Results

Characteristics of study participants

This study compared the promoter methylation status of 12 genes in serum DNA obtained from postmenopausal women in three study groups: breast cancer cases with lymph node-positive disease (n = 241); breast cancer cases with lymph node-negative disease (n = 63); and benign breast disease control subjects (n = 234). A description of selected baseline characteristics for women in each of the study groups is presented in . Overall, the mean age at blood collection was 63.0 y: SD 9.9; range 35–91. On average, lymph node-positive and negative breast cancer cases were six to seven years older than the benign breast disease controls. Lymph-node positive breast cancer cases also tended to be slightly more likely to be nulliparous (18.3 vs 15.8%), to have a later age at first birth (24.2 vs 23.2 y), to have a later age at menopause (47.3 vs 45.7 y) and to have a family history of breast cancer (26.7 vs 24.0%) than benign breast disease controls. Compared with benign breast disease control subjects, lymph node-negative breast cancer cases were also more likely to be nulliparous (24.4% vs. 15.8%), to have a later age at first birth (25.0 vs. 23.2 y), to have a family history of breast cancer (33.9% vs. 24.0%), and to have a later age at menopause (46.6 vs. 45.7 y). However, lymph node-positive breast cancer cases were more likely to have ever smoked than benign breast disease controls (41.0% vs. 33.0%) whereas lymph node-negative cases were less likely to have ever smoked (19.1% vs. 33.0%). The three study groups were very similar in other factors, including age at menarche, body mass index, and number of full-term pregnancies.

Table 1. Selected characteristics of the study population

Comparison of median levels of gene promoter methylation across the three groups

Pyrosequencing analysis was used to assess the percent of DNA that was methylated at specific CpG sites in the promoter regions of 12 genes. The average of the percent methylation over all the promoter CpG sites for each gene (ranging from 3 to 27 CpG sites per gene) was calculated. Median levels of the average DNA methylation for 12 genes for each of the three study groups and the corresponding 10th to 90th percentiles are presented in . Overall, median levels of promoter methylation were very low, typically below 5%, for each gene in all of the study groups. For all of the 12 genes, as hypothesized, median levels of methylation were slightly higher in breast cancer cases with positive lymph nodes than in benign breast disease control subjects (by 3.3% for RASSF1 to 47.7% for SFRP1). In addition, in seven of the 12 genes (BRAC1, CCND2, CDH1, ESR1, HIN1, P16 and TWIST), the median methylation level was also higher in breast cancer cases with negative lymph nodes than in benign breast disease subjects. In four genes (CCND2, HIN1, P16 and TWIST), median levels of methylation were highest in lymph-node positive breast cancer cases, intermediate in the lymph-node negative breast cancer cases and lowest in benign breast disease control subjects. Median levels of the average DNA methylation for 12 genes were also examined by two categories of number of positive nodes (1–2 vs 3+) among the 159 node-positive cases only. In eight of the 12 genes, the median methylation levels were slightly higher in cases with 3+ positive nodes than in cases with 1 or 2 positive nodes, although this difference was statistically significant only for ESR1 (2.7 vs 2.0%).

Table 2. Median methylation levels (average over all CpG sites per gene) by case-control status

Comparison of quartile levels of gene promoter methylation between lymph node-positive breast cancer cases and benign breast disease controls

The distribution of lymph node-positive breast cancer cases in each quartile of gene methylation (quartile cut points were based on the distribution of gene methylation in benign breast disease controls) is presented in . The six most statistically significant findings, after adjustment of the false-positive rate (q-value), were for TWIST (p = 0.04, q = 0.43), SFRP1 (p = 0.16, q = 0.43), ESR1 (p = 0.17, q = 0.43), P16 (p = 0.19, q = 0.43), APC (p = 0.19, q = 0.43) and HIN1 (p = 0.21, q = 0.43). Odds ratios and 95% confidence intervals comparing the three highest quartiles to lowest quartile of gene methylation for lymph node-positive breast cancer cases compared with benign breast disease controls are also shown. For APC and TWIST, the odds ratios comparing the highest to lowest quartiles were elevated and marginally statistically significant. Specifically, the OR comparing the highest to lowest quartile of gene methylation for TWIST was 1.75 (95% CI = 1.0–3.0). The comparable figure for APC was 1.78 (95% CI = 1.0–3.0). For ESR1 and SFRP1, the odds ratios in the top three quartiles compared with the lowest quartile all tended to be elevated; only the top quartile for ESR was marginally statistically significant (RR = 1.77; 95% CI = 1.0–3.1). For two of the remaining genes (BRCA1 and CCND2), the highest odds ratio was also observed in the top quartile.

Table 3. Association between quartile levels of mean methylation scores and node-positive breast cancer

Results were generally similar when we examined each gene separately among women with 1–2 and 3+ positive nodes compared with benign breast disease controls. For example, the three most statistically significant genes after adjustment for the Q-value were TWIST, SFRP1 and ESR1 for women with 1–2 positive nodes and ESR1, APC and TWIST for women with 3 or more positive nodes. Finally, risk estimates presented in for node-positive breast cancer cases did not change meaningfully after adjustment for age at blood collection.

Table 4. Association between quartile levels of mean methylation scores and node-negative breast cancer

Comparison of quartile levels of gene methylation between lymph-node negative breast cancer cases and benign breast disease controls

The distribution of lymph-node negative breast cancer cases in each quartile of gene methylation (quartile cutpoints were based on the distribution of gene methylation in benign breast disease controls) is presented in . Odds ratios and 95% confidence intervals comparing the three highest quartiles to lowest quartile of gene methylation for lymph-node negative breast cancer cases compared with benign breast disease controls are also shown. The two most statistically significant findings, after adjustment of the false-positive rate (q-value), were for TWIST (p = 0.04) and HIN1 (p = 0.003). Finally, risk estimates presented in for node-negative breast cancer cases did not change meaningfully after adjustment for age at blood collection

Comparison of gene methylation levels of 5% or higher across study groups

presents the distribution of women in each of the three study groups with a methylation level of at least five percent for specific genes. The proportion of individuals with ≥ 5% methylation was higher in lymph-node positive breast cancer cases than in benign breast disease control subjects for ten of 12 genes. However, for only two genes, HIN1 and TWIST, was the proportion of subjects with methylation scores of 5% or greater statistically significantly higher in the node-positive breast cancer cases as compared with controls (22.0 vs 12.2%, p = 0.04; 37.9 vs 24.5%, p = 0.004, respectively). For all but four genes (APC, GSTP1 P16 and SFRP1), the proportion of individuals with mean methylation scores of ≥ 5% was higher in lymph-node-negative breast cancer cases than in benign breast disease control subjects but there were no statistically significant differences.

Table 5. Number and percent of study subjects with mean methylation scores of 5% or more for each gene among the three study groups

We also considered the extent to which the methylation pattern in a combination of genes could distinguish between breast cancer cases from control subjects. The Random Forests statistical algorithm was applied to assess the discriminating ability of the multiple gene panel of DNA methylation profiles in distinguishing the node-positive cases from the controls.Citation18 Genes with greater than 25% of subjects missing methylation values (CCND2, HIN1 and BRAC1) were excluded from the analysis. Subjects missing any values for the remaining nine genes were excluded—this resulted in the inclusion of 103 controls and 97 node-positive cases in the analysis. The out of bag error rate of the Random Forests classifier in predicting case/control status was 0.45, with a corresponding area under the curve (AUC) estimate of 0.45. A permutation test of the statistical significance of this classifier when compared with random chance resulted in a p value of 0.15. These results from the Random Forests classifier trained on methylation profiles corresponding to the panel of nine genes indicate no significant improvement in predictive power when compared with random chance.

Discussion

We conducted a case-control study to examine the extent to which promoter methylation of 12 breast cancer-related genes could serve as a marker for the early detection of breast cancer. To our knowledge, this is the first such study to use pyrosequencing to detect DNA methylation of cancer-related genes in serum. Serum was obtained from women who donated blood to the Mayo Serum Bank during 1977 to 1987 prior to undergoing a breast biopsy or mastectomy. We selected a panel of 12 tumor suppressor genes (APC, BRCA1, CDH1, CCND2, ESR1, GSTP1, HIN1, p16, RASSF1, RARβ, SFRP1 and TWIST1) frequently found to be methylated in breast cancers but not in normal breast tissue,Citation3 including several genes that have been observed to be methylated in the serum of breast cancer patients.Citation15 We found evidence that median levels of promoter methylation in serum DNA were modestly higher in breast cancer cases than benign breast disease controls, and higher in node-positive breast cancer cases than in node-negative cases. Further, analyses focused on the distribution of lymph node-positive cases vs. benign breast disease controls in each quartile level of gene promoter methylation found the most statistically significant results for TWIST, SFRP, ESR1, P16, APC and HIN1 with p values ranging between 0.04 and 0.21. Additionally, we found the proportion of individuals with ≥ 5% methylation to be higher in lymph-node positive breast cancer cases than in benign breast disease control subjects for ten of 12 genes. Despite the modestly higher methylation patterns we observed in cases compared with controls, our ability to discriminate between cases and controls was no better than random chance.

As reviewed by others,Citation14,Citation19 a number of studies using primarily methyl-specific PCR or MethylLight assays have reported an association between serum DNA promoter methylation and breast cancer. However, most of these studies have had a small sample size, several have not included a control group, and the sensitivity estimates for individual genes have varied widely. Among the larger studies, Hoque and colleaguesCitation15 reported a sensitivity of 62% and specificity of 87% with four candidate genes (APC, GSTP1, RASSF1A and RARβ2) using quantitative methylation-specific PCR (QMSP) in a study of 47 breast cancer cases and 38 control subjects. In this study, Hoque and colleaguesCitation15 reported that only 13% of the controls had methylation in any of these four tumor-related genes. By contrast, a more recent study by the same groupCitation20 reported that nearly 50% of 157 cancer-free individuals had one or more of six tumor-related genes (APC, GSTP1, RARβ2, CCND2, MGMT and P16) methylated in serum using QMSP, including 22% in RARβ2 alone. These latter two studies highlight the variability in percent methylation in serum DNA in patients and controls that has been observed across studies. A recent case-control study nested within a prospective cohort by Brooks and colleagues found no ability to discriminate breast cancer cases diagnosed in the prior six months (n = 50) from control subjects (n = 150) using the same gene panel as Hoque and colleagues.Citation15,Citation16 The authors found that the methylation levels in breast cancer cases were less than expected but methylation levels in control subjects were higher than expected. Brooks and colleaguesCitation16 utilized quantitative methylation specific PCR (QMSP) for the main analysis but reanalyzed a small subset of samples using pyrosequencing. Because pyrosequencing frequently did not confirm their methylation results as detected by QMSP, Brooks and colleaguesCitation16 concluded that their main analysis contained a substantial number of false positives. Further, the overall null findings were attributed to non-specific amplification resulting in high-false positives, and the false negatives to the low amount of DNA template. In the present study, we employed pyrosequencing to examine methylation in serum in 12 genes yet still did not find any significant predictive ability based on methylation patterns.

There are several potential explanations for the low predictive ability we observed in our study. The first and most likely explanation is the lack of precision of the pyrosequencing assays. Indeed, the coefficients of variation (CVs) for the 0% methylation control sample across plates were poor, and even for the 50% methylation control sample we had less than desired CVs (8.5–34%).

Another potential explanation for the limited predictive power we observed is that the quality and/or quantity of the DNA in the samples could have been insufficient for accurately measuring methylation. For most of the genes we examined, however, the pyrosequencing pass rate ranged from 70 to 94%, suggesting that the DNA obtained was adequate for PCR amplification and sequencing. Still, the proportion of tumor DNA in serum may be too low to be detected reliably by pyrosequencing of bisulfite-modified DNA.Citation12,Citation13 Zanetti-Dallenbach and colleaguesCitation21 recently demonstrated that the amount of shed DNA in serum is ten times higher than in plasma, likely due to leaked white blood cells during the coagulation that occurs in the collection of serum. Thus, in the present analysis, white blood cell DNA could have diluted the methylation signal of the shed tumor DNA in serum.

Although it is also possible that promoter methylation status of another set of genes might more clearly distinguish between breast cancer cases and controls, the panel we selected was based on prior evidence that the included genes are frequently methylated in breast cancer tissue but not in normal tissue.Citation3,Citation4 Furthermore, many of these genes have also been shown in several small studies to be more frequently methylated in the serum of breast cancer cases than that of controls.Citation14,Citation15

A special aspect in our study that deserves additional consideration is that all of the controls had a prior breast biopsy with evidence of benign breast disease. Some limited evidence indicates that increased DNA methylation of some tumor suppressor genes is detectable in benign breast disease.Citation22 The types of benign breast disease included in our study have only been minimally associated with breast cancer.Citation23 The large, well-matched control group is a strength of our study. If it were in fact true that our generally null findings reflect our choice of women with benign breast disease as controls, our findings would still imply that this panel of genes would not be useful in a clinical setting because benign breast disease is a common condition.

In summary, we were able to detect modest differences in promoter methylation in serum DNA in breast cancer cases and controls. Our findings and those of the recent study by Brooks and colleaguesCitation16 emphasize the need to demonstrate the reliability and validity of the methods being proposed to measure low levels of promoter methylation in the small quantities of tumor-derived DNA in circulation, as well as the need to design sufficiently sized prospective studies with well-selected control groups.

Materials and Methods

Between 1977 and 1987, all patients about to undergo breast biopsy or mastectomy in three hospitals in Grand Rapids, Michigan were invited to provide serum as part of a study to assess new putative breast cancer markers. Participants provided written informed consent and completed an in-person interview assessing breast cancer risk factors. Information on pathology and extent of disease was abstracted from medical records. Serum was collected prior to the day of surgery or on the day of surgery for all study subjects. Before surgery, 5,358 volunteers donated 20 mL of non-fasting blood that was collected in sterile vacutainers, immediately chilled, and allowed to clot. The serum was separated within two hours and then divided into 1 mL aliquots and stored at -70°C in sealed glass vials. Specimens were then shipped frozen, in containers with dry ice, first to a central repository at the Mayo Foundation (Rochester MN) and subsequently to the National Cancer Institute and stored at -70°C to -85°C at both locations.Citation24

For the present study, eligible women were postmenopausal, had no prior cancer or diabetes, and were not currently using hormone replacement therapy or oral contraceptives at the time of the blood draw. In addition, individuals had to have 2 mL or more of serum remaining in the bank. A total of 325 breast cancer cases (69 node-negative breast cancer cases and 256 primarily node positive breast cancer cases) and 249 control subjects were selected for this study. The mean time from the date of malignant diagnosis and blood collection was 0.46 d (95% CI: -0.7, 1.6) for node-negative cases and 0.4 d (95% CI: -3.0, 3.9) for node-positive cases. Because the cases were diagnosed in the 1970s, data on estrogen receptor, progesterone receptor and HER2/neu receptor status were not available. Control subjects had benign breast disease conditions that have been linked to no or minimally increased risk of breast cancer.Citation23 Benign conditions included 71 women with non-proliferative benign changes (e.g., atrophic lobules, apocrine metaplasia) and 178 women who had low-risk proliferative conditions (e.g., cysts, fibroadenoma and mild hyperplasia).

Analysis of DNA methylation

DNA methylation analyses were conducted by EpigenDX Inc. Serum specimens were shipped on dry ice from the National Cancer Institute to EpigenDX Inc. The laboratory was blinded as to case-control status of the study subjects. DNA was extracted from 500 µL of serum and eluted in 30 µL of buffer; estimated to contain between 10 to 50 ng of DNA all of which was modified with bisulfite treatment (during which unmethylated cytosines are converted to uracil) and eluted in 18 to 20 µL of buffer. One µL of bisulfite-modified DNA was used for each gene-specific amplification and 10 µl of the PCR products were sequenced by Pyrosequencing PSQ96 HS System (Qiagen). If the pyrosequencing failed due to low signal, a nested PCR reaction was done using the DNA from the initial PCR reaction and a second pyrosequencing was conducted. The methylation status of each locus (see for number of CpG sites per gene) was analyzed individually as a T/C SNP using QCpG software (Qiagen). Resulting pyrograms and percent methylation scores for each CpG site were received from EpigenDX, Inc. Each pyrogram was visually inspected for quality controls including bisulfite-conversion, expected sequence order and peak height. Data from only those pyrograms that passed all quality controls were included in the analyses. The pass rate ranged from 70% to 94%.

For the methylation analysis of each gene, serum samples from cases and control subjects were arranged randomly on eight 96-well plates (the same order of cases and controls for each gene). Due to technical laboratory issues, 36 study subjects on the first plate for each gene were omitted from all analyses. Thus, the final sample size consisted of 304 breast cancer cases (241 with primarily node-positive and 63 with node-negative disease) and 234 control subjects. Of the 241 primarily node-positive cases, 159 had at least one documented positive node, and the node status on the remaining 82 breast cancer cases was uncertain. Thus, the node-positive group may include some individuals who were node-negative. The mean time between the original date of blood collection and the methylation analysis was 29.4, 28.2 and 28.8 y for node-positive breast cancer cases, node negative breast cancer cases, and control subjects, respectively.

The number of CpG sites interrogated per gene varied from a low of 3 for RARβ to a high of 27 for HIN1. As displayed in , the percentage of subjects with available gene methylation data ranged from a low of 51.1, 58.5, and 56.0% for HIN1 to 88.9, 92.5 and 94.9% for RARβ for node-positive breast cancer cases, node-negative breast cancer cases and benign breast disease control subjects, respectively.

As part of the quality control procedure, 0%, 50% and 100% methylated DNA samples (provided by the laboratory) were included on each plate. The median (range) of the gene-specific coefficient of variation (CV) among the replicate measurements of the 0% methylated DNA sample was 52.5% (12–139%). The median (range) of CV among the replicate measurements of the 50% DNA methylated sample was 27% (8.5–34%). Lastly, among the replicates of the 100% DNA methylated sample, the median (range) of CV was 3.9% (0.68- 19%). For all genes, the replicate measurements of the 100% methylated sample resulted in the lowest CV.

Statistical analysis

Evaluation of the association between case-control status and the percent of subjects in each quartile of mean methylation levels for each gene was based on logistic regression models. Two separate sets of analyses were conducted to compare (1) the controls to the node-positive subjects, and (2) the controls to the node-negative subjects. The mean methylation levels across the CpG sites in each gene were categorized into quartiles based on the distribution in the control subjects. Two-sided p values were calculated based on likelihood ratio tests. Univariate models as well as multivariable models adjusted for age were fit.

As several genes were evaluated in this study, adjustment for multiple comparisons was performed based on the q value procedure.Citation25 If all genes below a specific p value threshold are denoted as “true markers,” the associated q value provides a measure of the proportion of false discoveries. This procedure is less conservative than traditional methods, such as the Bonferroni correction, and is more powerful in detecting true associations.

A multivariable analysis assessing the predictive power of the panel of genes was conducted using the Random Forests algorithm.Citation18 The classifier was trained by setting the number of trees to 10,000 and by considering a random subset of 3 genes at each split of every tree. The predictive power of the panel of genes was assessed based on the proportion of subjects that were correctly classified as cases or controls, among samples left out of the training process (out of bag samples). The predictive power of the panel of genes was assessed based on the proportion of subjects that were correctly classified as cases or controls (i.e., sensitivity and specificity) among samples left out of the training process (out of bag samples). A p value denoting the statistical significance of the Random Forests classifier was calculated by a permutation test—the subjects’ case/control labels were randomly permuted 100 times and the distribution of the out of bag error rate was estimated. The p value corresponding to the Random Forests classifier was calculated as the proportion of random permutations in which the out of bag error rate was equal to or lower than the observed out-of-bag error rate.

Box plots showing the distribution of mean percent methylation levels for each of the 12 genes among the control, node-negative and node-positive subjects are included in the Supplemental Material. The lines at the bottom, middle and top of each box denote the 25th, 50th (median) and 75th percentiles of each distribution. Circles represent outliers in both tails of the distribution.

Supplemental material

Additional material

Download Zip (218.3 KB)

Acknowledgments

This research was supported by a grant from Susan G. Komen for the Cure.

Disclosure of Potential Conflicts of Interest

No potential conflicts of interest were disclosed.

Supplemental Materials

Supplemental materials may be found here: www.landesbioscience.com/journals/epigenetics/article/22220

References

  • Levenson VV. Biomarkers for early detection of breast cancer: what, when, and where?. Biochim Biophys Acta 2007; 1770:847 - 56; http://dx.doi.org/10.1016/j.bbagen.2007.01.017; PMID: 17368950
  • Suijkerbuijk KP, van Diest PJ, van der Wall E. Improving early breast cancer detection: focus on methylation. Ann Oncol 2011; 22:24 - 9; http://dx.doi.org/10.1093/annonc/mdq305; PMID: 20591821
  • Widschwendter M, Jones PA. DNA methylation and breast carcinogenesis. Oncogene 2002; 21:5462 - 82; http://dx.doi.org/10.1038/sj.onc.1205606; PMID: 12154408
  • Campan M, Weisenberger DJ, Laird PW. DNA methylation profiles of female steroid hormone-driven human malignancies. Curr Top Microbiol Immunol 2006; 310:141 - 78; http://dx.doi.org/10.1007/3-540-31181-5_8; PMID: 16909910
  • Fackler MJ, McVeigh M, Mehrotra J, Blum MA, Lange J, Lapides A, et al. Quantitative multiplex methylation-specific PCR assay for the detection of promoter hypermethylation in multiple genes in breast cancer. Cancer Res 2004; 64:4442 - 52; http://dx.doi.org/10.1158/0008-5472.CAN-03-3341; PMID: 15231653
  • Esteller M, Corn PG, Baylin SB, Herman JG. A gene hypermethylation profile of human cancer. Cancer Res 2001; 61:3225 - 9; PMID: 11309270
  • Fackler MJ, McVeigh M, Evron E, Garrett E, Mehrotra J, Polyak K, et al. DNA methylation of RASSF1A, HIN-1, RAR-beta, Cyclin D2 and Twist in in situ and invasive lobular breast carcinoma. Int J Cancer 2003; 107:970 - 5; http://dx.doi.org/10.1002/ijc.11508; PMID: 14601057
  • Gormally E, Caboux E, Vineis P, Hainaut P. Circulating free DNA in plasma or serum as biomarker of carcinogenesis: practical aspects and biological significance. Mutat Res 2007; 635:105 - 17; http://dx.doi.org/10.1016/j.mrrev.2006.11.002; PMID: 17257890
  • Dulaimi E, Hillinck J, Ibanez de Caceres I, Al-Saleem T, Cairns P. Tumor suppressor gene promoter hypermethylation in serum of breast cancer patients. Clin Cancer Res 2004; 10:6189 - 93; http://dx.doi.org/10.1158/1078-0432.CCR-04-0597; PMID: 15448006
  • Hibi K, Nakao A. [Molecular detection of tumor DNA in the serum of colorectal cancer patients]. Nihon Rinsho 2003; 61:Suppl 7 197 - 200; PMID: 14574881
  • Jahr S, Hentze H, Englisch S, Hardt D, Fackelmayer FO, Hesch RD, et al. DNA fragments in the blood plasma of cancer patients: quantitations and evidence for their origin from apoptotic and necrotic cells. Cancer Res 2001; 61:1659 - 65; PMID: 11245480
  • Kohler C, Barekati Z, Radpour R, Zhong XY. Cell-free DNA in the circulation as a potential cancer biomarker. Anticancer Res 2011; 31:2623 - 8; PMID: 21778314
  • van der Vaart M, Pretorius PJ. Is the role of circulating DNA as a biomarker of cancer being prematurely overrated?. Clin Biochem 2010; 43:26 - 36; http://dx.doi.org/10.1016/j.clinbiochem.2009.08.027; PMID: 19747472
  • Brooks J, Cairns P, Zeleniuch-Jacquotte A. Promoter methylation and the detection of breast cancer. Cancer Causes Control 2009; 20:1539 - 50; http://dx.doi.org/10.1007/s10552-009-9415-y; PMID: 19768562
  • Hoque MO, Feng Q, Toure P, Dem A, Critchlow CW, Hawes SE, et al. Detection of aberrant methylation of four genes in plasma DNA for the detection of breast cancer. J Clin Oncol 2006; 24:4262 - 9; http://dx.doi.org/10.1200/JCO.2005.01.3516; PMID: 16908936
  • Brooks JD, Cairns P, Shore RE, Klein CB, Wirgin I, Afanasyeva Y, et al. DNA methylation in pre-diagnostic serum samples of breast cancer cases: results of a nested case-control study. Cancer Epidemiol 2010; 34:717 - 23; http://dx.doi.org/10.1016/j.canep.2010.05.006; PMID: 20627767
  • Swift-Scanlan T, Vang R, Blackford A, Fackler MJ, Sukumar S. Methylated genes in breast cancer: associations with clinical and histopathological features in a familial breast cancer cohort. Cancer Biol Ther 2011; 11:853 - 65; http://dx.doi.org/10.4161/cbt.11.10.15177; PMID: 21383541
  • Breiman L. Random forests. Mach Learn 2001; 45:5; http://dx.doi.org/10.1023/A:1010933404324
  • Van De Voorde L, Speeckaert R, Van Gestel D, Bracke M, De Neve W, Delanghe J, et al. DNA methylation-based biomarkers in serum of patients with breast cancer. Mutat Res 2012; In press; PMID: 22698615
  • Brait M, Ford JG, Papaiahgari S, Garza MA, Lee JI, Loyo M, et al. Association between lifestyle factors and CpG island methylation in a cancer-free population. Cancer Epidemiol Biomarkers Prev 2009; 18:2984 - 91; http://dx.doi.org/10.1158/1055-9965.EPI-08-1245; PMID: 19861513
  • Zanetti-Dällenbach R, Wight E, Fan AX, Lapaire O, Hahn S, Holzgreve W, et al. Positive correlation of cell-free DNA in plasma/serum in patients with malignant and benign breast disease. Anticancer Res 2008; 28:2A 921 - 5; PMID: 18507037
  • Euhus DM, Bu D, Milchgrub S, Xie XJ, Bian A, Leitch AM, et al. DNA methylation in benign breast epithelium in relation to age and breast cancer risk. Cancer Epidemiol Biomarkers Prev 2008; 17:1051 - 9; http://dx.doi.org/10.1158/1055-9965.EPI-07-2582; PMID: 18483325
  • Fitzgibbons PL, Henson DE, Hutter RV, Cancer Committee of the College of American Pathologists. Benign breast changes and the risk for subsequent breast cancer: an update of the 1985 consensus statement. Arch Pathol Lab Med 1998; 122:1053 - 5; PMID: 9870852
  • Masnyk IJ, Sears M, Corle D, Go VL, Dunsmore M. Serum bank for biological markers for breast cancer. J Natl Cancer Inst 1983; 71:875 - 6; PMID: 6578377
  • Storey JD. The positive false discovery rate. J R Stat Soc Series B Stat Methodol 2002; 64:479 - 98; http://dx.doi.org/10.1111/1467-9868.00346

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.