2,384
Views
3
CrossRef citations to date
0
Altmetric
Research Paper

Identification of significant genes and therapeutic agents for breast cancer by integrated genomics

, , ORCID Icon, , , & show all
Pages 2140-2154 | Received 24 Mar 2021, Accepted 13 May 2021, Published online: 21 Jun 2021

ABSTRACT

Breast cancer is the most commonly diagnosed malignancy in women; thus, more cancer prevention research is urgently needed. The aim of this study was to predict potential therapeutic agents for breast cancer and determine their molecular mechanisms using integrated bioinformatics. Summary data from a large genome-wide association study of breast cancer was derived from the UK Biobank. The gene expression profile of breast cancer was from the Oncomine database. We performed a network-wide association study and gene set enrichment analysis to identify the significant genes in breast cancer. Then, we performed Gene Ontology analysis using the STRING database and conducted Kyoto Encyclopedia of Genes and Genomes pathway analysis using Cytoscape software. We verified our results using the Gene Expression Profile Interactive Analysis, PROgeneV2, and Human Protein Atlas databases. Connectivity map analysis was used to identify small-molecule compounds that are potential therapeutic agents for breast cancer. We identified 10 significant genes in breast cancer based on the gene expression profile and genome-wide association study. A total of 65 small-molecule compounds were found to be potential therapeutic agents for breast cancer.

Graphical Abstract

1. Introduction

Breast cancer is a frequently diagnosed cancer in women with a family history [Citation1]. Breast cancer is a heterogeneous disease with different molecular subtypes and biological behaviors. Gene microarray technology and immunohistochemical techniques have classified breast cancers into different types [Citation2]. The estrogen receptor (ER) is the most important prognostic and predictive immunohistochemical marker in breast cancer. ER-negative tumors tend to be of higher histological grade, are more sensitive to chemotherapy, and are more likely to metastasize to visceral organs [Citation3,Citation4]. Breast cancer does not have a poor prognosis, and there is no lack of therapeutic targets. ER positive tumors represent about 70% of all breast cancers and there are a lot of therapeutic targets, as well as for HER2 positive breast cancer (about 20% of all BC). The only subtype lacking for target therapies is the triple negative subtype [Citation5,Citation6]. There is an urgent need to find available drugs and clarify their molecular mechanisms in breast cancer treatment.

Most previous studies have focused on identifying novel prognostic markers and drug targets for breast cancer [Citation7–9]. Sulaiman et al. [Citation10] reported that a synthetic azaspirane targets the Janus kinase/signal transducer and activation of transcription 3 pathway in breast cancer. Huang et al. [Citation11] demonstrated that the Gαh-PLCδ1 signaling axis drives metastatic progression in breast cancer. However, due to toxicity, cost, the chemical effects of novel prognostic markers and drug targets for breast cancer that need further research [Citation12], not all previous findings contribute to breast cancer treatment; breast cancer still lacks therapeutic targets and with poorer prognosis. And there is still an urgent need to identify additional therapeutic and prognostic targets in breast cancer [Citation13].

Genome-wide association studies (GWAS) are widely used to characterize the genetic mechanisms that underlie complex diseases. Integrative analyses of GWAS data are rapidly becoming a standard approach to explore the genetic basis of disease susceptibility [Citation14]. Network-wide association studies (NetWAS) can identify relevant disease-gene associations by integrating tissue-specific networks and GWAS results [Citation15,Citation16]. Prior studies have shown that the network-associated analysis of GWAS data is highly efficient when used to identify novel causal genes of complex diseases [Citation17,Citation18].

In this study, to better understand the molecular mechanisms and find therapeutic agents for breast cancer, we identified novel candidate therapeutic agents for breast cancer treatment by integrating genomic data with drug database analysis. In total, 65 small-molecule compounds were identified, including trichostatin A, LY-294,002, econazole, prestwick-1082, and vorinostat. Our study demonstrates the usefulness of this approach for evaluating the relationship among genes, diseases, and drugs. These findings will pave the way for the discovery of potential therapeutic targets for breast cancer.

2. Methods

2.1 Summary of GWAS datasets in breast cancer

The UK Biobank is a large, population-based prospective UK study, which was established to identify genetic and nongenetic determinants of various diseases. It comprises approximately 500,000 individuals with extensively detailed phenotypes. Their genotypes were determined using an array that included 847,441 genetic polymorphisms, enabling the identification of novel genetic variants in a uniformly genotyped and phenotyped cohort of unprecedented size [Citation19]. Using data from the UK Biobank, samples from the participants were genotyped on the UK Biobank Axiom array and UK BiLEVE custom array. Genotype imputation was conducted with IMPUTE software against the UK10K haplotype panel and the 1000 Genomes Project phase 3 panel. GWAS analysis was performed by SNPTEST using a logistic regression model. A genome-wide gene-association study was performed using the MAGMA gene analysis tool, and multiple genes and genetic variants were identified. The Icelandic GWAS dataset from the deCODE Genetics genealogical database was based on whole-genome sequencing using Illumina technology. Finally, meta-analysis of small nucleotide polymorphisms (SNPs) in the UK Biobank and deCODE sample was performed using the METAL analysis tool [Citation20].

The atlas of genetic associations in the UK Biobank (GeneATLAS, http://geneatlas.roslin.ed.ac.uk) helps researchers effectively analyze UK Biobank results without high computational costs. It also allows users to query genome-wide association results for 9,113,133 genetic variants and download GWAS summary statistics for more than 30 million imputed genetic variants (>23 billion phenotype–genotype pairs) [Citation21]. We downloaded large-scale GWAS breast cancer summary data from the atlas of genetic associations. Detailed descriptions of sample characteristics, experimental designs, statistical analyses, and quality control can be found in previous studies.

2.2 Gene expression datasets

Oncomine (https://www.oncomine.org) is a cancer microarray database and web-based data mining platform for facilitating discovery. In this study, differentially expressed genes (DEGs) in breast cancer were identified by comparing cancer samples to respective normal samples using the Oncomine database. The heatmap of significant DEGs in breast cancer was driven from the Oncomine.

2.3 Identification of significant genes in breast cancer

NetWAS (https://hb.flatironinstitute.org/netwas/) integrates tissue-specific networks and significant GWAS association results, and identifies relevant disease-gene associations based on genomics. Briefly, SNP-level association statistics were converted into gene-level statistics (gene-based P values), which then were integrated with tissue-specific networks to predict the causal genes [Citation18]. Greene et al. [Citation13] demonstrated that NetWAS is more accurate than GWAS alone. In this study, we identified the most relevant genes in breast cancer using NetWAS.

2.4 Kyoto Encyclopedia of Genes and Genomes pathway and Gene Ontology analyses

Cytoscape is one of the most successful network biology analysis and visualization tools. It exposes more than 270 core functions and 34 applications as REST-callable functions with standardized JSON interfaces supported by Swagger documentation [Citation22]. CluePedia, a plug-in in Cytoscape, can search for certain Kyoto Encyclopedia of Genes and Genomes (KEGG) signaling pathways of certain genes by calculating linear and nonlinear statistical dependencies from experimental data [Citation23]. KEGG signaling pathways were identified by CluePedia. Search Tool for the Retrieval of Interacting Genes (STRING) (https://string-db.org/cgi/input.pl) is an online tool that for Gene ontology (GO) analysis in gene sets [Citation24,Citation25]. GO is a commonly used bioinformatics tool that provides comprehensive information on the gene function of individual genomic products based on defined features consisting of three domains: biological process (BP), cellular component (CC), and molecular function (MF) [Citation26]. We conducted GO analysis using the STRING database.

2.5 Analysis of the correlation between significant genes and breast cancer

Gene Expression Profiling Interactive Analysis (GEPIA, http://gepia.cancer-pku.cn) is a web server for analyzing RNA-sequencing expression data of 9,736 tumors and 8,587 normal samples from The Cancer Genome Atlas and Genotype-Tissue Expression projects, using a standard processing pipeline [Citation27]. The Human Protein Atlas (HPA, www.proteinatlas.org) is an immunohistochemistry-based map of protein expression profiles in normal tissues, cancer tissues, and cell lines, and provides a resource for pathology-based biomedical research, including protein biomarker discovery [Citation28–30]. Correlations between significant genes and breast cancers were analyzed with GEPIA and HPA.

2.6 Analysis of the correlation between significant gene expression and overall survival

PROGgeneV2 (http://www.compbio.iupui.edu/proggene), a tool that can be used to predict the prognostic implication of genes in cancers, is written in PHP5 with a MySQL database backend, which stores gene expression data, covariates data, and metadata for cataloged studies in the form of relational database tables. Survival analysis in PROGgeneV2 is done using the backend R script; users can input multiple genes and use combined analysis to create survival plots for different genes of interest [Citation31]. We used PROGgeneV2 to analyze the relationship between overall survival and genes that were overexpressed and underexpressed in breast cancer.

2.7 Drug prediction analysis

CMap (https://portals.broadinstitute.org/cmap/) is a collection of genome-wide transcriptional expression data from cultured human cells treated with bioactive small molecules and simple pattern-matching algorithms that together enable the discovery of functional connections among drugs, genes, and diseases through the transitory feature of common gene expression changes [Citation32–34]. We used CMap to identify small-molecule compounds as potential therapeutic agents to target the significant genes in breast cancer.

3. Results

3.1 Identification of significant DEGs in breast cancer

To identify the significant DEGs in breast cancer, we retrieved GWAS summary data (C50-C50) of breast cancer from the UK Biobank, and microarray expression profiles of breast cancer from the Oncomine database. C50-C50 contained 10,478 malignant neoplasm of breast cases and 235,016 controls for the analyses, and the data were consolidated and normalized ().

Figure 1. A) Q-Q Plot of C50-C50 (β = 1.05, λmean = 1.07, λmedian = 1.06) Containing 9,113,133 imputed variants that passed quality control (QC), with a P different than 0; b) Manhattan PLOT (IMPUTED) CONTAINING all (QC and non-QC) 30,798,054 imputed variants; c) Venn diagram of significant genes (CLDN7,MLLT10,RBM33,SH3RF1,SSBP4, UBE2Z,BMPER,FGF7,MSRB3,TNRC6B) in breast cancer (BC)

Figure 1. A) Q-Q Plot of C50-C50 (β = 1.05, λmean = 1.07, λmedian = 1.06) Containing 9,113,133 imputed variants that passed quality control (QC), with a P different than 0; b) Manhattan PLOT (IMPUTED) CONTAINING all (QC and non-QC) 30,798,054 imputed variants; c) Venn diagram of significant genes (CLDN7,MLLT10,RBM33,SH3RF1,SSBP4, UBE2Z,BMPER,FGF7,MSRB3,TNRC6B) in breast cancer (BC)

From NetWAS of GWAS summary data, we converted SNP-level association statistics into gene-level statistics (gene-based P values) and identified the 127 most relevant genes in breast cancer (). A total of 1019 overexpressed genes (Supplementary Figure 1) and 1019 underexpressed genes (Supplementary Figure 2) were identified by Oncomine. The top 20 DEGs in breast cancer compared to the normal controls are shown in a heatmap ().

Figure 2. Heatmap of significant different expression genes in BC. a)Top 20 Over-expressed genes; b) Top 20 Under-expressed genes

Figure 2. Heatmap of significant different expression genes in BC. a)Top 20 Over-expressed genes; b) Top 20 Under-expressed genes

Table 1. Significant genes in breast cancer identified using NetWAS

After overlapping the 127 most relevant genes with the 2038 DEGs in breast cancer, we identified 10 significant genes (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, UBE2Z, BMPER, FGF7, MSRB3, and TNRC6B) in breast cancer (). Among them, CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, and UBE2Z were overexpressed; and BMPER, FGF7, MSRB3, and TNRC6B were underexpressed.

3.2 GO and KEGG enrichment analyses of significant DEGs in breast cancer

To explore the roles of the significant DEGs in breast cancer, we played GO and KEGG enrichment analyses. BP analysis revealed that the significant genes in breast cancer were mainly enriched in the Wnt signaling pathway, calcium-modulating pathway, protein repair, gene silencing by microRNA (miRNA), mRNA cleavage involved in gene silencing by miRNA, and positive regulation of epithelial cell proliferation involved in lung morphogenesis (). MF analysis showed that significant genes were enriched in functions related to oxidoreductase activity, acting on a sulfur group of donors and disulfide as acceptor, and phosphoinositide 3-kinase (PI3K) and PIK3CA activities (). CC analysis showed that significant genes were enriched in P-bodies. KEGG analysis revealed that significant genes in breast cancer were mainly involved in pathways in cancer, breast cancer, gastric cancer, melanoma, the PI3K/Akt signaling pathway, mitogen-activated protein kinase (MAPK) signaling pathway, Ras signaling pathway, tight junctions, and ubiquitin-mediated proteolysis ().

Figure 3 KEGG pathway analysis of significant genes (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, UBE2Z, BMPER, FGF7, MSRB3 and TNRC6B) in BC

Figure 3 KEGG pathway analysis of significant genes (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, UBE2Z, BMPER, FGF7, MSRB3 and TNRC6B) in BC

Table 2. Gene ontology (GO) enrichment result of significant genes in breast cancer

3.3 Correlation between significant DEGs and breast cancer

To verify the significant DEGs of breast cancer, we further explore the DEGs. Consistent with the identification of significant genes, protein profiling in breast cancer samples from the HPA using immunohistochemistry showed that the gene expression of CLDN7, RBM33, SH3RF1, and UBE2Z was significantly enriched in breast cancer, whereas there was no significant enrichment of FGF7 and TNRC6B ().

Figure 4. The immunohistochemistry of significant genes(CLDN7, RBM33, SH3RF1, UBE2Z, FGF7 and TNRC6B) in BC

Figure 4. The immunohistochemistry of significant genes(CLDN7, RBM33, SH3RF1, UBE2Z, FGF7 and TNRC6B) in BC

The significant DEGs (CLDN7, BMPER, FGF7, MSRB3) in breast cancer samples compared to normal samples also showed coincident results of significant gene identification ().

Figure 5. The different expression of significant genes genes(CLDN7, BMPER, FGF7 and MSRB3) in breast cancer samples to normal samples, the red box mean in breast cancer samples and the black box mean in normal samples. *:p < 0.05

Figure 5. The different expression of significant genes genes(CLDN7, BMPER, FGF7 and MSRB3) in breast cancer samples to normal samples, the red box mean in breast cancer samples and the black box mean in normal samples. *:p < 0.05

3.4 Correlation between overall survival and significant DEGs in breast cancer

In the analysis of the correlation between overall survival and significant DEG expression (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, and UBE2Z) in breast cancer, we found a shorter survival time based on GSE5881 () and GSE42568 () (P< 0.05). The significantly underexpressed genes (BMPER, FGF7, MSRB3, and TNRC6B) in breast cancer were correlated with a longer survival time based on GSE42568 () and GSE37751 () (P> 0.05), this correlation cannot be demonstrated. The overall survival analysis combined significantly underexpressed genes (BMPER, MSRB3, and TNRC6B) in breast cancer based on GSE1456_U133B () and GSE3494_U133B (); combined FGF7 and TNRC6B based on GSE3494_U133A (); and combined single FGF7 based on GSE9893 () were also correlated with longer survival in breast cancer (P< 0.05).

Figure 6. Overall survival analysis combined multiple genes expression. a)Combined significant Over-expressed genes(CLDN7, MLLT10, RBM33, SH3RF1, SSBP4 and UBE2Z) in BC based on GSE58812; b)Combined significant Over-expressed genes(CLDN7, MLLT10, RBM33, SH3RF1, SSBP4 and UBE2Z) in BC based on GSE42568; c)Combined significant under-expressed genes(BMPER, FGF7, MSRB3 and TNRC6B) in BC based on GSE42568; d)Combined BMPER,FGF7,MSRB3 and TNRC6B in BC based on GSE37751; e)Combined BMPER,MSRB3 and TNRC6B in BC based on GSE1456_U133B; f)Combined BMPER,MSRB3 and TNRC6B in BC based on GSE3494_U133B; g)Combined FGF7 and TNRC6B in BC based on GSE3494_U133A; h)Combined FGF7 in BC based on GSE9893

Figure 6. Overall survival analysis combined multiple genes expression. a)Combined significant Over-expressed genes(CLDN7, MLLT10, RBM33, SH3RF1, SSBP4 and UBE2Z) in BC based on GSE58812; b)Combined significant Over-expressed genes(CLDN7, MLLT10, RBM33, SH3RF1, SSBP4 and UBE2Z) in BC based on GSE42568; c)Combined significant under-expressed genes(BMPER, FGF7, MSRB3 and TNRC6B) in BC based on GSE42568; d)Combined BMPER,FGF7,MSRB3 and TNRC6B in BC based on GSE37751; e)Combined BMPER,MSRB3 and TNRC6B in BC based on GSE1456_U133B; f)Combined BMPER,MSRB3 and TNRC6B in BC based on GSE3494_U133B; g)Combined FGF7 and TNRC6B in BC based on GSE3494_U133A; h)Combined FGF7 in BC based on GSE9893

3.5 Drug prediction analysis

To identify potential small-molecule compounds with therapeutic effects on breast cancer, drug prediction analysis was performed using CMap. A total of 65 drugs were predicted, and the 10 most significant were trichostatin A, LY-294,002, econazole, Prestwick-1082, vorinostat, lomefloxacin, clorsulon, amantadine, thiostrepton, and orciprenaline ().

Table 3. The most significant drugs provided by cmap to reverse core genes of breast cancers

4. Discussion

Breast cancer is the most commonly diagnosed malignancy in women worldwide and is the main cause of cancer-related death in women [Citation35–37]. Although there are a lot of effective therapeutic agents for breast cancer, breast cancer remains a major health problem and is a top biomedical research priority [Citation38–40], as there is an urgent need for effective breast cancer treatments.

In this study, we identified 10 significant genes (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, UBE2Z, BMPER, FGF7, MSRB3, and TNRC6B) in breast cancer using combined GWAS data and profiling of DEGs. Protein profiling in breast cancer samples from the HPA using immunohistochemistry and analysis of significant DEGs in breast cancer samples compared to normal samples from GEPIA further verified the results. Significantly overexpressed genes (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, and UBE2Z) were correlated with shorter survival, whereas underexpressed genes (BMPER, FGF7, MSRB3, and TNRC6B) were correlated with longer survival in breast cancer.

Consistent with our findings, previous studies have shown that some of these genes play important roles in the development of breast cancer. For example, Bernardi et al. [Citation41] showed that CLDN7 is associated with a shorter time to recurrence, suggesting its contribution to the aggressiveness of breast cancer. In a GWAS, Guo et al. [Citation42] identified common genetic loci for breast cancer risk including SSBP4. Whole transcriptome analysis by Bauer et al. [Citation43] demonstrated that BMPER plays a possible therapeutic role in breast cancer. Fu et al. [Citation44] demonstrated that acetylation, expression and recruitment of FGF7 promoters induce cancer growth and progression. Zhu et al. [Citation45] found that targeting FGF7 can exert oncogenic functions in breast cancer. A previous study showed that the ZEB1-MSRB3 axis is related to breast cancer genome stability [Citation46]. Interestingly, other DEGs in breast cancer identified in this study, including MLLT10, RBM33, SH3RF1, UBE2Z, and TNRC6B, have not been proven in previous studies. We believe that these are potentially novel key genes in breast cancer.

BP analysis in GO annotation indicated that the 10 significant genes are mainly enriched in the Wnt signaling pathway, which plays an important role in the occurrence and development of many cancers. Inhibiting this pathway can suppress breast cancer growth and metastasis [Citation47–49]. MF analysis of GO suggested that the DEGs were most significantly enriched in functions related to oxidoreductase activity. The redox reaction is accompanied by tumor development. CC analysis of GO annotation showed that the 10 DEGs were enriched in P-bodies. A previous study suggested that P-body disassembly correlates with breast cancer progression [Citation50].

KEGG analysis of the 10 DEGs showed their enrichment in breast cancer, gastric cancer, melanoma, the PI3K/Akt signaling pathway, MAPK signaling pathway, Ras signaling pathway, tight junctions, and ubiquitin-mediated proteolysis. Some of these pathways contribute to the development of breast cancer. For example, the PI3K pathway is found in many types of cancer and plays an important role in breast cancer cell proliferation [Citation51]. Ras signaling is a key determinant of poor survival in breast cancer patients [Citation52]. Abnormal MAPK signaling plays a core role in the regulation of growth and survival, and the development of drug resistance in triple-negative breast cancer [Citation53].

The aim of this work was to identify significant genes and potential therapeutic agents for breast cancer based on genomics. We found 65 potentially small-molecule compounds to reverse significant genes in breast cancer. The 10 most significant drugs were trichostatin A, LY-294,002, econazole, Prestwick-1082, vorinostat, lomefloxacin, clorsulon, amantadine, thiostrepton, and orciprenaline. Consistent, with our study, it has been reported that trichostatin A, a histone deacetylase inhibitor, has therapeutic potential in breast cancer [Citation54]. Jiang et al. [Citation55] showed that trichostatin A sensitizes ER-negative breast cancer cells to tamoxifen. LY294002, a specific inhibitor of the PI3K pathway, can decrease the rate of cell growth and increase the therapeutic sensitivity in MCF7 cells expressing wild-type p53, which may be useful for the treatment of breast cancer [Citation56]. Econazole is a novel PI3K/AKT signaling pathway inhibitor, which can be used to overcome adriamycin resistance and improve chemotherapy sensitivity in breast cancer [Citation57]. A preclinical study showed that vorinostat can prevent the formation of brain metastases in breast cancer [Citation58]. Yang et al. [Citation59] suggested that thiostrepton is a promising agent for triple-negative breast cancer. Kwok et al. [Citation60] showed that thiostrepton selectively targets breast cancer cells through inhibition of Forkhead box M1 expression. However, some of the predicted drugs, such as Prestwick-1082, lomefloxacin, clorsulon, amantadine, and orciprenaline, have not been shown to directly play a role in breast cancer. Thus, future studies are needed to confirm our findings.

Compared to previous studies [Citation61–63], we conducted an analysis combining genomic data with drug database analysis to identify novel candidate therapeutic agents for breast cancer treatment. Our study demonstrates the usefulness of this approach for evaluating the relationship among genes, diseases, and drugs. These findings will pave the way for the discovery of potential therapeutic targets for breast cancer.

5. Conclusion:

Combined analyses of network-wide association studies, gene expression profiles, and drug databases are helpful for identifying potential therapeutic agents for diseases. This method is a new paradigm that can guide future research directions.

Availability of data and materials

All materials are available by the corresponding author.

Declarations

Ethical statement

Our study did not require an ethical board approval because it did not contain human or animal trials.

Consent for publication

Not applicable.

Highlights

  1. Combined analyses of network-wide association studies, gene expression profiles, and drug databases.

  2. A useful approach for evaluating the relationship among genes, diseases, and drugs.

These findings will pave the way for the discovery of potential therapeutic targets for breast cancer.

Supplemental material

Supplemental Material

Download ()

Acknowledgements

The authors acknowledge the National Natural Science Foundation of China 81972861, Innovation Capacity Support Plan of Shaanxi Province 2018TD‐002. We thank LetPub (www.letpub.com) for its linguistic assistance during the preparation of this manuscript.

Disclosure Statement

The authors declare no competing interests.

Supplementary Material

Supplemental data for this article can be accessed here.

Additional information

Funding

This work was supported by the National Science Foundation of China [81972861]; The scientific development funding of the first affiliated hospital of Xi’an Jiaotong University (2020QN-07); The fundamental research funds for the central universities (xzy012020044).

References

  • Nyblade L, Stockton M, Travasso S, et al. A qualitative exploration of cervical and breast cancer stigma in Karnataka, India. BMC Womens Health. 2017;17(1):58.
  • Cancer Genome Atlas Network. Comprehensive molecular portraits of human breast tumours. Nature.2012;490(7418):61–70.
  • Hess KR, Pusztai L, Buzdar AU, et al. Estrogen receptors and distinct patterns of breast cancer relapse. Breast Cancer Res Treat. 2003;78(1):105–18. .
  • Andre F, Job B, Dessen P, et al. Molecular characterization of breast cancer with high-resolution oligonucleotide comparative genomic hybridization array. Clin Cancer Res. 2009;15(2):441–51.
  • Cocco S, Piezzo M, Calabrese A, et al. Biomarkers in triple-negative breast cancer: state-of-the-art and future perspectives. Int J Mol Sci. 2020;21(13):13. .
  • Piezzo M, Cocco S, Caputo R, et al. Targeting cell cycle in breast cancer: CDK4/6 INHIBITORS. Int J Mol Sci. 2020;21(18):18.
  • Fu CH, Lin R-J, Yu J, et al. A novel oncogenic role of inositol phosphatase SHIP2 in ER-negative breast cancer stem cells: involvement of JNK/vimentin activation. Stem Cells. 2014;32(8):2048–60.
  • Sirinian C, Papanastasiou AD, Schizas M, et al. RANK-c attenuates aggressive properties of ER-negative breast cancer by inhibiting NF-kappaB activation and EGFR signaling. Oncogene. 2018;37(37):5101–5114.
  • Gao S, Ge A, Xu S, et al. PSAT1 is regulated by ATF4 and enhances cell proliferation via the GSK3beta/beta-catenin/cyclin D1 signaling pathway in ER-negative breast cancer. J Exp Clin Cancer Res. 2017;36(1):179.
  • Nb S, Cd, M, S, B, et al. An azaspirane derivative suppresses growth and induces apoptosis of ER-positive and ER-negative breast cancer cells through the modulation of JAK2/STAT3 signaling pathway. Int J Oncol. 2016;49(3):1221–9.
  • Sp H, Py, L, Cj, K, et al. The Gαh-PLCδ1 signaling axis drives metastatic progression in triple-negative breast cancer. J Hematol Oncol. 2017;10(1):114.
  • Cj H, Yl H, Yf H, et al. Molecular mechanisms of anticancer effects of phytoestrogens in breast cancer. Breast cancer research and treatment. 2018;19(3):323–332.
  • Espinal AC, Buas MF, Wang D, et al. FOXA1 hypermethylation: link between parity and ER-negative breast cancer in African American women? Breast Cancer Res Treat. 2017;166(2):559–568.
  • Hirschhorn JN, Daly MJ. Genome-wide association studies for common diseases and complex traits. Nat Rev Genet. 2005;6(2):95–108.
  • Greene CS, Krishnan A, Wong AK, et al. Understanding multicellular function and disease with human tissue-specific networks. Nat Genet. 2015;47(6):569–76.
  • Wang J, Zheng J, Wang Z, et al. Inferring gene-disease association by an integrative analysis of eQTL genome-wide association study and protein-protein interaction data. Hum Hered. 2018;83(3):117–129.
  • Song A, Yan J, Kim S, et al. Network-based analysis of genetic variants associated with hippocampal volume in Alzheimer’s disease: a study of ADNI cohorts. BioData Min. 2016;9(1):3.
  • Ma C, Gu C, Huo Y, et al. The integrated landscape of causal genes and pathways in schizophrenia. Transl Psychiatry. 2018;8(1):67.
  • Sudlow C, Gallacher J, Allen N, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12(3):e1001779.
  • Hammerschlag AR, Stringer S. Genome-wide association analysis of insomnia complaints identifies risk genes and genetic overlap with psychiatric and metabolic traits. Nat Genet. 2017;49(11):1584–1592.
  • Canela-Xandri O, Rawlik K, Tenesa A. An atlas of genetic associations in UK Biobank. Nat Genet. 2018;50(11):1593–1599.
  • Otasek D, Morris JH, Bouças J, et al. Cytoscape Automation: empowering workflow-based network analysis. Genome Biol. 2019;20(1):185.
  • Bindea G, Galon J, Mlecnik B. CluePedia Cytoscape plugin: pathway insights using integrated experimental and in silico data. Bioinformatics. 2013;29(5):661–3.
  • Szklarczyk D, Franceschini A, Wyder S, et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43(D1):D447–52. Database issue.
  • Szklarczyk D.  Morris J H, Cook H, et al. The STRING database in 2017: quality-controlled protein-protein association networks. made broadly accessible. Nucleic Acids Res. 2017;45(D1):D362–d368.
  • Y, Z, Y Z, Y F, et al. Identification of biomarkers, pathways and potential therapeutic agents for white adipocyte insulin resistance using bioinformatics analysis. Adipocyte. 2019;8(1):318–329.
  • Z, T, C L, B K,  et al. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Res. 2017;45(null):W98–W102.
  • Ponten F, Jirstrom K, Uhlen M. The human protein atlas–a tool for pathology. J Pathol. 2008;216(4):387–93.
  • Uhlen M, Zhang C, Lee S, et al. A pathology atlas of the human cancer transcriptome. Science. 2017;357(6352):6352.
  • Thul PJ, Åkesson L, Wiking M, et al. A subcellular map of the human proteome. Science. 2017;356(6340):6340.
  • Goswami CP, Nakshatri H. PROGgeneV2: enhancements on the existing database. BMC Cancer. 2014;14(1):970.
  • Lamb J, et al. The connectivity map: using gene-expression signatures to connect small molecules, genes, and disease. Science. 2006;313(5795):1929–35.
  • Musa A, Ghoraie LS, Zhang S-D, et al. A review of connectivity map and computational approaches in pharmacogenomics. Brief Bioinform. 2018;19(3):506–523.
  • Raghavan R, Hyter S, Pathak HB, et al. Drug discovery using clinical outcome-based connectivity mapping: application to ovarian cancer. BMC Genomics. 2016;17(1):811.
  • North CM, Christiani DC. Women and lung cancer: what is new? Semin Thorac Cardiovasc Surg. 2013;25(2):87–94.
  • Woolston C. Breast cancer. Nature. 2015;527(7578):S101.
  • Tsikouras P, Zervoudis S, Manav B, et al. Cervical cancer: screening, diagnosis and staging. J Buon. 2016;21(2):320–5.
  • Anastasiadi Z, Lianos GD, Ignatiadou E, et al. Breast cancer in young women: an overview. Updates Surg. 2017;69(3):313–317. .
  • Musselwhite LW, Oliveira CM, Kwaramba T, et al. Racial/ethnic disparities in cervical cancer screening and outcomes. Acta Cytol. 2016;60(6):518–526. .
  • Kossai M, Leary A, Scoazec J-Y, et al. Ovarian Cancer: a heterogeneous disease. Pathobiology. 2018;85(1–2):41–49.
  • Bernardi M, Logullo A, Pasini F, et al. Prognostic significance of CD24 and claudin-7 immunoexpression in ductal invasive breast cancer. Oncol Rep. 2012;27(1):28–38.
  • Guo X, Lin W, Bao J, et al. A Comprehensive cis-eQTL analysis revealed target genes in breast cancer susceptibility loci identified in genome-wide association studies. Am J Hum Genet. 2018;102(5):890–903.
  • Bauer D, Mazzio E, Soliman KFA. Whole transcriptomic analysis of apigenin on TNFα immuno-activated MDA-MB-231 breast cancer cells. Cancer Genomics Proteomics. 2019;16(6):421–431.
  • Fu R, Han C, Ni T, et al. A ZEB1/p53 signaling axis in stromal fibroblasts promotes mammary epithelial tumours. Nat Commun. 2019;10(1):3210.
  • Zhu Y, Yang L, Chong Q-Y, et al. Long noncoding RNA Linc00460 promotes breast cancer progression by regulating the miR-489-5p/FGF7/AKT axis. Cancer Management and Research. 2019;11(p):5983–6001.
  • Morel AP, Ginestier C, Pommier RM, et al. A stemness-related ZEB1-MSRB3 axis governs cellular pliancy and breast cancer genome stability. Nat Med. 2017;23(5):568–578.
  • Lin Q, He Y, Wang X, et al. Targeting pyruvate carboxylase by a small molecule suppresses breast cancer progression. Adv Sci (Weinh). 2020;7(9):1903483.
  • Khawaled S, Nigita G, Distefano R, et al. Pleiotropic tumor suppressor functions of WWOX antagonize metastasis. Signal Transduction and Targeted Therapy. 2020;5(1):43. .
  • Eyre R, Alférez DG, Santiago-Gómez A, et al. Microenvironmental IL1β promotes breast cancer metastatic colonisation in the bone via activation of Wnt signalling. Nature Communications. 2019;10(1):5016. .
  • Cougot N, Daguenet É, Baguet A, et al. Overexpression of MLN51 triggers P-body disassembly and formation of a new type of RNA granules. J Cell Sci. 2014;127(21):4692–701. .
  • A, K, V LN, N N, et al. Benzo[b]furan derivatives induces apoptosis by targeting the PI3K/Akt/mTOR signaling pathway in human breast cancer cells. chemistry SRNJB. 2016;66(undefined):124–131.
  • Kl W, JR A, JC L, et al. Ras signaling is a key determinant for metastatic dissemination and poor survival of luminal breast cancer patients. Cancer Res. 2015;75(22):4960–72.
  • W, C, J L, Q H, et al. AMP-activated protein kinase: a potential therapeutic target for triple-negative breast cancer. Breast Cancer Res. 2019;21(1):29.
  • Alao J, Stavropoulou A, Lam E, et al. Histone deacetylase inhibitor, trichostatin A induces ubiquitin-dependent cyclin D1 degradation in MCF-7 breast cancer cells. Mol Cancer. 2006;5(p):8.
  • Jang E, Lim S, Lee E, et al. The histone deacetylase inhibitor trichostatin A sensitizes estrogen receptor alpha-negative breast cancer cells to tamoxifen. Oncogene. 2004;23(9):1724–36.
  • Zheng L,  Ren J, Li H, et al. Downregulation of wild-type p53 protein by HER-2/neu mediated PI3K pathway activation in human breast cancer cells: its effect on cell proliferation and implication for therapy. Cell Res. 2004;14(6):497–506.
  • Dong C,  Chen Y, Ma J, et al. Econazole nitrate reversed the resistance of breast cancer cells to Adriamycin through inhibiting the PI3K/AKT signaling pathway. Am J Cancer Res. 2020;10(1):263–274.
  • Palmieri D, Lockman P, Thomas F, et al. Vorinostat inhibits brain metastatic colonization in a model of triple-negative breast cancer and induces DNA double-strand breaks. Clinical cancer res. 2009;15(19):6148–57.
  • Yang N, Zhou T, Lei X, et al. Inhibition of sonic hedgehog signaling pathway by thiazole antibiotic thiostrepton attenuates the CD44+/CD24-stem-like population and sphere-forming capacity in triple-negative breast cancer. Cell Physiol Biochem. 2016;38(3):1157–70.
  • Kwok J, Myatt S, Marson C, et al. Thiostrepton selectively targets breast cancer cells through inhibition of forkhead box M1 expression. Mol Cancer Ther. 2008;7(7):2022–32.
  • Xu X, Zhang J, Zhang Z, et al. Systems pharmacology in combination with proteomics reveals underlying mechanisms of Xihuang pill against triple-negative breast cancer. Bioengineered. 2020;11(1):1170–1188. .
  • Ye Q, Han X, Wu Z. Bioinformatics analysis to screen key prognostic genes in the breast cancer tumor microenvironment. Bioengineered. 2020;11(1):1280–1300.
  • Yan S, Xu J, Liu B, et al. Long non-coding RNA BCAR4 aggravated proliferation and migration in esophageal squamous cell carcinoma by negatively regulating p53/p21 signaling pathway. Bioengineered. 2021;12(1):682–696.