307
Views
62
CrossRef citations to date
0
Altmetric
Original Research

Application of weighted gene co-expression network analysis to identify key modules and hub genes in oral squamous cell carcinoma tumorigenesis

, , , , , & show all
Pages 6001-6021 | Published online: 19 Sep 2018

Figures & data

Figure 1 Clustering of samples and determination of soft-thresholding power.

Notes: (A) The clustering was based on the expression data of GSE30784, which contained 167 OSCC, 17 dysplasia and 45 normal samples. The top 5,000 genes with the highest SD values were used for the analysis by WGCNA. The color intensity was proportional to disease status (normal, dysplasia and OSCC), sex and age. (B) Analysis of the scale-free fit index for various soft-thresholding powers (β). (C) Analysis of the mean connectivity for various soft-thresholding powers. In all, 9 was the most fit power value.
Abbreviations: OSCC, oral squamous cell carcinoma; WGCNA, weighted gene co-expression network analysis.
Figure 1 Clustering of samples and determination of soft-thresholding power.

Figure 2 Construction of co-expression modules by WGCNA package in R.

Notes: (A) The cluster dendrogram of module eigengenes. (B) The cluster dendrogram of genes in GSE39784. Each branch in the figure represents one gene, and every color below represents one co-expression module.
Abbreviation: WGCNA, weighted gene co-expression network analysis.
Figure 2 Construction of co-expression modules by WGCNA package in R.

Figure 3 (A) Interaction relationship analysis of co-expression genes. Different colors of horizontal axis and vertical axis represent different modules. The brightness of yellow in the middle represents the degree of connectivity of different modules. There was no significant difference in interactions among different modules, indicating a high-scale independence degree among these modules. (B) Heatmap of the correlation between module eigengenes and the disease status of OSCC. The turquoise module was the most positively correlated with status, and the brown module was the most negatively correlated with status. (C) Hierarchical clustering of module hub genes that summarize the modules yielded in the clustering analysis. (D) Heatmap plot of the adjacencies in the hub gene network.

Abbreviation: OSCC, oral squamous cell carcinoma.
Figure 3 (A) Interaction relationship analysis of co-expression genes. Different colors of horizontal axis and vertical axis represent different modules. The brightness of yellow in the middle represents the degree of connectivity of different modules. There was no significant difference in interactions among different modules, indicating a high-scale independence degree among these modules. (B) Heatmap of the correlation between module eigengenes and the disease status of OSCC. The turquoise module was the most positively correlated with status, and the brown module was the most negatively correlated with status. (C) Hierarchical clustering of module hub genes that summarize the modules yielded in the clustering analysis. (D) Heatmap plot of the adjacencies in the hub gene network.

Figure 4 (A) Scatter plot of module eigengenes in the turquoise module. (B) Scatter plot of module eigengenes in the brown module. (C) The top 30 hub genes in the turquoise module. (D) The top 30 hub genes in the brown module. Nodes represent genes, and node size is correlated with connectivity of the gene by degree.

Figure 4 (A) Scatter plot of module eigengenes in the turquoise module. (B) Scatter plot of module eigengenes in the brown module. (C) The top 30 hub genes in the turquoise module. (D) The top 30 hub genes in the brown module. Nodes represent genes, and node size is correlated with connectivity of the gene by degree.

Figure 5 Survival analysis of hub genes.

Notes: (AE) Five hub genes in the turquoise module with the highest node degree and significant results of survival analysis (P⩽0.05 was regarded as significant). They were MMP1, TNFRSF12A, PLAU, FSCN1 and PDPN, respectively. (FJ) Five hub genes in the brown module with the highest node degree and significant results of survival analysis (P⩽0.05 was regarded as significant). They were KRT78, EVPL, GGT6, SMIM5 and CYSRT1, respectively.
Figure 5 Survival analysis of hub genes.

Table 1 GO enrichment analysis of turquoise module and brown module

Table 2 KEGG pathway enrichment analysis of turquoise module and brown module

Figure 6 Validation of hub genes in GSE31056.

Notes: (A) Volcano plot visualizing DEGs in GSE31056 (23 normal samples and 21 OSCC). The vertical lines demarcate the fold change values. The right vertical line corresponds to more than or equal to twofold up changes and the left vertical line more than or equal to twofold down changes, while the horizontal line marks a –log10 adjusted P-value of 0.01. (B) Heat map hierarchical clustering reveals DEGs in OSCC groups compared with those in control groups. (C) Identification of common genes between DEGs and the turquoise module by overlapping them. The five hub genes in the turquoise module were also DEGs in GSE31056. (D) Identification of common genes between DEGs and the brown module by overlapping them. The five hub genes in the brown module were also DEGs in GSE31056.
Abbreviations: DEGs, differently expressed genes; OSCC, oral squamous cell carcinoma.
Figure 6 Validation of hub genes in GSE31056.

Figure 7 Validation of hub genes in the transcriptional level.

Notes: (A) Validation of hub genes in GSE30784. The expression status of five hub genes in the turquoise module was positively correlated with the disease status, indicating they played an important role in promoting tumorigenesis. This was in accordance with the results of WGCNA. The expression status of five hub genes in the brown module was negatively correlated with the disease status, indicating they played an important role in suppressing tumorigenesis. This was in accordance with the results of WGCNA. (B) Validation of hub genes in GSE74530 and the results were the same as earlier. (C) Validation of hub genes by TCGA HNSC data in GEPIA and we obtained similar results as above. These findings indicated that our results were reliable. *** indicates ⩽0.001, **** indicates ⩽0.0001.
Abbreviations: WGCNA, weighted gene co-expression network analysis; HNSC, head and neck squamous cell carcinoma.
Figure 7 Validation of hub genes in the transcriptional level.
Figure 7 Validation of hub genes in the transcriptional level.

Figure 8 Validation of hub genes in the translational level.

Notes: (AD) Validation of five hub genes in turquoise module by The Human Protein Atlas database (IHC). There was no related IHC samples of MMP1 in the database. The translational expression level of the rest four hub genes was positively correlated with disease status as they were upregulated in OSCC samples. (EI) Validation of five hub genes in the brown module by The Human Protein Atlas database (IHC). The translational expression level of five hub genes was negatively correlated with disease status, as they were downregulated in OSCC samples.
Abbreviations: IHC, immunohistochemistry; OSCC, oral squamous cell carcinoma.
Figure 8 Validation of hub genes in the translational level.

Figure 9 Genetic alterations associated with hub genes in TCGA HNSC.

Notes: (A) A visual summary across on a query of 10 hub genes showing genetic alteration of 10 hub genes in TCGA HNSC patients. (B) The total alteration frequency of 10 hub genes in TCGA HNSC is illustrated. (C) The network contains 55 nodes, including our 10 query genes and the 50 most frequently altered neighbor genes (only four out of 10 were correlated with the 50 genes). Relationship between hub genes and tumor drugs is also illustrated.
Abbreviation: HNSC, head and neck squamous cell carcinoma.
Figure 9 Genetic alterations associated with hub genes in TCGA HNSC.

Figure S1 The medianRank and Zsummary statistics of module preservation.

Notes: Module preservation was evaluated by medianRank and Zsummary statistics, which correlated to connectivity and density of networks. If Zsummary ⩾10, there is a strong evidence that the module is preserved. The module with a lower medianRank tends to exhibit stronger observed preservation than the module with a higher medianRank if both of them are preserved. (A) Module preservation analysis for the process of normal to dysplasia. (B) Module preservation analysis for the process of dysplasia to neoplasm. The pink module was found to be highly preserved in both processes of normal to dysplasia and dysplasia to OSCC.
Abbreviation: OSCC, oral squamous cell carcinoma.
Figure S1 The medianRank and Zsummary statistics of module preservation.

Figure S2 (A) Volcano plot visualizing DEGs in GSE30784 (45 normal samples and 167 OSCC). The vertical lines demark the fold change values. The right vertical line corresponds to more than or equal to twofold up changes, and the left vertical line corresponds to twofold down changes, while the horizontal line marks a –log10 adjusted P-value of 0.01. (B) Heat map hierarchical clustering reveals DEGs in OSCC groups compared with control groups.

Abbreviations: DEG, differently expressed genes; OSCC, oral squamous cell carcinoma.
Figure S2 (A) Volcano plot visualizing DEGs in GSE30784 (45 normal samples and 167 OSCC). The vertical lines demark the fold change values. The right vertical line corresponds to more than or equal to twofold up changes, and the left vertical line corresponds to twofold down changes, while the horizontal line marks a –log10 adjusted P-value of 0.01. (B) Heat map hierarchical clustering reveals DEGs in OSCC groups compared with control groups.

Figure S3 Survival analysis of the 10 hub genes using GSE41613 for validation.

Figure S3 Survival analysis of the 10 hub genes using GSE41613 for validation.

Table S1 GO enrichment analysis of DEGs

Table S2 KEGG pathway enrichment analysis of turquoise module and brown module