Figures & data
Figure 1 WGCNA of the OSA dataset (GSE135917). (A) The sample clustering results. (B) The sample dendrogram and trait heatmaps. (C) A soft threshold of 18 was the most suitable value to OSA. (D) Sixteen modules were related to OSA. (E) The corresponding heatmaps of module-trait relationships. (F) The Zsummary scores of 16 modules calculated by module preservation.
![Figure 1 WGCNA of the OSA dataset (GSE135917). (A) The sample clustering results. (B) The sample dendrogram and trait heatmaps. (C) A soft threshold of 18 was the most suitable value to OSA. (D) Sixteen modules were related to OSA. (E) The corresponding heatmaps of module-trait relationships. (F) The Zsummary scores of 16 modules calculated by module preservation.](/cms/asset/3f40160b-e70c-4d48-ab5e-6c250286f691/dijg_a_12168428_f0001_c.jpg)
Figure 2 WGCNA of lung cancer dataset in The Cancer Genome Atlas (TCGA). (A) The sample clustering results. (B) The sample dendrogram and trait heatmaps. (C) A soft threshold of five was the most suitable value to lung cancer. (D) Thirty-five modules were related to lung cancer. (E) The corresponding heatmaps of module-trait relationships. (F) The Zsummary scores of 35 modules calculated by module preservation.
![Figure 2 WGCNA of lung cancer dataset in The Cancer Genome Atlas (TCGA). (A) The sample clustering results. (B) The sample dendrogram and trait heatmaps. (C) A soft threshold of five was the most suitable value to lung cancer. (D) Thirty-five modules were related to lung cancer. (E) The corresponding heatmaps of module-trait relationships. (F) The Zsummary scores of 35 modules calculated by module preservation.](/cms/asset/c75b1bc8-e358-4941-ab5d-0da3fc881fd7/dijg_a_12168428_f0002_c.jpg)
Figure 3 Screening for key modules and common genes. (A) Dark gray and ivory modules were screened in OSA and (B) blue and yellow modules were screened in lung cancer. (C) A total of 104 overlapped common genes in key modules of OSA and lung cancer were screened. (D) Gene Ontology analysis of 104 common genes in OSA and lung cancer.
![Figure 3 Screening for key modules and common genes. (A) Dark gray and ivory modules were screened in OSA and (B) blue and yellow modules were screened in lung cancer. (C) A total of 104 overlapped common genes in key modules of OSA and lung cancer were screened. (D) Gene Ontology analysis of 104 common genes in OSA and lung cancer.](/cms/asset/3c5dfd6d-e3e1-40a5-8bfa-c8d7dfa2ac89/dijg_a_12168428_f0003_c.jpg)
Table 1 Functional Enrichment Analysis and the Identified 10 Enriched Pathways
Figure 4 Identification of key genes associated with prognosis of lung cancer. (A) Fourteen genes and (B) four genes were related to prognosis by univariate Cox regression algorithm analysis and multivariate Cox regression algorithm analysis, respectively. The expressions of MOAP1, CBX7, PDGFB, and MAP2K3 were downregulated in both The Cancer Genome Atlas (TCGA) dataset (C) and the GSE30219 lung cancer samples (D). P values less than 0.05 are considered significant.
![Figure 4 Identification of key genes associated with prognosis of lung cancer. (A) Fourteen genes and (B) four genes were related to prognosis by univariate Cox regression algorithm analysis and multivariate Cox regression algorithm analysis, respectively. The expressions of MOAP1, CBX7, PDGFB, and MAP2K3 were downregulated in both The Cancer Genome Atlas (TCGA) dataset (C) and the GSE30219 lung cancer samples (D). P values less than 0.05 are considered significant.](/cms/asset/31b4a10f-bc69-4a54-8cf5-70fb4438c05b/dijg_a_12168428_f0004_c.jpg)
Table 2 The Coefficients of 4 Key Prognostic Genes
Figure 5 The prognostic assessment of the signature in The Cancer Genome Atlas (TCGA) dataset. (A) Survival status and survival duration of patients and heatmap of the four key prognostic genes expression in patients with lung cancer. (B) The risk score distribution of patients with lung cancer in TCGA dataset. (C) The Kaplan–Meier curve of the overall survival between the high-risk and low-risk groups stratified by the median risk score in TCGA dataset. (D) ROC analysis of the predictive efficiency of the four-gene prognostic model on overall survival based on risk score.
![Figure 5 The prognostic assessment of the signature in The Cancer Genome Atlas (TCGA) dataset. (A) Survival status and survival duration of patients and heatmap of the four key prognostic genes expression in patients with lung cancer. (B) The risk score distribution of patients with lung cancer in TCGA dataset. (C) The Kaplan–Meier curve of the overall survival between the high-risk and low-risk groups stratified by the median risk score in TCGA dataset. (D) ROC analysis of the predictive efficiency of the four-gene prognostic model on overall survival based on risk score.](/cms/asset/f737e3cc-2273-4694-95cd-660ac06fbc25/dijg_a_12168428_f0005_c.jpg)
Figure 6 The prognostic assessment of the signature in the GSE30219 dataset. (A) Survival status and survival duration of patients and heatmap of the four key prognostic genes expression in patients with lung cancer. (B) The risk score distribution of patients with lung cancer in the GSE30219 dataset. (C) The Kaplan–Meier curve of the overall survival between the high-risk and low-risk groups stratified by the median risk score in the GSE30219 dataset. (D) ROC analysis of the predictive efficiency of the four-gene prognostic model on overall survival based on risk score.
![Figure 6 The prognostic assessment of the signature in the GSE30219 dataset. (A) Survival status and survival duration of patients and heatmap of the four key prognostic genes expression in patients with lung cancer. (B) The risk score distribution of patients with lung cancer in the GSE30219 dataset. (C) The Kaplan–Meier curve of the overall survival between the high-risk and low-risk groups stratified by the median risk score in the GSE30219 dataset. (D) ROC analysis of the predictive efficiency of the four-gene prognostic model on overall survival based on risk score.](/cms/asset/12b1944f-9f3a-47c1-add0-bf2a6580a113/dijg_a_12168428_f0006_c.jpg)
Figure 7 Univariate and multivariate analyses in The Cancer Genome Atlas (TCGA) training set and the GSE30219 validation set. (A) Univariate analysis in TCGA training set. (B) Multivariate analysis in TCGA training set. (C) Univariate analysis in the GSE30219 validation set. (D) Multivariate analysis in the GSE30219 validation set. P values less than 0.05 are considered significant.
![Figure 7 Univariate and multivariate analyses in The Cancer Genome Atlas (TCGA) training set and the GSE30219 validation set. (A) Univariate analysis in TCGA training set. (B) Multivariate analysis in TCGA training set. (C) Univariate analysis in the GSE30219 validation set. (D) Multivariate analysis in the GSE30219 validation set. P values less than 0.05 are considered significant.](/cms/asset/d7475163-3361-40c0-8161-c1299f6c8ec8/dijg_a_12168428_f0007_c.jpg)
Figure 8 Validation of the nomogram in predicting overall survival of lung cancer in The Cancer Genome Atlas (TCGA) training set and the GSE30219 validation set. (A and C) A nomogram to predict survival probability at the 1-, 3-, 5-, 7-, and 10-year timepoints for patients with lung cancer on the basis of results derived from TCGA training and GSE30219 validation sets. (B and D) Calibration curve for the nomogram when predicting the 1-, 3-, 5-, 7-, and 10-year overall survival in TCGA training and GSE30219 validation sets.
![Figure 8 Validation of the nomogram in predicting overall survival of lung cancer in The Cancer Genome Atlas (TCGA) training set and the GSE30219 validation set. (A and C) A nomogram to predict survival probability at the 1-, 3-, 5-, 7-, and 10-year timepoints for patients with lung cancer on the basis of results derived from TCGA training and GSE30219 validation sets. (B and D) Calibration curve for the nomogram when predicting the 1-, 3-, 5-, 7-, and 10-year overall survival in TCGA training and GSE30219 validation sets.](/cms/asset/3d4a09f7-02f9-4e29-a2d3-dad3176a7e8e/dijg_a_12168428_f0008_c.jpg)