Search in:

Connection Science Volume 35, 2023 - Issue 1

Submit an article Journal homepage

Open access

744

Views

CrossRef citations to date

Altmetric

Research Article

GASN: gamma distribution test for driver genes identification based on similarity networks

Dazhi Jianga Department of Computer Science, Shantou University, Shantou, People's Republic of ChinaView further author information

Runguo Weia Department of Computer Science, Shantou University, Shantou, People's Republic of ChinaView further author information

Zhihui Hea Department of Computer Science, Shantou University, Shantou, People's Republic of ChinaView further author information

Senlin Linb High Performance Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, People's Republic of China

https://orcid.org/0000-0002-3925-9381 View further author information

Cheng Liua Department of Computer Science, Shantou University, Shantou, People's Republic of ChinaView further author information

Yingqing Lina Department of Computer Science, Shantou University, Shantou, People's Republic of ChinaCorrespondence[email protected]
View further author information

Article: 2167937 | Received 22 Mar 2022, Accepted 05 Jan 2023, Published online: 28 Jan 2023

Cite this article
https://doi.org/10.1080/09540091.2023.2167937
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

Figures & data

Figure 1. The workflow of GASN is divided into 6 steps. Specifically, (a) is used to predict the genetic characteristics of gene mutation function influence score and the observed FIS, which are fused as the feature vector for constructing a similarity network. (b) Taking gene $g_{i}$ as an example, the similarity between $g_{i}$ and other genes is calculated. (c) The top k-nearest neighbor genes $g_{s k}$ in the similarity ranking of each gene are selected to form the topology. (d) Taking gene $g_{i}$ as an example, the gene similarity network is constructed, and its order is $g_{i}, g_{s 1}, g_{i}, g_{s 2}, \dots, g_{i}, g_{s k}$ , the similarity network of all genes is used as the input of convolutional neural network. (e) It shows the 5-layer basic structure of the convolutional neural network used in this paper, including the input layer, convolution layer, pooling layer, full connection layer, and output layer. (f) The gene background distribution is fitted by gamma distribution in the subclass, and the observed FIS is compared with the predicted FIS in the background distribution, to obtain the p value of each gene and select the gene with significant deviation as the driver gene.

Figure 2. Histogram of gene background distribution in 9 cancer types.

Table 1. 12 genetic characteristics of multimers data sources.

Download CSV Display Table

Table 2. Performance indexes of gene similarity network convolution neural network under different k values.

Display Table

Figure 3. The number of NCG6.0 genes identified by different methods in 10 cancer types.

Figure 4. The precision of NCG6.0 genes identified by different methods in 10 cancer types.

Figure 5. The recall of NCG6.0 genes identified by different methods in 10 cancer types.

Figure 6. The number of CGC genes identified by different methods in 10 cancer types.

Figure 7. The precision of CGC genes identified by different methods in 10 cancer types.

Figure 8. The recall of CGC genes identified by different methods in 10 cancer types.

Acemel, R. D., Tena, J. J., Irastorza-Azcarate, I., Marlétaz, F., Gómez-Marín, C., & Gómez-Skarmeta, J. L. (2016). A single three-dimensional chromatin compartment in amphioxus indicates a stepwise evolution of vertebrate Hox bimodal regulation. Nature Genetics, 48(3), 336–341. https://doi.org/10.1038/ng.3497

PubMed Web of Science ®Google Scholar

Gu, H., Xu, X., Qin, P., & Wang, J. (2020). FI-net: Identification of cancer driver genes by using functional impact prediction neural network. Frontiers in Genetics, 11, Article 564839. https://doi.org/10.3389/fgene.2020.564839

Web of Science ®Google Scholar

Wendl, M. C., Wallis, J. W., Lin, L., Kandoth, C., Mardis, E. R., Wilson, R. K., & Ding, L. (2011). PathScan: A tool for discerning mutational significance in groups of putative cancer genes. Bioinformatics (Oxford, England), 27(12), 1595–1602. https://doi.org/10.1093/bioinformatics/btr193

PubMed Web of Science ®Google Scholar

Khan, J., Wei, J. S., Saal, L. H., Ladanyi, M., & Meltzer, P. S. (2001). Classication and diagnostic prediction of cancers using gene expression proling and articial neural networks. Nature Medicine, 7(6), 673–679. https://doi.org/10.1038/89044

PubMed Web of Science ®Google Scholar

Wyckoff, G. J., Malcom, C. M., Vallender, E. J., & Lahn, B. T. (2005). A highly unexpected strong correlation between fixation probability of nonsynonymous mutations and mutation rate. Trends in Genetics, 21(7), 381–385. https://doi.org/10.1016/j.tig.2005.05.005

PubMed Web of Science ®Google Scholar

Huntley, R., Dimmer, E., Barrell, D., Binns, D., & Apweiler, R. (2009). The gene ontology annotation (goa) database. Nature Precedings, 10, 1–1. https://doi.org/10.1038/npre.2009.3154.1

Google Scholar

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

GASN: gamma distribution test for driver genes identification based on similarity networks

Table 1. 12 genetic characteristics of multimers data sources.

Table 2. Performance indexes of gene similarity network convolution neural network under different k values.

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

GASN: gamma distribution test for driver genes identification based on similarity networks

Figures & data

Table 1. 12 genetic characteristics of multimers data sources.

Table 2. Performance indexes of gene similarity network convolution neural network under different k values.

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date