ABSTRACT
Cancer is a disease with a complex genome of altered functions. However, most existing driver gene identification approaches rarely consider driver genes may have the same functional properties. To overcome this issue, we propose the gamma distribution test for the driver gene identification based on similarity networks, termed GASN, which identifies driver genes by combining machine learning and distributional statistics methods. Similarity networks are able to learn gene similarities and key features that represent the functional impact of genes. In addition, we classify genes into different cellular compartments and use the gamma distribution test within cellular compartments to identify significant driver genes. The experimental results show that our method outperforms the other 17 comparative methods.
Acknowledgments
The authors would like to thank anonymous reviewers for their very detailed and helpful reviews. Dazhi Jiang: Conceptualization, Programming and writing. Runguo Wei: Conceptualization, Methodology, Writing and proofreading. Zhihui He: Programming and proofreading. Cheng Liu: Methodology, Writing and proofreading. Senlin Lin: Writing and proofreading. Yingqing Lin: Conceptualization, Methodology, Writing and proofreading.
Disclosure statement
No potential conflict of interest was reported by the authors