166
Views
0
CrossRef citations to date
0
Altmetric
Research Articles

An enhanced methodology for predicting protein-protein interactions between human and hepatitis C virus via ensemble learning algorithms

, ORCID Icon, , , &
Pages 10592-10602 | Received 05 Jan 2021, Accepted 17 Jun 2021, Published online: 11 Jul 2021
 

Abstract

Hepatitis C virus (HCV) is responsible for a variety of human life-threatening diseases, which include liver cirrhosis, chronic hepatitis, fibrosis and hepatocellular carcinoma (HCC) . Computational study of protein-protein interactions between human and HCV could boost the findings of antiviral drugs in HCV therapy and might optimize the treatment procedures for HCV infections. In this analysis, we constructed a prediction model for protein-protein interactions between HCV and human by incorporating the features generated by pseudo amino acid compositions, which were then carried out at two levels: categories and features. In brief, extra-tree was initially used for feature selection while SVM was then used to build the classification model. After that, the most suitable models for each category and each feature were selected by comparing with the three ensemble learning algorithms, that is, Random Forest, Adaboost, and Xgboost. According to our results, profile-based features were more suitable for building predictive models among the four categories. AUC value of the model constructed by Xgboost algorithm on independent data set could reach 92.66%. Moreover, Distance-based Residue, Physicochemical Distance Transformation and Profile-based Physicochemical Distance Transformation performed much better among the 17 features. AUC value of the Adaboost classifier constructed by Profile-based Physicochemical Distance Transformation on the independent dataset achieved 93.74%. Taken together, we proposed a better model with improved prediction capacity for protein-protein interactions between human and HCV in this study, which could provide practical reference for further experimental investigation into HCV-related diseases in future.

Communicated by Ramaswamy H. Sarma

Disclosure statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Authors’ contributions

XL and XZ proposed the core ideas of the project. XL and LW collected and processed the data, performed the experiments, and contribute to the writing of the manuscript. CL, YL and TY validated the results and critically reviewed the manuscript. All authors read and approved the final manuscript and consent the publication of this study.

Additional information

Funding

Prof. Xin Liu greatly appreciated the funding of Xuzhou Science and Technology Project (KC17123), Jiangsu Postdoctoral Science Foundation (1601080 C, 1701062B), Jiangsu University Natural Science Foundation (17KJB310015), and Research Foundation for Talented Scholars in Xuzhou Medical University (D2015001). Prof. Liang Wang acknowledged the financial support of National Natural Science Foundation of China (31900022), Natural Science Foundation of Jiangsu Province (BK20180997), Jiangsu Qinglan Project (2020), and the Funding of Innovative Science and Technology Team of Young Scientists at Xuzhou Medical University (TD202001).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.