Research Article

Enhancing the Human Health Status Prediction: The ATHLOS Project

P. Anagnostou (a) Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece. Corresponding author.

https://orcid.org/0000-0002-4775-9220

S. Tasoulis (a) Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece

https://orcid.org/0000-0001-9536-4090

A. G. Vrahatis (a) Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece

S. Georgakopoulos (a) Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece

M. Prina (b) Social Epidemiology Research Group, Health Service and Population Research Department, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK; (c) Global Health Institute, King’s College London, London, UK

https://orcid.org/0000-0001-6698-3263

J. L. Ayuso-Mateos (d) Centro De Investigación Biomédica En Red De Salud Mental, CIBERSAM, Madrid, Spain; (e) Department of Psychiatry, Universidad Autónoma De Madrid, Madrid, Spain; (f) Hospital Universitario De La Princesa, Instituto De Investigación Sanitaria Princesa (IIS Princesa), Madrid, Spain

https://orcid.org/0000-0002-7544-826X

J. Bickenbach (g) Swiss Paraplegic Research, Guido A. Zäch Institute (GZI), Nottwil, Switzerland; (h) Department of Health Sciences & Health Policy, University of Lucerne, Lucerne, Switzerland

I. Bayes-Marin (d) Centro De Investigación Biomédica En Red De Salud Mental, CIBERSAM, Madrid, Spain; (i) Research, Innovation and Teaching Unit, Parc Sanitari Sant Joan De Déu, Sant Boi De Llobregat, Spain

F. F. Caballero (j) Department Preventive Medicine and Public Health, Universidad Autónoma De Madrid/Idipaz, Madrid, Spain; (k) Centro De Investigación Biomédica En Red De Epidemiología Y Salud Pública, CIBERESP, Madrid, Spain

L. Egea-Cortés (i) Research, Innovation and Teaching Unit, Parc Sanitari Sant Joan De Déu, Sant Boi De Llobregat, Spain

E. García-Esquinas (j) Department Preventive Medicine and Public Health, Universidad Autónoma De Madrid/Idipaz, Madrid, Spain; (k) Centro De Investigación Biomédica En Red De Epidemiología Y Salud Pública, CIBERESP, Madrid, Spain

M. Leonardi (l) Fondazione IRCCS Istituto Neurologico Carlo Besta, Milan, Italy

S. Scherbov (m) International Institute for Applied Systems Analysis, World Population Program, Wittgenstein Centre for Demography and Global Human Capital, Laxenburg, Austria; (n) Austrian Academy of Science, Vienna Institute of Demography, Vienna, Austria; (o) Russian Presidential Academy of National Economy and Public Administration (RANEPA), Moscow, Russian Federation

A. Tamosiunas (p) Lithuanian University of Health Sciences, Kaunas, Lithuania

A. Galas (q) Department of Epidemiology and Preventive Medicine, Jagiellonian University, Krakow, Poland

J. M. Haro (d) Centro De Investigación Biomédica En Red De Salud Mental, CIBERSAM, Madrid, Spain; (i) Research, Innovation and Teaching Unit, Parc Sanitari Sant Joan De Déu, Sant Boi De Llobregat, Spain

https://orcid.org/0000-0002-3984-277X

A. Sanchez-Niubo (d) Centro De Investigación Biomédica En Red De Salud Mental, CIBERSAM, Madrid, Spain; (i) Research, Innovation and Teaching Unit, Parc Sanitari Sant Joan De Déu, Sant Boi De Llobregat, Spain

https://orcid.org/0000-0003-0309-181X

V. Plagianakos (a) Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece

D. Panagiotakos (r) Department of Nutrition and Dietetics, School of Health Science and Education, Harokopio University, Athens, Greece

Figures & data

Table 1. Comparison of five imputation methods (Linear Regression (LR), Mean Imputation (Mean), Multiple Linear Regression (MLR), Dual Imputation Method (DIM), and Vtreat) in regression tasks using six regression techniques (two Deep Neural Networks (DNN1 and DNN2), k-Nearest Neighbors (kNN), Linear Regression (LR), Random Forests (RF), and XGBoost). The table reports the mean (standard error) of the R-squared measure (%) and the mean (standard error) of the Root Mean Square Error (RMSE) over 80 independent executions. The best value among imputation methods for each regression technique is shown in bold, and the best value across all imputation methods and regression techniques is shown in bold italics.

Figure 1. Radar plots of the classification results for each imputation method used in this paper: Mean Imputation (Mean), Linear Regression (LR) imputation, Multiple Linear Regression (MLR) imputation, Dual Imputation Model (DIM), and Vtreat imputation. The axes of the radar plots are the metrics accuracy, precision, recall, sensitivity, and specificity. There is one radar plot for each classification model: Logistic Regression, kNN Classification, Random Forests, XGBoost, and the two Deep Neural Network models (DNN1 and DNN2). Panels: (a) Logistic Regression; (b) kNN Classification; (c) Random Forests; (d) XGBoost; (e) DNN1; (f) DNN2.

Figure 2. Scatter plots (left column) depict the first two principal components of PCA performed on the five imputed ATHLOS datasets, obtained using Linear Regression, Mean, Multiple Linear Regression, Dual Imputation Model, and Vtreat imputation. Circular points in orange, yellow, and light blue indicate low, medium, and high HS scores, respectively. The data distributions are shown above and to the right of each scatter plot. Heatmap-scatter plots (right column) depict the correlation between the predicted and the real HS score for the five imputation methods using the Principal Components Regression (PCR) technique. The red-to-green color gradient of the boxes indicates low to high numbers of samples, respectively. The marginal distributions of the HS and the HS estimation are shown above and to the right of each heatmap-scatter plot as univariate histograms with a density curve, on the vertical and horizontal axes of the scatter plot, respectively.

Table 2. Comparison of five imputation methods using the Principal Components Regression technique. The table reports the R-squared values (%) and the mean (standard error) values of the Root Mean Square Error (RMSE). The best value among imputation methods for each measure is shown in bold.

Figure 3. The horizontal bars show the most (left) and the least (right) important variables for HealthStatus prediction, as ranked by the XGBoost classification algorithm. The x-axis shows the variable importance score, while the y-axis lists the feature names defined by the ATHLOS project (see supplementary sheet S1).

Supplemental material

Supplemental Material (MS Excel file, 76.5 KB)
