245
Views
12
CrossRef citations to date
0
Altmetric
Original Research

Detection of cognitive impairment using a machine-learning algorithm

, , , , , , & show all
Pages 2939-2945 | Published online: 01 Nov 2018

Abstract

Purpose

The Mini-Mental State Examination (MMSE) is one of the most frequently used bedside screening measures of cognition. However, the Korean Dementia Screening Questionnaire (KDSQ) is an easier and more reliable screening method. Instead, other clinical variables and raw data were used for this study without the consideration of a cutoff value. The objective of this study was to develop a machine-learning algorithm for the detection of cognitive impairment (CI) based on the KDSQ and the MMSE.

Patients and methods

The original dataset from the Clinical Research Center for Dementia of South Korea study was obtained. In total, 9,885 and 300 patients were randomly allocated to the training and test datasets, respectively. We selected up to 24 variables including sex, age, education duration, diabetes mellitus, and hypertension. We trained a machine-learning algorithm using TensorFlow based on the training dataset and then calculated its accuracy using the test dataset. The cost was calculated by conducting a logistic regression.

Results

The accuracy of the model in predicting CI based on the KDSQ only, the MMSE only, and the combination of the KDSQ and MMSE was 84.3%, 88.3%, and 86.3%, respectively. For the KDSQ, the sensitivity for detecting CI was 91.50% and the specificity for detecting normal cognition (NL) was 59.60%. The sensitivity of the MMSE was 94.35%, and the specificity was 59.62%. When combining the KDSQ and the MMSE, the sensitivity for detecting CI was 91.5% and the specificity for detecting NL was 61.5%.

Conclusion

The algorithm predicting CI based on the MMSE is superior. However, the KDSQ can be administered more easily in clinical practice and the algorithm using KDSQ is a comparable screening tool.

View correction statement:
Detection of cognitive impairment using a machine-learning algorithm [Corrigendum]

Introduction

The prevention of dementia is one of the highest priorities for public health, and a predictive test is needed for its early intervention. Recently reported and several ongoing prevention trials have focused on the issues of screening of at-risk dementia patients or dementia prediction.Citation1Citation3 Several dementia prediction models have been proposed.Citation4Citation7

The Mini-Mental State Examination (MMSE) was developed as a brief screening tool to provide a quantitative assessment of cognitive impairment (CI) and is one of the most frequently used bedside screening measures of cognition.Citation8 However, subsequent studies have suggested several limitations of its use. For example, it can overestimate impairments in those older than the age of 60 years and in those with less education.Citation9,Citation10 The MMSE is also insensitive to CI from subcortical lesions in the right hemisphere and frontal lobes.Citation11 These limitations are considered to have far-reaching implications for dementia screening.Citation12

A questionnaire for dementia screening is a possible alternative. Compared to the MMSE, the Korean Dementia Screening Questionnaire (KDSQ) is a dementia screening questionnaire that does not have to be conducted in person and can be administered by an interviewer without specialized skills. It is not influenced by age or educational level. The KDSQ is sensitive in identifying early dementia patients, and its validity and reliability have previously been evaluated. Conducting a KDSQ is an easier and more reliable screening method ().Citation13

Machine-learning models were originally designed to analyze large, complex medical datasets.Citation14 Machine-learning algorithms have been used to detect various diseases such as coronary artery disease and liver malfunction and select genes for cancer detection. Recently, machine-learning techniques for diagnosing dementia have been studied, the majority of which involved analyzing brain magnetic resonance imaging (MRI), positron emission tomography, and electroencephalography and some involved analyzing risk factors and gene data.Citation15Citation20 However, the intent of this current study was to identify cognitively impaired patients with memory problems using a large dataset of dementia screening test results and to investigate the utility of the KDSQ.

Our objective herein was to evaluate and comprehensively compare machine learning for the diagnosis of CI based on the KDSQ and MMSE. Significant differences are not expected to occur between machine learning using the KDSQ and that using the MMSE.

Patients and methods

We trained a machine-learning algorithm using TensorFlow (https://www.tensorflow.org/) to distinguish between the patients with CI and those with normal cognition (NL) based on the data that were obtained from the Clinical Research Center for Dementia of South Korea (CREDOS).Citation21 Some of the patients were tested using this algorithm to evaluate the accuracy rate. The design and protocol of this prospective study were approved by the institutional review board of Chung-Ang University Hospital (registration no I2007040 [98]). TensorFlow is an open source software library for machine learning developed by Google based on the python computer language.Citation22

Participants

A total of 10,185 patients were selected from the CREDOS study. The CREDOS study was a prospective, multi-center, hospital-based cohort study designed to assess the occurrence and risk factors of cognitive disorders. The “clinical diagnosis” dataset was composed of participants categorized as one of three diagnostic classes: cognitively normal, mild cognitive impairment (MCI), and Alzheimer’s disease (AD). The cognitively normal class technically included subjective cognitive decline (SCD) because the CREDOS data were from a hospital-based study.

The inclusion criteria for SCD were as follows: 1) sustained subjective memory complaints; 2) normal general cognition (within 1 SD of the age- and education-adjusted norms of the Korean version of the MMSE [K-MMSE]Citation23 and a score of >26); 3) intact activities of daily living (ADL); and 4) no abnormality (within 1 SD of age- and education-adjusted norms) on a comprehensive neuropsychological battery (Seoul Neuropsychological Screening Battery).Citation24,Citation25

The criteria for MCI in the CREDOS study were as follows: 1) the presence of memory complaints; 2) intact function in ADL; 3) objective CI (≥1 SD below age- and education-adjusted norms) in more than one cognitive domain on a comprehensive neuropsychological battery (Seoul Neuropsychological Screening Battery);Citation24,Citation25 4) a clinical dementia rating (CDR) of 0.5; and 5) not demented according to the Diagnostic and Statistical Manual of Mental Disorders (DSM)-IV criteria.

The patients with AD met the probable AD criteria proposed by the National Institute of Neurological and Communicative Disorders and Stroke and Alzheimer’s Disease and Related Disorders AssociationCitation26 as well as the DSM-IV.

Patients with clinical evidence of a stroke; structural lesions such as territorial infarction, intracranial hemorrhage, brain tumor, and hydrocephalus; and current or past neurological or psychiatric illnesses such as schizophrenia, epilepsy, brain tumor, encephalitis, and severe head trauma were excluded.

The original dataset we received from CREDOS had 786 variables for 10,185 patients. The data were mined for the objective variables only, and subjective and leading variables that would indicate CI were removed. That is, the variables represented by numbers or scores were adopted, and variables such as neuropsychiatric inventory obtained through caregiver’s interviews and a Global Deterioration Score or CDR suggestive of CI were excluded. The missing values that were identified in the past medical history were ultimately enrolled. Ultimately, there were 24 variables (including the outcome variable). The following is a sample of the variables such as sex, age at the time of visit, education duration, diabetes mellitus (DM), hypertension (HT), hypercholesterolemia, stroke history, 15-item score of the KDSQ, MMSE score, and the outcome variable.

Model training

We divided the patients into two groups: cognitively normal and cognitively impaired, the latter of which included MCI and dementia patients because this study was conducted to develop an algorithm to identify CI among patients who required further inquiry.

The first step in modeling the data comprised the following preprocessing steps. The cardinality variables were standardized. The variables (age at the time of visit, education duration, and MMSE score) were normalized as follows: χ=χµσ where χ′ denotes standardized cardinality of variables, χ denotes raw cardinality of variables, µ denotes mean, and σ denotes SD.

In the second step, the dataset was then split randomly into train and test sets, and each set was developed to create feature (x_data) and outcome (y_data) variables. The third step involved model training with the train dataset using TensorFlow. The cost was calculated by a logistic regression. The fourth step involved calculation of the accuracy using the test dataset ().

Statistical analyses

The difference between CI and NL with regard to age, education duration, and MMSE and KDSQ scores was evaluated using a Student’s t-test, and the sex ratio, frequency of DM and HT, and stroke history were evaluated using a chi-squared test.

The accuracies of the MMSE, KDSQ, and MMSE with KDSQ in the prediction of CI were measured by the frequency of correct estimations by the trained algorithm in the test dataset. Therefore, the ratio of correct answers among all patients in the test dataset was used.

The sensitivity and specificity of the algorithm and the receiver operating characteristic (ROC) curve were calculated using Microsoft Excel version 16.0 (Microsoft Corporation, Chicago, IL, USA) and SPSS version 23 (IBM Corporation, Armonk, NY, USA).

Results

After data mining of raw data from CREDOS, a brief inspection of the data revealed that it was an imbalanced dataset. Cases of CI were 5.5-fold more prevalent in the dataset; 84.7% of the cases had CI and 15.3% of the cases were cognitively normal. The gender distribution was 31.2% males and 68.8% females ().

Table 1 Demographics of the dataset

In total, 9,885 and 300 patients were allocated to the training and test datasets, respectively. The test dataset was composed of 52 cognitively normal and 248 cognitively impaired patients. All were tested with the trained model.

The accuracy of the model for predicting CI based on the KDSQ only, MMSE only, and KDSQ and MMSE combined was 84.3%, 88.3%, and 86.3%, respectively. The KDSQ had a 91.5% sensitivity for detecting CI and a 59.6% specificity for detecting cognitively normal cases. The sensitivity and specificity of the MMSE were 94.35% and 59.62%, respectively. When combining the KDSQ and MMSE, the sensitivity for detecting CI was 91.5% and the specificity for detecting cognitively normal cases was 61.5%.

The ROC curves of the three predictors of the KDSQ, MMSE, and the combination of KDSQ and MMSE are provided in . To discriminate CI from NL, the MMSE had the highest area under the ROC curve (0.770), followed by the combination of the KDSQ and MMSE (0.765) and the MMSE only (0.756).

Figure 1 The ROC curve of three predictors of the KDSQ, MMSE, and the combination of KDSQ and MMSE.

Note: The ROC curves were obtained by conducting a logistic regression using TensorFlow on a dataset from the CREDOS.
Abbreviations: CREDOS, Clinical Research Center for Dementia of South Korea; KDSQ, Korean Dementia Screening Questionnaire; MMSE, Mini-Mental State Examination; ROC, receiver operating characteristic.
Figure 1 The ROC curve of three predictors of the KDSQ, MMSE, and the combination of KDSQ and MMSE.

Discussion

This study investigated the utility of an algorithm for predicting CI using a large dataset from the CREDOS study. It also verified the accuracy rate for predicting patients with CI.

The accuracies of the screening tests in this study are lower than those expected in machine learning.Citation27 However, the sensitivity of the algorithm using the KDSQ to detect the CI was 91.5%, which is good for a screening tool. Even after adding the MMSE, the algorithm did not improve in the detection of CI; however, the specificity improved slightly. The improved specificity may be the reason that the accuracy improved. This algorithm exhibited a higher sensitivity and a lower specificity; however, there are possible implications from a dementia screening perspective. Screening tools should not miss the patients who have CI; therefore, they generally require a higher sensitivity even if this results in a lower specificity.Citation28

The algorithm using the MMSE only exhibited the highest sensitivity and accuracy rates. The first reason is that this algorithm was trained with variables including age and education levels, which particularly affect the accuracy of the MMSE compared to that of the KDSQ. The second reason is that this outcome was caused by the inclusion of diagnostic criteria for NL, MCI, and AD in the CREDOS data subset, which already included an MMSE cutoff score between cognitively normal and MCI.

A previous study that investigated screening methods using the KDSQ and MMSE demonstrated that the combination of the KDSQ and MMSE had the highest area under the ROC curve (0.784).Citation13 The area under the ROC curve of our algorithm was 0.765, which is similar to that of the previous study; however, the MMSE had the highest area under the ROC curve (0.770). The differences in the values between the current study and the previous study may be due to the previous study already having a cutoff value for the MMSE and KDSQ. Instead, other clinical variables and raw data were used for this study without the consideration of a cutoff value. In addition, this study included data from a substantial number of patients from the CREDOS.

As a screening tool, compared to the KDSQ, the MMSE has several limitations. For example, it must be conducted face-to-face, and the interviewer must be skilled. In addition, it is influenced by age and education level. Therefore, the KDSQ is easier to utilize, particularly in primary care practice.

Significant differences were not expected to occur between machine learning using the KDSQ without and with the MMSE for screening CI because there was no difference in the sensitivity level. The addition of the MMSE improved only the specificity for detecting NL.

The machine-learning algorithm using only the KDSQ and several clinical variables could be useful for screening patients with CI in primary care practice.

However, there are several limitations. The sensitivity of the model is >90%, but even as a screening tool for CI, the specificity is low and the accuracy is not sufficient to reach the level that is typically expected in machine learning. Additionally, the data provided by the CREDOS were an imbalanced dataset since patients with CI were 5.5-fold more prevalent. Furthermore, the model is only applicable to differentiating between cognitively normal persons and patients with MCI or dementia and cannot be used to differentiate dementia subtypes.

Conclusion

We trained and tested a machine-learning algorithm model of the KDSQ for distinguishing cognitively normal and cognitively impaired patients using the CREDOS data and suggest its possibility as a screening tool for CI.

Author contributions

All authors contributed toward data analysis and drafting and revision of the paper, gave final approval of the version to be published, and agree to be accountable for all aspects of the work.

Acknowledgments

This work was supported by the Ministry of Education of the Republic of Korea and the National Research Foundation of Korea (NRF-2017S1A6A3A01078538), Korea Ministry of Health & Welfare, and Original Technology Research Program for Brain Science through the National Research Foundation of Korea which was funded by the Korean Government (MSIP; No 2014M3C7A1064752).

Supplementary materials

Table S1 KDSQ cognition

Table S2 The TensorFlow code using CREDOS data to predict CI

Disclosure

The authors report no conflicts of interest in this work.

References

  • ShaikMAKhooCHThiagarajahAGPilot Evaluation of a Dementia Case Finding Clinical Service Using the Informant AD8 for At-Risk Older Adults in Primary Health Care: A Brief ReportJ Am Med Dir Assoc2016177673.e5673.e8
  • MendonçaMDAlvesLBugalhoPFrom Subjective Cognitive Complaints to Dementia: Who is at Risk? A Systematic ReviewAm J Alzheimers Dis2016312105114
  • BregmanNRegevKMooreOGiladiNAshEA Simple Tool to Reach Populations at Risk for Developing Dementia and Alzheimer’s DiseaseJ Alzheimers Dis201546115115525720401
  • SolomonASoininenHDementia: Risk prediction models in dementia preventionNat Rev Neurol201511737537725986502
  • RondeauVMauguenALaurentABerrCHelmerCDynamic prediction models for clustered and interval-censored outcomes: Investigating the intra-couple correlation in the risk of dementiaStat Methods Med Res20172652168218326184832
  • LebedevaAKWestmanEBorzaTMRI-Based Classification Models in Prediction of Mild Cognitive Impairment and Dementia in Late-Life DepressionFront Aging Neurosci201791328210220
  • StephanBCKurthTMatthewsFEBrayneCDufouilCDementia risk prediction in the population: are screening models accurate?Nat Rev Neurol20106631832620498679
  • FolsteinMFFolsteinSEMcHughPR“Mini-mental state”. A practical method for grading the cognitive state of patients for the clinicianJ Psychiatr Res19751231891981202204
  • CrumRMAnthonyJCBassettSSFolsteinMFPopulation-based norms for the Mini-Mental State Examination by age and educational levelJAMA199326918238623918479064
  • GrigolettoFZappalàGAndersonDWLebowitzBDNorms for the Mini-Mental State Examination in a healthy populationNeurology199953231532010430420
  • OhEKangYShinJHYeonBKA Validity Study of K-MMSE as a Screening Test for Dementia: Comparison Against a Comprehensive Neuropsychological EvaluationDement Neurocognitive Disord20109812
  • LancuIOlmerAThe minimental state examination – an up-to-date reviewHarefuah2006145968769017078433
  • ChoiSHParkMHThree screening methods for cognitive dysfunction using the Mini-Mental State Examination and Korean Dementia Screening QuestionnaireGeriatr Gerontol Int201616225225825655174
  • KononenkoIMachine learning for medical diagnosis: history, state of the art and perspectiveArtif Intell Med20012318910911470218
  • BryanRNMachine Learning Applied to Alzheimer DiseaseRadiology2016281366566827870619
  • ChenRHerskovitsEHMachine-learning techniques for building a diagnostic model for very mild dementiaNeuroimage201052123424420382237
  • MathotaarachchiSPascoalTAShinMIdentifying incipient dementia individuals using machine learning and amyloid imagingNeurobiol Aging201759809028756942
  • ScheubertLLuštrekMSchmidtRRepsilberDFuellenGTissue-based Alzheimer gene expression markers-comparison of multiple machine learning approaches and investigation of redundancy in small biomarker setsBMC Bioinformatics20121326623066814
  • ShankleWRManiaSDickMBPazzaniMJSimple models for estimating dementia severity using machine learningStud Health Technol Inform199852Pt 147247610384501
  • TohkaJMoradiEHuttunenHAlzheimer’s Disease Neuroimaging I. Comparison of Feature Selection Techniques in Machine Learning for Anatomical Brain MRI in DementiaNeuroinformatics20161427929626803769
  • ParkHKNaDLHanSHClinical characteristics of a nationwide hospital-based registry of mild-to-moderate Alzheimer’s disease patients in Korea: a CREDOS (Clinical Research Center for Dementia of South Korea) studyJ Korean Med Sci20112691219122621935279
  • RampasekLGoldenbergATensorFlow: Biology’s Gateway to Deep Learning?Cell Syst201621121427136685
  • HanCJoSAJoIKimEParkMHKangYAn adaptation of the Korean mini-mental state examination (K-MMSE) in elderly Koreans: demographic influence and population-based norms (the AGE study)Arch Gerontol Geriatr200847330231017936377
  • AhnHJChinJParkASeoul Neuropsychological Screening Battery-dementia version (SNSB-D): a useful tool for assessing and monitoring cognitive impairments in dementia patientsJ Korean Med Sci20102571071107620592901
  • JahngSNaDLKangYConstructing a Composite Score for the Seoul Neuropsychological Screening Battery-CoreDementia Neurocognitive Disord2015144137142
  • DuboisBFeldmanHHJacovaCResearch criteria for the diagnosis of Alzheimer’s disease: revising the NINCDS-ADRDA criteriaLancet Neurol20076873474617616482
  • PrashanthRDutta RoySMandalPKGhoshSHigh-Accuracy Detection of Early Parkinson’s Disease through Multimodal Features and Machine LearningInt J Med Inform201690132127103193
  • IsellaVMapelliCSiriCValidation and attempts of revision of the MDS-recommended tests for the screening of Parkinson’s disease dementiaParkinsonism Relat Disord2014201323624084382