5,637
Views
20
CrossRef citations to date
0
Altmetric
Original Articles

Use of machine learning methods in prediction of short-term outcome in autism spectrum disorders

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, , , ORCID Icon, , , , & ORCID Icon show all
Pages 320-325 | Received 16 Aug 2018, Accepted 03 Nov 2018, Published online: 16 Nov 2018

ABSTRACT

OBJECTIVE

Studies show partial improvements in some core symptoms of Autism Spectrum Disorders (ASD) in time. However, the predictive factors (e.g. pretreatment IQ, comorbid psychiatric disorders, adaptive, and language skills, etc.) for a better the outcome was not studied with machine learning methods. We aimed to examine the predictors of outcome with machine learning methods, which are novel computational methods including statistical estimation, information theories and mathematical learning automatically discovering useful patterns in large amounts of data.

METHOD

The study the group comprised 433 children (mean age: 72.3 ± 45.9 months) with ASD diagnosis. The ASD symptoms were assessed by the Autism Behavior Checklist, Aberrant Behavior Checklist, Clinical Global Impression scales at baseline (T0) and 12th (T1), 24th (T2), and 36th (T3) months. We tested the performance of for machine learning algorithms (Naive Bayes, Generalized Linear Model, Logistic Regression, Decision Tree) on our data, including the 254 items in the baseline forms. Patients with ≤2 CGI points in ASD symptoms at in 36 months were accepted as the group who has “better outcome” as the prediction class.

RESULTS

The significant proportion of the cases showed significant improvement in ASD symptoms (39.7% in T1, 60.7% in T2; 77.8% in T3). Our machine learning model in T3 showed that diagnosis group affected the prognosis. In the autism group, older father and mother age; in PDD-NOS group, MR comorbidity, less birth weight and older age at diagnosis have a worse outcome. In Asperger’s Disorder age at diagnosis, age at first evaluation and developmental cornerstones has affected prognosis.

CONCLUSION

In accordance with other studies we found early age diagnosis, early start rehabilitation, the severity of ASD symptoms at baseline assessment predicted outcome. Also, we found comorbid psychiatric diagnoses are affecting the outcome of ASD symptoms in clinical observation. The machine learning models reveal several others are more significant (e.g. parental age, birth weight, sociodemographic variables, etc.) in terms of prognostic information and also planning treatment of children with ASD.

Introduction

Autism Spectrum Disorder (ASD) refers to a group of complex neurodevelopmental conditions that occurs within early years of life [Citation1]. ASD characterized with deficits in social interaction, repetitive or stereotypic behaviours, sensory abnormalities and verbal and non-verbal communication [Citation2]. In the treatment of ASD, special education [Citation3], behavioural therapy [Citation4] and psychopharmacological therapies for comorbid psychiatric disorders and irritability [Citation5] are indicated. Recent studies showed improvement in core symptoms of ASD but different results among individual participants had reported affected by various child and family-specific factors, such as pretreatment IQ, comorbid psychiatric disorders, adaptive, and language skills [Citation6]. A major group of research has focused on heritable and genetic causes of ASD [Citation7], which shows to have a highly heritable component and these research point out etiological heterogeneity. Also, research has shown that in children with ASD the rate of de novo genetic variants, that means mutations probably resulted in environmental stressors [Citation8]. As a result, recent studies showed prenatal maternal environment [Citation9,Citation10], parental age [Citation11], early behavioural intervention [Citation12] increasingly being recognized as important for the developmental outcome.

Machine learning methods are novel computational methods include mathematical learning, statistical estimation, and information theories automatically discovering useful patterns in large amounts of data. This method has the advantage of accurate and reliable prediction using data with very large numbers of variables and causal inference within non-experimental data sets [Citation13]. Recent studies in psychiatry are showed these methods been successfully used in diagnostics of ASD [Citation14], classification by altered event-related potentials in Attention Deficit Hyperactivity Disorder [Citation15], and classification by free speech analysis in Schizophrenia [Citation16]. Bishop et al. [Citation17] used machine learning methods to examine adult ASD patients lifetime health problems, and these methods precisely predicted health problems, including cardiovascular, urinary, respiratory systems.

We examined predictors of outcome after 3-year clinical observation, special education and psychiatric treatment in the clinical-based group. To our knowledge in literature, no previous study has examined the psychiatric, developmental and sociodemographic aspects of effecting prognosis on children with ASD using machine learning methods. We used our follow up data for the ASD group and tried to predict short-time outcomes of ASD and find clinical and individual factors affecting improvement in core symptoms of autism.

Methods

The study was carried in Ondokuz Mayis University Samsun, in the years, 2013–2016 with a naturalistic design. All participants in the clinical group had been previously diagnosed according to with the criteria described in the Diagnostic and Statistical Manual of Mental Disorders-IV TR by child psychiatry specialists and also comorbid psychiatric diagnosis is assessed and treatment planned by child psychiatry specialists. Caregivers filled sociodemographic and rehabilitation information form. ASD symptoms were assessed by child psychiatrist using the Autism Behavior Checklist, Aberrant Behavior Checklist, Clinical Global Impression scales at baseline (T0) and 12 (T1), 24 (T2), 36 (T3) months.

Sociodemographic data form

A sociodemographic form was completed at T0 by researchers with information related to the child, father, mother. Questions form were related to the mother and father’s age, the number of siblings, the economic status of the family (low 1500TL, middle 1500–5000TL and high-income 5000TL), the patient’s and the family’s level of education (educated years), medical history (chronic medical diseases), and self-reported history of the psychiatric diseases in first and second degree relatives.

Clinical information form

Child and Adolescent Psychiatry residents completed rehabilitation hours and medical treatments at T0, T1, T2, and T3 in this data form.

Autism Behavior Checklist

Autistic type behaviour problems were measured by the clinician using the Autism Behavior Checklist (ABC) a questionnaire pertaining to the core symptoms of autism [Citation18]. High Test scores indicate different degrees of autistic behaviour such as socially non-responsive behaviour, stereotypic behaviour, perceptual oddities, and echolalic speech. The maximum possible score is 158. The Turkish language version of the ABC has been validated, a cut-off point of 39 was determined and the alpha coefficient and split half reliability for the ABC total score were 0.92 [Citation19]. Child and Adolescent psychiatry specialists completed the Aberrant Behavior Checklist scale at T0, T1, T2, and T3.

Aberrant Behavior Checklist

Rated on a 4-point Likert scale, 58-item informant-based measure of problem behaviours of people with autism or other developmental disabilities [Citation20]. There are five subscales: Social Withdrawal (16 items), Hyperactivity (16 items), Irritability (15 items), Stereotypic Behavior (7 items), and Inappropriate Speech (4 items). Alpha coefficients have ranged from 0.77 to 0.95 across subscales [Citation21]. The Turkish translation and adaptation of the Aberrant Behavior Checklist was conducted on adolescent individuals with intellectual disability [Citation22,Citation23]. Adaptation study of Aberrant Behavior Checklist for preadolescent children was performed later; to do this, three items were rewritten to be congruent with this age range [Citation24]. Internal consistencies for all Aberrant Behavior Checklist subscales were found to be moderate to high. Cronbach’s alpha values were as follows: (I) Irritability, Agitation, Crying: 0.90; (II) Lethargy, Social Withdrawal: 0.81; (III) Stereotypic Behavior: 0.83; (IV) Hyperactivity, Non-compliance: 0.89; (V) Inappropriate Speech: 0.68. Child and Adolescent psychiatry specialists completed the Aberrant Behavior Checklist scale at T0, T1, T2 and T3.

Clinical global impression

The Clinical Global Impression Scale (CGI) was developed by Guy et al. to evaluate the course of all psychiatric disorders at all ages. The CGI is a scale 7-point Likert type scoring and has three dimensions to evaluate the treatment responses of individuals with psychiatric disorders and is completed by the physician during the interview. In this study, the subscale of Clinical Global Impression Scale-Severity was used to evaluate patients. Child and Adolescent psychiatry specialists completed CGI scale at T0, T1, T2, T3.

Machine learning methods

We pre-processed the data in two steps. Firstly, we have cleaned data by using imputation for missing values. We used column-wise delete (if more than 30% data is missing), the last-observation-carried-forward method for CGI points for imputation and for other missing values imputed by using k-nearest neighbourhood method. Secondly, we have applied dimension reduction via principal component analysis. We have converted CGI points into a discrete classification problem in order to apply most of the classification algorithms. Variables which had high rates of correlation (if more than 60%) with CGI points did not use for outcome overfitting. We used last-observation-carried forward method for lost patients in follow-up.

We tested the performance of for machine learning algorithms (Naive Bayes, Generalized Linear Model, Logistic Regression, Decision Tree) on our data, using the 254 items in the sociodemographic and clinical data form, Autistic Behavior Checklist, Aberrant Behavior Checklist as features, and the CGI based (Patients with ≤2 CGI points in ASD symptoms at in 36 months were accepted as the group who has “better outcome”) as the prediction class. We used tenfold cross-validation for train and test sets, where the data split is repeated 10 times with 90% training to 10% test set by changing the test set each time. At the end, the success rates and error rates are calculated by the average of each repetition.

We created prediction models using four machine learning classifiers independently with tenfold cross-validation (). This process was repeated using all sub-sets and means acquired by this process calculated for Receiver Operating Characteristic curves with Area Under Curve. In order to prevent overfitting issues, algorithms modified with their major settings and left the minor tuning unchanged default values. Modelling was performed with RapidMiner Studio software [Citation25]. After results of cross-validation, we chose automatically optimized decision tree method for the highest AUC (0.707) value ().

Figure 1. Decision tree method for prediction of short-term outcome.

Figure 1. Decision tree method for prediction of short-term outcome.

Table 1. Results of cross-validation for each classifier.

Ethics committee approval

Approval for the study was granted by the Clinical Research Ethics Committee of Ondokuz University with approval no. B.30.2.ODM.0.20.08/1806, dated 07/2018.

Results

The study group comprised 433 children with ASD diagnosis. 79.4% were male and, 42.2% were diagnosed as Autistic Disorder, 48.5% were diagnosed with Pervasive Developmental Disorder-NOS and %9.2 were Asperger Disorder at time T0. At T0, participants mean age was 72.3 ± 45.9 months and %24.4 of them were younger than 36 months. The comorbid diagnosis was Attention Deficit and Hyperactivity Disorder (n:265, 61.2%), Obsessive Compulsive Disorder (n:105, 24.2%) and Bipolar Disorder (n:32, 7.4%). Other medical diagnoses (epilepsy, cerebral palsy, genetic disorders etc.) are presented of 22.3% of the group.

265 of 433 (61.2%) children were in follow-up more than 12 months. At T2, 219 (50.5%), at T3; 185 (42.7%) children were in follow up. shows sociodemographic and clinical variables and weights for machine learning model and shows decision tree diagram for prediction “better outcome” (AUC: 0.707, sensitivity: 81.1%, specificity: 61.3%).

Table 2. Weights of sociodemographic and clinical variables of the study group.

Discussion

Our aim was to analyse several sociodemographic and clinical factors with regard to ASD prognosis. We evaluated the predictors of “better outcome” in 36-month follow-up among 433 patients. We found improvement in symptoms was substantial: 39.7% at 12 months, 60.7% at 24 months and 77.8% at 36 months. Diagnostic groups as per DSM-IV-TR affected prognoses. Autism group, father and mother age are the most powerful factors for prognosis. In PDD-NOS group, MR comorbidity, birth weight, and age at diagnosis have effected a better outcome. Asperger’s Disorder age at diagnosis, age at first evaluation and developmental cornerstone (speak two words) has effected a better outcome group.

In our analysis of predictors, we compared better and poor outcomes regarding the frequency of several clinical factors. We used four machine learning methods for predict outcome and us choose a decision tree for data visualization, as it has the highest AUC and accuracy belong to this methods. The decision tree is a non-parametric supervised learning method used for classification and regression, and widely used to visually and explicitly represent prediction and decision making in clinical research [Citation25]. This method classifies patient groups into branch-like segments that construct an inverted tree with a root node and leaf nodes. Our best model showed (81% sensitivity, 61% specificity) as a decision tree (), diagnosis groups effects prognosis. These results are preliminary but in ASD like complex neurodevelopmental disorders, this method may help clinicians to identify subgroups of patients. This approach may provide different treatment strategies to achieve the optimal medical outcome.

We found different factors had different weights on predicting the outcome (). ASD severity and sub-groups of diagnosis is affected prognosis. DSM-IV subgroups discussed and gathered in one large diagnosis, Autism Spectrum Disorders DSM-5 but still different prognostic outcomes showed between literature [Citation26]. Also, comorbid psychiatric and neurological diagnosis is effected prognosis. We found high rates of ADHD in our study group. ADHD has been shown to occur at higher rates in children with ASD than their typically developing peers [Citation27] and prognostic relevance of social dysfunctioning associated with ADHD [Citation28]. Mental retardation comorbidity effected prognosis, which low cognitive, delayed developmental skills are known to have negative effects on the outcome [Citation29]. Other neurological conditions (epilepsy, cerebral palsy etc.) are common in autism [Citation30,Citation31] and we found they have effects on the important determinant of short-term outcome.

Mother and father age at birth had effected the outcome in our study. In literature advancing paternal age associated with autism which explained by epigenetic alterations associated with aging [Citation32]. Also advancing parental age was more strongly associated with autism with intellectual disabilities [Citation11]. Maternal age is associated with younger age of first evaluation and reported as cofactor of maternal education and other sociodemographic characteristics [Citation34]. We found the birth weight is also an important determinant for the outcome. Literature shows, birth weight of <2500 g were increased risk for autism, although the magnitude of risk from these factors varied according to autism with comorbidities [Citation35]. Low birth weight also associated with a high risk of developing psychiatric symptoms and disorders by adolescence [Citation36] and also emotional, behavioural, social and also academic problems at later times [Citation37].

Autism research requires a more detailed characterization of core ASD symptoms (repetitive behaviours, social deficit) for clinical reasons and neurobiological reasons. Researchers will need more data to identify this symptom, that can help understand the disorder. Also, the prognosis is so heterogeneity that we need reliable and efficient, dimensional measures to help our clinical decisions. Our results provide preliminary support to the decision tree method as an informative method that can assist clinical decision in predicting treatment response in Autism Spectrum Disorders, though further work is required to provide enhanced levels of classification and prediction performance.

Limitations

First of all, there are limitations to the decision tree method. The main disadvantage is that it can be subject to overfitting. There are many factors affecting the outcome of autism, overfitting occurs particularly when using small data sets. We used cross-fold validation to avoid this problem, it is unlikely that this represents overfitting and overestimation of the AUC. Still, this problem can limit the generalizability and robustness of the resultant models. Our data were retrospective and we used short-term variables and also clinician-rated improvement in autism, that may cause overlook of factors that effect core symptoms in treatment. Complex cases with neurological comorbidities did not investigate, that lower generalizability of this study. Also genetic etiologies (i.e. Fragile X, Di-George Syndrome) did not investigated, may affected the outcome [Citation33,Citation34].

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

References

  • Wing L. The autistic spectrum. Lancet. 1997;350:1761–1766. doi: 10.1016/S0140-6736(97)09218-0
  • Diagnostic and statistical manual of mental disorders (DSM-5®): American Psychiatric Pub; 2013.
  • Wei X, Wagner M, Christiano ER, et al. Special education services received by students with autism spectrum disorders from preschool through high school. J Spec Educ. 2014;48:167–179. doi: 10.1177/0022466913483576
  • Wong C, Odom SL, Hume KA, et al. Evidence-based practices for children, youth, and young adults with autism spectrum disorder: A comprehensive review. J Autism Dev Disord. 2015;45:1951–1966. doi: 10.1007/s10803-014-2351-z
  • Bertelli MO, Rossi M, Keller R, et al. Update on psychopharmacology for autism spectrum disorders. Adv Mental Health Intellectual Disabilities. 2016;10(1):6–26. doi: 10.1108/AMHID-10-2015-0049
  • Fava L, Strauss K. Response to early intensive behavioral intervention for autism – an umbrella approach to issues critical to treatment individualization. Int J Dev Neurosci. 2014;39:49–58. doi: 10.1016/j.ijdevneu.2014.05.004
  • Vorstman JA, Parr JR, Moreno-De-Luca D, et al. Autism genetics: opportunities and challenges for clinical translation. Nat Rev Genet. 2017;18:362. doi: 10.1038/nrg.2017.4
  • Ronemus M, Iossifov I, Levy D, et al. The role of de novo mutations in the genetics of autism spectrum disorders. Nat Rev Genet. 2014;15:133. doi: 10.1038/nrg3585
  • Varcin KJ, Alvares GA, Uljarević M, et al. Prenatal maternal stress events and phenotypic outcomes in autism spectrum disorder. Autism Res. 2017;10:1866–1877. doi: 10.1002/aur.1830
  • Hornig M, Bresnahan M, Che X, et al. Prenatal fever and autism risk. Mol Psychiatry. 2018;23:759. doi: 10.1038/mp.2017.119
  • Idring S, Magnusson C, Lundberg M, et al. Parental age and the risk of autism spectrum disorders: findings from a Swedish population-based cohort. Int J Epidemiol. 2014;43:107–115. doi: 10.1093/ije/dyt262
  • Estes A, Munson J, Rogers SJ, et al. Long-term outcomes of early intervention in 6-year-old children with autism spectrum disorder. J Am Acad Child Adolesc Psychiatry. 2015;54:580–587. doi: 10.1016/j.jaac.2015.04.005
  • Saxe GN, Ma S, Ren J, et al. Machine learning methods to predict child posttraumatic stress: a proof of concept study. BMC Psychiatry. 2017;17:223. doi: 10.1186/s12888-017-1384-1
  • Wall D, Kosmicki J, Deluca T, et al. Use of machine learning to shorten observation-based screening and diagnosis of autism. Transl Psychiatry. 2012;2:100. doi: 10.1038/tp.2012.10
  • Mueller A, Candrian G, Kropotov JD, et al. Classification of ADHD patients on the basis of independent ERP components using a machine learning system. Nonlinear Biomed Phys. 2010;4:1. doi: 10.1186/1753-4631-4-S1-S1
  • Bedi G, Carrillo F, Cecchi GA, et al. Automated analysis of free speech predicts psychosis onset in high-risk youths. NPJ Schizophr. 2015;1:15030. doi: 10.1038/npjschz.2015.30
  • Bishop- Fitzpatrick et al. Using machine learning to identify patterns of lifetime health problems in decedents with autism spectrum disorder. Autism Res. 2018;11(8):1120–1128. doi: 10.1002/aur.1960
  • Krug DA, Arick J, Almond P. Behavior checklist for identifying severely handicapped individuals with high levels of autistic behavior. J Child Psychol Psychiatry. 1980;21:221–229. doi: 10.1111/j.1469-7610.1980.tb01797.x
  • Irmak TY, Sütçü ST, Aydın A, et al. An investigation of validity and reliabilty of Autism Behavior Checklist (ABC). Turk J Child Adolesc Ment Health. 2007;14:13–23.
  • Aman MG, Singh NN, Stewart AW, et al. The aberrant behavior checklist: a behavior rating scale for the assessment of treatment effects. Am J Ment Defic. 1985;89:485–491.
  • Brown EC, Aman MG, Havercamp SM. Factor analysis and norms for parent ratings on the aberrant behavior checklist-community for young people in special education. Res Dev Disabil. 2002;23:45–60. doi: 10.1016/S0891-4222(01)00091-9
  • Sucuoglu B. The psychometric characteristics of the Turkish form of the Aberrant Behavior Checklist. Turk J Psychol. 2003;18: 93–96.
  • Karabekiroglu K, Aman MG. Validity of the aberrant behavior checklist in a clinical sample of toddlers. Child Psychiatry Hum Dev. 2009;40:99–110. doi: 10.1007/s10578-008-0108-7
  • Rickels K, Howard K, Guy W. Clinical global impressions. ECDEU Assessment Manual for Psychopharmacology/Rev ed US Government Printing Office. 1976: 218–222.
  • Farr S. Free licenses for All RapidMiner products: machine learning without coding. C2C Digital Mag. 2017;1:19.
  • Song Y-Y, Ying L. Decision tree methods: applications for classification and prediction. Shanghai Arch Psychiatry. 2015;27:13024.
  • Gibbs V, Aldridge F, Chandler F, et al. Brief report: an exploratory study comparing diagnostic outcomes for autism spectrum disorders under DSM-IV-TR with the proposed DSM-5 revision. J Autism Dev Disord. 2012;42:1750–1756. doi: 10.1007/s10803-012-1560-6
  • Salazar F, Baird G, Chandler S, et al. Co-occurring psychiatric disorders in preschool and elementary school-aged children with autism spectrum disorder. J Autism Dev Disord. 2015;45(8):2283–2294. doi: 10.1007/s10803-015-2361-5
  • Nijmeijer JS, Minderaa RB, Buitelaar JK, et al. Attention-deficit/hyperactivity disorder and social dysfunctioning. Clin Psychol Rev. 2008;28:692–708. doi: 10.1016/j.cpr.2007.10.003
  • Darrou C, Pry R, Pernon E, et al. Outcome of young children with autism: does the amount of intervention influence developmental trajectories? Autism. 2010;14:663–677. doi: 10.1177/1362361310374156
  • Hirschberger RG, Kuban KC, O’Shea TM, et al. Co-occurrence and severity of neurodevelopmental burden (cognitive impairment, cerebral palsy, autism spectrum disorder, and epilepsy) at age ten years in children born extremely preterm. Pediatr Neurol. 2018;79:45–52. doi: 10.1016/j.pediatrneurol.2017.11.002
  • Reilly C, Atkinson P, Das KB, et al. Neurobehavioral comorbidities in children with active epilepsy: a population-based study. Pediatrics. 2013;3787.
  • Hultman C, Sandin S, Levine S, et al. Advancing paternal age and risk of autism: new evidence from a population-based study and a meta-analysis of epidemiological studies. Mol Psychiatry. 2011;16:1203. doi: 10.1038/mp.2010.121
  • Rubenstein E, Durkin MS, Harrington RA, et al. Relationship between advanced maternal Age and timing of first developmental evaluation in children with autism. J Dev Behav Pediatr. 2018.
  • Schendel D, Bhasin TK. Birth weight and gestational age characteristics of children with autism, including a comparison with other developmental disabilities. Pediatrics. 2008;121:1155–1164. doi: 10.1542/peds.2007-1049
  • Indredavik M, Vik T, Heyerdahl S, et al. Psychiatric symptoms and disorders in adolescents with low birth weight. Arch Dis Child Fetal Neonatal. 2004;89:445–F50. doi: 10.1136/adc.2003.038943
  • Indredavik MS, Vik T, Heyerdahl S, et al. Psychiatric symptoms in low birth weight adolescents, assessed by screening questionnaires. Eur Child Adolesc Psychiatry. 2005;14:226–236. doi: 10.1007/s00787-005-0459-6