960
Views
15
CrossRef citations to date
0
Altmetric
Applications and Case Studies

Variable Selection for Skewed Model-Based Clustering: Application to the Identification of Novel Sleep Phenotypes

, ORCID Icon, , ORCID Icon &
Pages 95-110 | Received 01 Aug 2015, Published online: 16 May 2018
 

ABSTRACT

In sleep research, applying finite mixture models to sleep characteristics captured through multiple data types, including self-reported sleep diary, a wrist monitor capturing movement (actigraphy), and brain waves (polysomnography), may suggest new phenotypes that reflect underlying disease mechanisms. However, a direct mixture model application is challenging because there are many sleep variables from which to choose, and sleep variables are often highly skewed even in homogenous samples. Moreover, previous sleep research findings indicate that some of the most clinically interesting solutions will be those that incorporate all three data types. Thus, we present two novel skewed variable selection algorithms based on the multivariate skew normal (MSN) distribution: one that selects the best set of variables ignoring data type and another that embraces the exploratory nature of clustering and suggests multiple statistically plausible sets of variables that each incorporate all data types. Through a simulation study, we empirically compare our approach with other asymmetric and normal dimension reduction strategies for clustering. Finally, we demonstrate our methods using a sample of older adults with and without insomnia. The proposed MSN-based variable selection algorithm appears to be suitable for both MSN and multivariate normal cluster distributions, especially with moderate to large-sample sizes. Supplementary materials for this article are available online.

Supplementary Materials

The R Code is provided in the online supplementary materials.

Acknowledgments

Dr. Buysse reports receiving consultation fees from Bayer HealthCare, BeHealth Solutions, Cereve, Inc., CME Institute, CME Outfitters, Emmi Solutions, Medscape, and Merck; and grants from NIH, outside the submitted work. In addition, Dr. Buysse receives licensing fees (royalties) for the Pittsburgh Sleep Quality Index (PSQI), which is copyrighted by the University of Pittsburgh.

Additional information

Funding

This work was supported by National Institute of Health grants: K01MH096944, AG020677, RR024153.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 343.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.