2,653
Views
63
CrossRef citations to date
0
Altmetric
Articles

Model Selection in Finite Mixture Models: A k-Fold Cross-Validation Approach

Pages 246-256 | Published online: 05 Dec 2016
 

Abstract

Finite mixture models, whether latent class models, growth mixture models, latent profile models, or factor mixture models, have become an important statistical tool in social science research. One of the biggest and most debated challenges in mixture modeling is the evaluation of model fit and model comparison. In the application of mixture models, researchers often fit a collection of models and then decide on a single optimal model based on a variety of model fit information. We propose a k-fold cross-validation procedure to model selection whereby the model is repeatedly fit to k1 different partitions of the data set, the resulting model is then applied to kth partition of the sample, and the distribution of fit indexes is examined. This method is illustrated with growth mixture models fit to longitudinal data on reading ability collected as part of the Early Childhood Longitudinal Study–Kindergarten Cohort.

FUNDING

This work was supported by National Science Foundation Grant REAL-1252463 awarded to the University of Virginia, David Grissmer (Principal Investigator), and Christopher Hulleman (Co-Principal Investigator).

Notes

1 Despite some debate regarding the results presented by Lo et al. (Citation2001; see Jeffries, Citation2003), we report the VLMR LRT and aLMR LRT because they remain widely used in the mixture modeling literature.

2 Although a test sample size of 5 (1% of N = 496) might seem too small when performing 100-fold cross-validation, no parameters are estimated when the model is applied to the test sample. The test sample only needs to consist of at least one participant (essentially inserting the participant’s data into xi in Equation 1 and calculating the 2LL using the model-implied mean and covariance structure when the model was estimated using the training sample), which occurs when performing leave-one-out cross-validation (i.e., k-fold cross-validation with k = N).

Additional information

Funding

This work was supported by National Science Foundation Grant REAL-1252463 awarded to the University of Virginia, David Grissmer (Principal Investigator), and Christopher Hulleman (Co-Principal Investigator).

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 412.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.