848
Views
58
CrossRef citations to date
0
Altmetric
Original Articles

Evaluation of Structural Equation Mixture Models: Parameter Estimates and Correct Class Assignment

&
Pages 165-192 | Published online: 19 Apr 2010
 

Abstract

Structural equation mixture models (SEMMs) are latent class models that permit the estimation of a structural equation model within each class. Fitting SEMMs is illustrated using data from 1 wave of the Notre Dame Longitudinal Study of Aging. Based on the model used in the illustration, SEMM parameter estimation and correct class assignment are investigated in a large-scale simulation study. Design factors of the simulation study are (im)balanced class proportions, (im)balanced factor variances, sample size, and class separation. We compare the fit of models with correct and misspecified within-class structural relations. In addition, we investigate the potential to fit SEMMs with binary indicators. The structure of within-class distributions can be recovered under a wide variety of conditions, indicating the general potential and flexibility of SEMMs to test complex within-class models. Correct class assignment is limited.

Notes

1SEMM performance should first be evaluated at a single time point before combining time points in a more complex model fitted to longitudinal data. Extension to structural equation growth mixture modeling is currently being evaluated.

2Item selection procedures in the context of mixture models are currently being investigated by the authors.

3Researchers intending to fit SEMMs must set aside the necessary computational resources, as models with categorical indicators could take considerable computation time.

4Entropy has been suggested as a measure of classification accuracy when true class membership is unknown. However, entropy can perform poorly in the context of FMMs (CitationLubke & Muthén, 2007).

5The multivariate Mahalanobis distance between two classes is given as MD = CitationAnderson and Bahadur (1962) provide a distance measure which is more accurate when class covariance matrices are unequal. In the current work, the MD are approximately equal to the Anderson and Bahadur measure in most conditions, and results for those conditions in which this is not true are marked by empty cells in the results table.

6In this study, error variance was held constant across all conditions, with the side effect that not only factor variances but also item reliabilities differ when comparing balanced and unbalanced factor variance conditions. However, CitationLubke and Muthén (2007) found that comparable differences in item reliabilities only had minor effects on FMM performance.

7Whereas different class proportions do not influence class separation, factor variances do influence class separation. The mean of Factor 1 in Class 1 E(F 1C1) is adjusted across (im)balanced factor variance conditions such that class-invariant and class-specific factor variances are not confounded with class separation as measured by the multivariate Mahalanobis distance.

8Formulas for the BIC and saBIC are given in the Appendix.

9For example, in unbalanced factor variance conditions, the empirical standard errors are much larger than average standard errors, which could indicate an insufficient number of data sets within condition.

10An insufficient number of starting values can also lead to nonconvergence. In empirical analyses, the number of starting values can be increased as needed. However, increasing the number of starting values will not compensate for model misspecification or model nonidentification.

11When reporting on parameter estimates in a mixture simulation study, the issue of class switching must be addressed. Because class labeling is arbitrary, there can be fitted models in which Class 1 subjects were labeled Class 2 subjects and vice versa. This creates problems when averaging a parameter across data sets within a condition. When the within-class factor models are the same, switching can be corrected by relabeling the class-specific parameters prior to summarizing. In this study, this was not possible for the SEMM123 model but is feasible for the unstructured FMM. For the conditions examined herein, switching was not problematic except in conditions where convergence rates were low enough to preclude serious interpretation of parameter estimates as discussed at the end of the section on convergence rates.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.