897
Views
26
CrossRef citations to date
0
Altmetric
Original Articles

Bayesian Inference for Growth Mixture Models with Latent Class Dependent Missing Data

, &
Pages 567-597 | Published online: 08 Aug 2011
 

Abstract

Growth mixture models (GMMs) with nonignorable missing data have drawn increasing attention in research communities but have not been fully studied. The goal of this article is to propose and to evaluate a Bayesian method to estimate the GMMs with latent class dependent missing data. An extended GMM is first presented in which class probabilities depend on some observed explanatory variables and data missingness depends on both the explanatory variables and a latent class variable. A full Bayesian method is then proposed to estimate the model. Through the data augmentation method, conditional posterior distributions for all model parameters and missing data are obtained. A Gibbs sampling procedure is then used to generate Markov chains of model parameters for statistical inference. The application of the model and the method is first demonstrated through the analysis of mathematical ability growth data from the National Longitudinal Survey of Youth 1997 (Bureau of Labor Statistics, U.S. Department of Labor, 1997). A simulation study considering 3 main factors (the sample size, the class probability, and the missing data mechanism) is then conducted and the results show that the proposed Bayesian estimation approach performs very well under the studied conditions. Finally, some implications of this study, including the misspecified missingness mechanism, the sample size, the sensitivity of the model, the number of latent classes, the model comparison, and the future directions of the approach, are discussed.

Notes

1Throughout the article, MNn denotes an n-dimensional multivariate normal distribution.

2Here we have two probabilities that need to be distinguished. The class probability π k is a class-specific population parameter in the model, whereas the posthoc posterior probability is an individual variable that is computed for each individual once model parameters have been estimated.

3Note that this is only one way to specify a regression model for categorical variables.

4Specifically, suppose for each k (k = 1, 2, …, K) there exists an underlying continuous random variable cik *, which follows a normal distribution with mean ϕ k0 + x i ′ ϕ k1 and variance 1,

where ei ~ N(0, 1). The outcome yi comes from the first k classes when cik * is positive. In other words,

5With 10,000 burn-ins, the Markov chains for all parameters converged.

6This method tests the convergence of Markov chain by comparing the means of two subsets of the chain.

7With 70,000 iterations, the ratio of MCse/sd is less than 0.05 for all parameters, which indicates that the estimates are accurate. An example of inaccurate estimates obtained with 2,000 burn-ins and 5,000 iterations can be found on our website: (http://nd.psychstat.org/research/luzhanglubke2010) for comparison.

a The significance of parameter estimates can be judged based on the confidence intervals. If zero is included in the interval, then the parameter estimate is not significantly different from zero.

b The growth curve parameters for Class 1. Specifically, β[1]: initial level; β[2]: slope; Ψ[11]: variance of initial level; Ψ[22]: variance of slope; Ψ[12]: covariance of initial level and slope; φ: variance of error.

c The growth curve parameters for Class 2.

d The probit parameters of class proportion as in Equation (6).

e The probit parameters of missing data rate. Note that although the γ0t * and γ1t * here are different with the γ zt 1 and γ zt 2 in Equation (9), they are equivalent after reparameterizing γ0t * = γ zt 1 and γ1t * = γ zt 2 – γ zt 1 .

8To be consistent with the real data analysis, γ zt 1 and γ zt 2 are reparameterized as γ0t * and γ1t * with γ0t * = γ zt 1 and γ1t * = γ zt 2 – γ zt 1 .

a The average absolute relative bias across all model parameters, defined by |Bias.rel| = Σ j=1 p |Bias.rel j |p.

b The average absolute difference between the empirical SDs and the average Bayesian SDs across all model parameters, defined by |SD.diff| = Σ j=1 p |SD.emp j – SD.avg j |p.

c The average coverage probability across all model parameters, defined by HPD.cvr = Σ j=1 p HPD.cvr j /p.

d With a sample size of 500, the convergence rate under unequal classes and MNAR missingness is 100/147 ≈ 67%. MNAR = missing not at random; MCAR = missing completely at random.

a Modeling GMM and latent class dependent missingness.

b Modeling GMM only, ignore the missingness mechanism.

c GMM with latent class dependent missing data.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.