4,416
Views
14
CrossRef citations to date
0
Altmetric
Evidence Base Update

Statistical Approaches for Highly Skewed Data: Evaluating Relations between Maltreatment and Young Adults’ Non-Suicidal Self-injury

, &

Abstract

Objective

Clinical phenomena often feature skewed distributions with an overabundance of zeros. Unfortunately, empirical methods for dealing with this violation of distributional assumptions underlying regression are typically discussed in statistical journals with limited translation to applied researchers. Therefore, this investigation compared statistical approaches for addressing highly skewed data as applied to the evaluation of relations between child maltreatment and non-suicidal self-injury (NSSI). Method: College students (N = 2,651; 64.2% female; 85.2% nonwhite) completed the Child Abuse and Trauma Scale and the Functional Assessment of Self-Mutilation. Statistical models were applied to cross-sectional data to provide illustrative comparisons across predictions to a) raw, highly skewed NSSI outcomes, b) natural log, square-root, and inverse NSSI transformations to reduce skew, c) zero-inflated Poisson (ZIP) and negative-binomial zero-inflated (NBZI) regression models to account for both disproportionate zeros and skewness in the NSSI data, and d) the skew-t distribution to model NSSI skewness. Results: Child maltreatment was significantly and positively related to NSSI frequency in the raw, transformation, and zero-inflated models, but this relation was negative in the skew-t model. Conclusions: These findings highlight the importance of using zero-inflated models rather than transformation approaches to address data skew. Moreover, whereas the skew-t distribution has been used to model skewed non-clinical data, this study suggests that the skew-t approach may not be well-suited to address skewed clinical data.

Statistical regression is a useful and widely used tool to evaluate relations between a dependent variable and one or more explanatory variables given a set of controls (Cohen, Cohen, West, & Aiken, Citation2003). However, the accuracy of the obtained parameters rests on the degree to which the data adhere to specific assumptions relevant to the regression model itself and to the model errors (i.e., observed values minus values predicted from the true regression model for the population; Osborne & Waters, Citation2002; Williams, Grajales, & Kurkiewicz, Citation2013). At the model level, the dependent variable should be a linear function of the regression parameters, such that a graph of residuals (i.e., observed values minus values predicted by the estimated regression model; Weisberg, Citation2005) against predicted values of the dependent variable appears linear in form. At the level of errors, following estimation of the regression model, the residuals (as the best available proxy for errors) should feature a normal distribution and equivalent variance across all levels of the predictors (i.e., homoscedasticity; Cohen et al., Citation2003). Of the myriad response outcomes to which regression models have been applied across varied domains of psychological science (e.g., extraversion, achievement), clinical response outcomes often pose a special challenge because they tend to feature lower-base rates with an overabundance of zero values. Thus, the current paper drew on the clinical case of non-suicidal self-injury (NSSI; i.e., direct and deliberate damage of one’s body tissue without suicidal intent; Nock, Citation2010) to illustrate the application of varied statistical approaches for addressing the kinds of highly skewed data distributions that characterize many clinical phenomena.

The Phenomenology of NSSI

Over the past 20 years, NSSI has emerged as a prominent public health concern, particularly among young people ages 15–25 (Whitlock, Eckenrode, & Silverman, Citation2006). A recent review across 128 prevalence estimates in the extant literature confirms significant rates of NSSI among adolescents (17.2%), young adults (13.4%), and adults (5.5%; Swannell, Martin, Page, Hasking, & St John, Citation2014). Despite elevated rates of NSSI within some sub-populations (e.g., psychiatric inpatient adolescents; Nock & Prinstein, Citation2004), pooled prevalence estimates indicate that NSSI has a relatively low base rate, such that the number of individuals who do not endorse NSSI far exceeds the number that does (i.e., an overabundance of zeros). Moreover, a relatively small subset of individuals who engage in NSSI do so at a high frequency (i.e., the distribution is positively skewed; Benjet et al., Citation2017; Gillies et al., Citation2018).

In light of the unique distributional features of NSSI, researchers typically use robust methods (e.g., Thomassin, Shaffer, Madden, & Londino, Citation2016) or transformations to render their data more amenable to statistical regression in the service of identifying predictors of NSSI. Commonly used transformation methods in the NSSI literature include natural log (e.g., Midkiff, Lindsey, & Meadows, Citation2018; You, Zheng, Lin, & Leung, Citation2016), square-root (e.g., Boone & Brausch, Citation2016; Robertson, Miskey, Mitchell, & Nelson-Gray, Citation2013), and inverse (e.g., Andover & Gibb, Citation2010; Buser, Pertuit, & Muller, Citation2019) approaches, with researchers typically adding a value of one to all scores to remove zero-values that cannot be transformed using natural log or inverse transformation approaches. Despite these efforts, NSSI data often remain highly skewed after transformation approaches.

In light of the prevalence of NSSI, as well as its clinical relevance as a marker of significant psychological distress and impairment (Benjet et al., Citation2017; Klonsky & Olino, Citation2008), research that uses appropriate methods to address the highly skewed nature of NSSI data is needed to evaluate NSSI predictors that have been suggested in prior studies. The etiologic contribution of child maltreatment experiences to later NSSI has received consistent empirical support in previous studies (e.g., Kaess et al., Citation2013; Lang & Sharma-Patel, Citation2011; Liu, Scopelliti, Pittman, & Zamora, Citation2018; Serafini et al., Citation2017). However, as researchers continue to struggle with best analytic practices to adequately address the distributional skewness of NSSI, this investigation compared approaches to address the expected skew in NSSI outcomes by conducting a series of regression analyses to evaluate previously-documented associations between child maltreatment and NSSI.

Contemporary Approaches to Address Distributional Violations

Data skewness refers to a lack of symmetry in the data distribution (i.e., one tail being longer than the other), which biases the estimation of parameters in regression models (e.g., Abbott & Gutgesell, Citation1994; Godfrey & Orme, Citation1991). Skewness estimates capture the degree of this asymmetry with positive skewness reflecting an overabundance of low scores and a waning tail in the positive direction, and negative skewness reflecting an excess of high scores and a waxing tail from the negative direction (MacGillivray, Citation1986). As a result of their large data spreads (i.e., overdispersion), skewed distributions tend to feature larger variances than mean values. In turn, traditional regression models, which are fit by minimizing the sum of squared residuals, will lead to inaccurate parameter estimates and confidence intervals. In a series of simulated comparisons, Curran, West, and Finch (Citation1996) found that skewness beyond a maximum level of 2 yielded unreliable parameter estimates. However, recent researchers have suggested even more conservative limits (e.g., skewness <1; Driscoll et al., Citation2019; Morse, Klingman, Jacob, & Kodali, Citation2019) and encouraged the application of statistical approaches to mitigate the negative impact of data skewness on parameter estimation.

Data skewness approaches share the common goal of supporting the reliable estimation of statistical parameters when observed data violate distributional assumptions of normality. Transformation approaches attempt to reduce the level of skewness, while other approaches, such as Poisson regression and the skew-t distribution, directly model skewness in the data when estimating regression parameters.

Variable Transformation Approaches

Transforming observed data is the most common approach to address skewness (Hall, Citation1992), and has been widely used in prior research on NSSI (e.g., Boone & Brausch, Citation2016; Buser et al., Citation2019; Midkiff et al., Citation2018). There are multiple forms of nonlinear and linear transformation approaches, such as natural log, square-root, and inverse transformations. Moreover, multiple transformations can be used to obtain skewness values nearest to zero with the fewest outliers. Natural log, square-root, and inverse transformations are used when the observed variable is characterized by moderate, substantial, and severe skewness, respectively (Tabachnick & Fidell, Citation2001). Although transformation approaches can complicate the interpretation of parameter estimates because they change the scaling of the variable, reversing the transformation before interpretation can ameliorate this problem. Transformation approaches are also valuable because they support standardized model fit indices (e.g., RMSEA, SRMR) and effect size estimates (e.g., Cohen’s f2). Thus, Tabachnick and Fidell (Citation2001) recommend considering transformation in all skewness situations, unless there is a specific reason not to, such as when using non-parametric analyses.

Zero-Inflated Approaches

Zero-inflated models can be used to analyze data when there is an overabundance of zeros, which is the case for many clinical phenomena, and have been used in prior studies of NSSI (e.g., Allen, Fox, Schatten, & Hooley, Citation2019; Esposito, Bacchini, & Affuso, Citation2019; Kranzler et al., Citation2018; Vergara, Stewart, Cosby, Lincoln, & Auerbach, Citation2019). These models adopt a dual-process conceptualization of psychopathology wherein one process (or set of processes) is thought to predict disorder onset and a second process (or set of processes) is thought to account for disorder maintenance and/or severity (Lewinsohn, Allen, Seeley, & Gotlib, Citation1999; Yates, Citation2009). Thus, zero-inflated models yield two parameters, one that estimates the probability of whether or not an individual endorses symptomatology (e.g., 0 for non-injurers and 1 for injurers) using a logistic regression, and a second that estimates the degree to which the person expresses few or multiple symptoms (e.g., how many times a person has repeated NSSI) using either a Poisson or a negative binomial distribution.

Both zero-inflated Poisson (ZIP; Lambert, Citation1992) and negative binomial zero-inflated (NBZI; Minami, Lennert-Cody, Gao, & Román-Verdesoto, Citation2007) regression models evaluate covariates separately, yet simultaneously, to predict a) whether or not an individual expresses a symptom or engages in a behavior, and b) how many symptoms or how often behaviors are expressed. A ZIP model uses the Poisson distribution to account for the excess of zeros (Lambert, Citation1992), and to address some of the variation resulting from overdispersion in the outcome variable (i.e., when the variance of the variable is greater than its mean value). In contrast, the NBZI model pairs a Poisson approach with a multiplicative random effect to model overdispersion in the outcome variable (Gschlößl & Czado, Citation2008). The advantage of the NBZI model is that it can account for both the overabundance of zeros and for a greater level of overdispersion in the outcome than the ZIP model (Minami et al., Citation2007). Thus, when the outcome is overdispersed, the confidence intervals from the NBZI model are likely to be narrower than those from a ZIP model (Long, Citation1997). Despite their strengths, zero-inflated approaches require higher statistical power to detect effects than linear regressions (Dufour & Zung, Citation2005), and standardized model fit indices and estimates of effect sizes developed for linear regression analysis are not available (Muthén & Muthén, Citation1998–2017). Finally, the Monte Carlo integration required to estimate missing data in a Poisson-distributed outcome variable (Muthén & Muthén, Citation1998–2017), limits comparisons with approaches that use other missing data estimation methods (e.g., full-information maximum-likelihood).

Skew-t Distribution Approach

Although Azzalini (Citation1985) introduced novel approaches to address moderately skewed data (e.g., the skew-normal distribution) and highly skewed data (e.g., the skew-t distribution), analytic software has only recently advanced to the point that these techniques can be carried out in a reasonable time frame. Thus, the current investigation is the first to our knowledge to apply this approach in the modeling of NSSI.

The skew-t distribution accounts for skewness (and kurtosis) by including a skew scale or shaping parameter with each variable and a degree of freedom parameter for the entire multivariate distribution when calculating the regression estimates (Lee & McLachlan, Citation2013). Theoretically, the multivariate skew-t distribution will equal the t distribution when there is no skewness in the data, and the degrees of freedom approach infinity (Asparouhov & Muthén, Citation2015). The most common method of parameter estimation using this distribution is maximum likelihood (Asparouhov & Muthén, Citation2015). A significant advantage of the skew-t approach is that skewness is modeled using predictors rather than removed by transforming variables (Muthén & Muthén, Citation1998–2017).

Summary

Individual approaches for dealing with data skewness have been presented in statistical journals, which has limited their uptake among applied researchers (e.g., Adarabioyo & Ipinyomi, Citation2019; Lee, Karrison, Nocon, & Huang, Citation2018; Tüzen & Erbaş, Citation2018). Thus, the current investigation sought to illustrate and compare these approaches as applied to a contemporary clinical issue, namely NSSI. As reviewed earlier, studies demonstrate that NSSI data are highly skewed such that most people do not engage in NSSI, a sizable minority of individuals have on at least one occasion, and a smaller number have done so on many occasions (Benjet et al., Citation2017; Gillies et al., Citation2018). Researchers studying NSSI have employed different statistical techniques to deal with the skewness of NSSI, including transformation approaches (e.g., Boone & Brausch, Citation2016; Buser et al., Citation2019; Midkiff et al., Citation2018) and zero-inflated approaches that address the overabundance of non-injurers in most samples using either Poisson (e.g., Esposito et al., Citation2019; Fox et al., Citation2018; Kranzler et al., Citation2018; Yates, Tracy, et al., Citation2008; You & Leung, Citation2012) or negative binomial distributions (e.g., Allen et al., Citation2019; Glenn, Kleiman, Cha, Nock, & Prinstein, Citation2016; Schoenleber, Berenbaum, & Motl, Citation2014; Vergara et al., Citation2019). However, clinical researchers generally, and those focused on NSSI in particular, have not compared different approaches for dealing with distributional violations of the normality assumption. Thus, the overarching goal of this investigation was to provide a novel comparison of multiple statistical approaches for addressing the distributional features of NSSI as applied to the evaluation of theoretically- and empirically- supported relations between child maltreatment and NSSI.

The Current Study

This investigation evaluated raw, transformation, zero-inflated, and skew-t distribution methods for dealing with skewed data when examining the relation between college students’ self-reported histories of child maltreatment and NSSI. We compared multiple approaches for addressing the skewed distributional features of NSSI, including a) a natural log transformation of NSSI, b) a square-root transformation of NSSI, c) an inverse transformation of NSSI, d) a ZIP regression model, e) a NBZI regression model, and f) a skew-t distribution. Each approach for addressing the skewness of NSSI was evaluated with reference to the regression model results using observed (i.e., raw) NSSI data. Given the aforementioned variability in missing data estimation constraints across these varied approaches (e.g., Monte Carlo integration vs. full-information maximum-likelihood), predicted values were correlated with the raw NSSI variable to assess which approach most appropriately captured the expression of NSSI in this large and diverse college-student sample. Additional comparisons across effect size estimates (when available) and residual value distributions were also considered in evaluating these approaches.

All analyses controlled for youth gender, age, and ethnicity-race. Relative fit indices were evaluated to select the best-fitting model within each approach. This model was used to estimate predicted values of NSSI, which were then correlated with the raw NSSI scores and used to calculate residual values for further comparison. All predicted NSSI outcomes from these approaches were expected to correlate more strongly with the observed NSSI variable than with estimates using the raw data. However, we expected that zero-inflated models would yield the most accurate NSSI estimates because they can address the abundance of zeros that are expected in NSSI data.

METHOD

Participants

The present sample included 2,651 undergraduate students (Mage = 19.005, SD = 1.196) at a large University in the Western United States. The sample was predominantly female (64.2%), and nearly half (46.8%) the respondents were of Asian descent, with the remaining students identifying as Hispanic (29.0%), White (14.8%), Black (6.5%), or Multiracial/Other (3%). Unfortunately, data regarding specific countries of origin were not available.

Procedures

Participants were recruited from introductory psychology courses and received class credit in exchange for their completion of a 2-hour computerized survey. Informed consent was administered verbally and in writing, including specific reference to the potentially triggering nature of the questions posed, as well as confidentiality limitations pertinent to mandated reporting concerns. Participants completed the survey in small groups of up to 14 students using private cubicles in a laboratory setting under the supervision of a trained research assistant. Password-protected surveys were administered through a computerized survey management company. Responses were carefully monitored for completeness and the information was encrypted (until download) and identified by a code number to further ensure data security. Participants were informed that the purpose of the study was to examine relations between adaptation in young adulthood and various experiences in childhood and adolescence, including difficult life events. They were assured their participation was anonymous and no personally identifying information was collected. Participants were required to stay for the full 2-hour survey period to minimize the incentive to rush through the questionnaires. All participants received printed information and resource referrals for both campus- and community-based health and mental health support services. These procedures were reviewed and approved by the Human Research Review Board of the participating university.

Measures

Non-Suicidal Self-Injury (NSSI)

Participants completed the Functional Assessment of Self-Mutilation (FASM; Lloyd, Kelley, & Hope, Citation1997) to assess rates and methods of NSSI during the 12 months preceding the time of data collection. The reliability and validity of the FASM has been established across several studies (e.g., Guertin, Lloyd-Richardson, Spirito, Donaldson, & Boergers, Citation2001; Nock & Prinstein, Citation2004, Citation2005). Participants reported whether and how often they engaged in each of 11 different forms of NSSI behavior, including cutting or carving skin, picking at a wound, self-hitting, scraping skin to bleed, self-biting, picking areas of the body to bleed, inserting objects under skin or nails, self-tattooing, burning skin, pulling out hair, or erasing skin to bleed in the past 12 months. Participants rated the frequency of each form of NSSI on a 7-point scale that ranged across 0 (0 times), 1 (1 time), 2 (2 times), 3 (3–5 times), 4 (6–10 times), 5 (11–20 times), and 6 (>20 times). For these analyses, NSSI was operationally defined as the sum of cutting and burning behaviors, which is consistent with prior studies and recent measures of NSSI (e.g., the Self-Injurious Thoughts and Behaviors Interview; Nock, Holmberg, Photos, & Michel, Citation2007) that emphasize cutting and burning because they are most clearly identified as NSSI, are most commonly endorsed, and involve direct tissue damage.

Child Maltreatment

The Child Abuse and Trauma Scale (CATS) includes 38 items that assess the frequency and extent of child maltreatment experiences (Sanders & Becker-Lausen, Citation1995; e.g., “Did your parents ever hit or beat you when you did not expect it?” “Did your relationship with your parents ever involve a sexual experience?”). Items were rated on a five-point scale from 0 (never) to 4 (always) and composited to yield a total maltreatment score (α = .916). The CATS evidences concurrent validity with objective measures of child maltreatment and acceptable reliability in college student populations (Sanders & Becker-Lausen, Citation1995).

Data Analytic Plan

Analysis Software

Data preparation was conducted utilizing IMB SPSS 26 to evaluate descriptive statistics, perform transformations (i.e., square-root, natural log, and inverse), and evaluate Little’s MCAR test (Citation1988). All regression models were evaluated using MPlus 8.3 (Muthén & Muthén, Citation1998–2017).

Data Preparation

Prior to analyses, all predictors and continuous covariates were evaluated for violations of normality assumptions. Participant age was square-root transformed, since it was the only exogenous variable that surpassed conservative recommended levels of skewness (i.e., >1; Tabachnick & Fidell, Citation2001).Footnote1 depicts pre- and post-transformation estimates of participant’s age and NSSI, as well as skewness, means, and standard deviations for all study variables. Outliers were evaluated using a multiple regression to obtain Mahalanobis distance scores.Footnote2 Six scores with p < .001, which were derived from leverage scores, emerged as multivariate outliers and were deleted from all analyses.Footnote3

TABLE 1 Means, Standard Deviations, Skewness, and Kurtosis for Study Variables

Data Analysis and Missingness

Data were missing on age (3.2%), child maltreatment (5.1%), and NSSI (4.7%). Missing data were handled using full-information maximum likelihood (FIML) estimation as supported by Little’s (Citation1988) test, χ2 (27) = 27.409, p = .442, which indicated the data met criteria for missing completely at random (MCAR). A multivariate analysis of variance (MANOVA) evaluated mean differences in age, child maltreatment, and NSSI by participant gender, ethnicity-race, and their interaction.

Model Evaluation

All variables were centered to reduce multicollinearity, since saturated models include all predictors and the interactions among covariates. Bias-corrected bootstrapped confidence intervals (CIs) across 10,000 resamples evaluated relations between child maltreatment and NSSI while holding participant age (square-root transformed), gender, and ethnicity-race constant (MacKinnon, Lockwood, & Williams, Citation2004).

Model Fitting Approach 1

Model fit was assessed sequentially using the criteria set forth by Hu and Bentler (Citation1999) as informed by model fit chi-squares, the comparative fit index (CFI > .90), the Tucker-Lewis Index (TLI > .90), the standardized root mean square residual (SRMR < .08), and the root mean square error of approximation (RMSEA < .08). Failure to meet criteria on one or more fit indices was indicative of poor fit.

Model Fitting Approach 2

Information measures assessed model fit when the above fit indices were unavailable. These included Akaiake’s information criterion (AIC), the Bayesian information criterion (BIC), which allows for comparison of nested and non-nested models (Long, Citation1997), and the sample size adjusted BIC (BICSSA). summarizes the resampling technique, missing data estimation, and model fitting for all model approaches.

TABLE 2 Resampling, Missing Data, and Model Fitting by Model Approaches

Model Comparisons

Traditional methods (e.g., chi-square differences) were not available to compare all methods; therefore, outputted predicted values were correlated against the raw NSSI values to assess the predictive strength of each approach, and to evaluate the distributional pattern of residual values.

Model Approaches

The regression model evaluated the effect of child maltreatment (i.e., CATS Scores) on NSSI, while holding age (square-root transformed), gender, and ethnicity-race constant. All models included interactions among covariates (not shown) and used the same general equation: NSSIˆ=β0+β1CATS+ β2SQRT_AGE+β3GENDER+ β4RACE

Raw Regression Approach (A)

The raw NSSI variable was regressed on the predictors to evaluate the correlation of predicted values of NSSI to actual NSSI.

Transformation Approaches (B)

NSSI was transformed with the natural log (i.e., ln[NSSI + 1], Model B.1), square-root (i.e., SQRT[NSSI + 1], Model B.2), and inverse (i.e., 1/NSSI + 1, Model B.3) approaches. The inverse transformation was then reversed to preserve the direction of the original scale. See for pre- and post-transformation values of NSSI.

Zero-Inflated Regression Model Approaches (C)

The ZIP (Model C.1) and NBZI (Model C.2) regression approaches used the same general equation to evaluate the likelihood of NSSI endorsement (NSSI#1)ˆ and the frequency of NSSI endorsed (NSSI0ˆ). The ZIP model used a logit and Poisson distribution for NSSI, and the NBZI model used a negative binomial distribution for NSSI.

Skew-t Distribution Approach (D)

The skew-t distribution approach adds a skew scaling weight when estimating regression parameters using the same general equation to evaluate NSSI.

RESULTS

Descriptive & Bivariate Analyses

Out of the 2,651 participants, 16.3% reported having engaged in one or more episodes of NSSI during the past 12 months. A multivariate analysis of variance (MANOVA) evaluated mean differences in age, child maltreatment, and NSSI as a function of participant gender, ethnicity-race, and their interaction (See ). There were significant main effects for participant gender (Wilks’ λ = .985, p < .001) and ethnicity-race (Wilks’ λ = .970, p < .001), as well as for their interaction (Wilks’ λ = .987, p < .001). Between-subjects effects revealed significant gender differences in participant age, child maltreatment, and NSSI, with males being older than females, and females endorsing higher rates of child maltreatment and NSSI than males. Likewise, there were significant differences in all study variables across ethnic-racial groups. Bonferonni-corrected post-hoc tests indicated that White participants were older than Hispanic respondents, Hispanic and multiracial participants endorsed higher rates of child maltreatment than both White and Black respondents, and Asian participants reported lower levels of NSSI than White, Hispanic, and multiracial respondents. There was a significant gender*ethnicity-race interaction for maltreatment, such that females endorsed higher rates of maltreatment in all ethnic-racial groups, except among Black respondents where males endorsed higher rates than females. There was also a significant gender*ethnicity-race interaction for NSSI, such that females endorsed higher rates of NSSI than males among all ethnic-racial groups, except among Whites where rates did not differ significantly by gender. At the bivariate level, NSSI was positively related to child maltreatment (r = .239, p < .001), but age was not significantly related to child maltreatment or NSSI.

TABLE 3 Descriptive Data for Study Variables by Participant Gender and Ethnicity-Race

Model Fitting

and show successive model fitting indices for all models. Estimates for both approaches were presented when available. Across all models, age tended to evidence nonsignificant effects and covariances. Models ending in 0 represent saturated models (i.e., all predictors and correlations among covariates are included) and models ending in F represent final best fitting models. The best fitting model for each approach was used to output predicted NSSI values.

TABLE 4 Comparison of Model Fit Indices of Raw and Transformation Approaches with 10,000 Bootstrap/Montecarlo Resamples

TABLE 5 Comparison of Model Fit Indices of Zero-Inflated and Skew-t Distribution Approaches with 10,000 Bootstrap/Montecarlo Resamples

Regression Model Analyses

A regression model evaluated the effect of child maltreatment on NSSI while holding age (square-root transformed), gender, and ethnicity-race constant. provides estimates, standardized estimates, and model fit indices for the final model (A) of the raw data approach. The raw data regression model evidenced good fit (χ2 [3] = 3.693, p = .297; RMSEA = .009, 90% CI [.000–.035], p = .998; CFI = .997; TLI = .993; SRMR = .009), and child maltreatment was related to higher NSSI (b = 1.069, β = .247, p < .001).

TABLE 6 Estimates, Standardized Estimates, and Model Fit Indices for the Final Models of Raw and Transformation Approaches

Transformation Approaches (Models B.1-B.3)

presents estimates, standardized estimates, and model fit indices for the final model of each transformation approach.

Natural Log Transformation (B.1)

The natural log NSSI regression model evidenced good fit (χ2 [3] = 3.562, p = .313; RMSEA = .009, 90% CI [.000–.035], p = .998; CFI = .998; TLI = .994; SRMR = .009), and child maltreatment was related to higher NSSI (b = .847, β = 1.865, p < .001).

Square-Root Transformation (B.2)

The square-root NSSI regression model evidenced good fit (χ2 [3] = 3.618, p = .306; RMSEA = .009, 90% CI [.000–.035], p = .998; CFI = .997; TLI = .994; SRMR = .009), and child maltreatment was related to higher NSSI (b = .461, β = 4.247, p < .001).

Inverse Transformation (B.3)

The inverse NSSI regression model evidenced good fit (χ2 [3] = 3.614, p = .306; RMSEA = .009, 90% CI [.000–.035], p = .998; CFI = .997; TLI = .994; SRMR = .009), and child maltreatment was related to higher NSSI (b = .717, β = 9.159, p < .001).

Zero-Inflated Approaches (Models C.1 and C.2)

provides estimates, standardized estimates, and model fit indices for the final model of the zero-inflated approaches.

TABLE 7 Estimates, Standardized Estimates, and Model Fit Indices for the Final Models of the Zero-Inflated and Skew-t Distribution Approaches

Zero-Inflated Poisson Regression (ZIP) Approach (C.1)

The ZIP regression model indicated that child maltreatment was associated with a decreased probability of endorsing NSSI (b#1 = − .499, β#1 = − .082, p < .001) and with greater frequency of NSSI when it was endorsed (b = .542, β = .851, p < .001).

Negative Binomial Zero-Inflated (NBZI) Approach (C.2)

The NBZI regression model indicated that child maltreatment was associated with an increased probability of endorsing NSSI (b#1 = 2.046, β#1 = .126, p < .001) and with greater frequency of NSSI when it was endorsed (b = .529, β = .846, p < .001).

Skew-t Distribution Approach (D)

The Skew-t regression model indicated that child maltreatment was related to lower NSSI (b = − .007, β = − .005, p < .001; see ).

Model Comparisons

Effect size estimates for the raw and transformation regression models can be found in . Across these models, the relation between child maltreatment and NSSI was generally moderate in magnitude (Cohen, Citation1988) suggesting that, consistent with prior work (e.g., Kaess et al., Citation2013; Lang & Sharma-Patel, Citation2011; Liu et al., Citation2018; Serafini et al., Citation2017), child maltreatment emerged as a robust predictor of NSSI in this sample. The skewness and kurtosis of residual values for each of the models are reported in and . Although there was significant non-normality across residuals for all models, as confirmed by significant Shapiro-Wilk Normality Test estimates for all models, the zero-inflated models evidenced the lowest levels of skew and kurtosis, which suggests these approaches were best able to address the distributional features of NSSI.

shows the correlations of predicted values with observed values of NSSI across all models. Differences among correlations were evaluated using the Fisher Z transformation. The correlations between NSSI and the predicted values of NSSI across all transformation models did not differ significantly from the raw data regression model, and the transformation approaches did not differ significantly from each other (all r values = .272, p < .001; all Fisher Z values = .984). The ZIP (r = .833, p < .001) and NBZI (r = .832, p < .001) models evidenced the strongest relations between reported NSSI and predicted values and these relations significantly differed from all transformation approaches (both Fisher Z values = − 32.285), but not from each other (Fisher Z = .100). The skew-t distribution showed the weakest relation between NSSI and the predicted values (r = .246, p < .001). This value was lower than those obtained from the zero-inflated approaches (Fisher Z values = 33.190_ZIP and 33.080_NBZI), but did not differ from the transformation models (all Fisher Z values = 0.999).

TABLE 8 Correlations among Actual NSSI and Predicted Values Across all Approaches

DISCUSSION

The present investigation sought to illustrate the application of varied statistical techniques for addressing highly skewed clinical data as applied to the evaluation of theoretically-specified and empirically-validated associations between child maltreatment and NSSI. As in prior studies (Guertin et al., Citation2001; Lloyd et al., Citation1997, April; Nock & Prinstein, Citation2005; Yates, Tracy, et al., Citation2008), NSSI emerged as a highly-skewed variable with an overabundance of zeros. Researchers have employed various procedures to address this distributional pattern, such as transforming the variable to reach acceptable levels (e.g., Afifi, Kotlerman, Ettner, & Cowan, Citation2007; Andover & Gibb, Citation2010; Robertson et al., Citation2013; Yates, Carlson, et al., Citation2008; You et al., Citation2016), and, less often, using zero-inflated approaches to model the overabundance of zeros as distinct from the skewed frequency of NSSI using either Poisson (Yates, Tracy, et al., Citation2008) or negative binomial models (e.g., Allen et al., Citation2019; Glenn et al., Citation2016; Schoenleber et al., Citation2014).

Across raw and transformation approaches, child maltreatment evidenced moderately strong relations with higher NSSI. Likewise, both zero-inflated models yielded significant and positive relations between child maltreatment and NSSI frequency among respondents who endorsed NSSI. However, the ZIP model yielded a negative, albeit small, relation between child maltreatment and the probability of NSSI endorsement, whereas the NBZI model supported a positive relation between child maltreatment and the likelihood of endorsing NSSI. Similarly, the skew-t model showed a small, but significant negative relation between child maltreatment and NSSI. The negative associations between child maltreatment and NSSI obtained from the ZIP and skew-t model were surprising. Relative to the NBZI correction, which can account for clinical phenomena where there is an increase in the distribution of frequencies at the tail end of the distribution, the ZIP approach may be less able to accommodate subsequent spikes yielding the appearance of a negative association between child maltreatment and the likelihood of endorsing NSSI. With regard to the skew-t distribution approach, additional research is needed to apply this method with both zero-inflated and non-zero-inflated skewed data to further understand if and how an excess of zeros might alter the performance of the skew-t distribution approach.

Across approaches, the predicted values of the ZIP and NBZI models evidenced the strongest relations with raw NSSI scores, whereas the skew-t distribution model predicted values with the weakest correlation to the observed data. This pattern was echoed in our evaluation of the residual values for each of the approaches. The superiority of the zero-inflated approaches may indicate that accounting for the overabundance of zeros is most important when distributions feature this type of skewed data. However, as noted earlier, there may be risks to approaches that fail to fully correct for overdispersion of outcome data in the ZIP model. Likewise, the nonnormality of the residual values from the zero-inflated approaches points to the need for ongoing efforts to develop and evaluate statistical approaches for dealing with highly skewed data.

Strengths & Limitations

The current study drew on a strong foundation of theoretical and empirical literature, as well as a large and diverse sample of participants who provided real-world NSSI data with unique skewed features (i.e., zero-inflated and overdispersed), to provide clinical researchers with a comparative evaluation of different methods for addressing highly skewed data in their statistical analyses. However, despite these strengths, the current study featured several limitations that qualify the interpretation of the data while highlighting valuable directions for future research.

First, although NSSI is most prevalent among late adolescents and young adults (Whitlock et al., Citation2006), college students from a public university in the western United states may not be representative of college-going youth at private institutions or in other regions and countries. Likewise, student samples may not be representative of clinical, incarcerated, or broader community populations (e.g., working young adults who are not attending school). Indeed, in a sample of adolescent inpatients, Lüdtke, In-Albon, Michel, and Schmid (Citation2016) did not find a significant association between child maltreatment and NSSI. Given the potential for variability across samples, future studies must evaluate these associations in other adolescent and young adult populations.

Second, and related to the first concern, the generalizability of the obtained findings regarding the relative merits of statistical approaches for addressing skewness await replication in future studies using varied conceptualizations of NSSI, as well as in studies of other clinical phenomena with high public health impact despite relatively low base rates. The current study assessed NSSI outcomes using frequency bands, rather than count data. Although numerous studies have used similar frequency bands (e.g., Buser et al., Citation2019; Hasking, Boyes, Finlay-Jones, McEvoy, & Rees, Citation2019; Midkiff et al., Citation2018; Whitlock et al., Citation2013), this approach may result in reduced power or conflate potentially different degrees of NSSI. That said, extant measures that employ count report formats may inadvertently bias reports to favor particular frequency ranges by suggesting count metrics (e.g., 1, 5, 15 in Cerutti, Zuffianò, & Spensieri, Citation2018; 0, 10, 100, 500 in Nielsen, Sayal, & Townsend, Citation2017). Similarly, studies of NSSI vary with regard to duration, with some studies assessing NSSI events during the prior year as in the current study (e.g., Hasking et al., Citation2019; Muehlenkamp, Xhunga, & Brausch, Citation2019), in the past month (e.g., Auerbach et al., Citation2014; Fox et al., Citation2019), or over the course of the respondent’s life time (e.g., Cerutti et al., Citation2018). It is not yet clear if and how these variations may influence NSSI distributions and, by extension, the relative effectiveness of statistical approaches for addressing data skewness. In addition to evaluating the applicability of these approaches to different conceptualizations of NSSI, future research should test whether or not zero-inflated models yield superior predictions across multiple forms of clinical symptom expression, or whether these approaches are particularly effective for modeling NSSI. Of note, in the absence of additional samples and metrics, Monte Carlo simulation studies may support the evaluation of skewness approaches by comparing “estimated” parameters to “true” parameters (Feinberg & Rubright, Citation2016).

Third, although the current study evaluated several prominent approaches for addressing data skewness, additional models should be evaluated in future studies. Many clinical phenomena, including NSSI, can be modeled using multinomial regression approaches (e.g., by recoding NSSI frequency into a three-point ordinal scale; Yates, Carlson, & Egeland, Citation2008). In addition, semi-parametric regression-type models may overcome some of the distributional limitations of generalized linear models (Akantziliotou, Rigby, & Stasinopoulos, Citation2002; Rigby & Stasinopoulos, Citation2001, Citation2005). For example, Generalized Additive Models for Location, Scale, and Shape (GAMLSS) address skewness and kurtosis issues using non-parametric smoothing functions, though their application to questions about NSSI and other clinical phenomena awaits further evaluation.

Fourth, the current study relied on retrospective, self-report data collected at a single time point using a survey platform, which raises concerns about the validity of the data due to shared method variance, and constrains our capacity to render directional interpretations of the obtained associations. Retrospective reports of child maltreatment may be affected by false reporting (Fergusson, Horwood, & Woodward, Citation2000), errors in recollection (DiLillo et al., Citation2006), and inconsistency in and/or inability to access traumatic memories (Widom & Morris, Citation1997). However, it is important to note that underreporting (i.e., false negatives) is far more common than overreporting (i.e., false positives; Hardt & Rutter, Citation2004). Moreover, empirical studies support the convergent validity of retrospective self-reports of maltreatment with child welfare records using both administrative data and prospective indices of adjustment (Dube, Williamson, Thompson, Felitti, & Anda, Citation2004; Shaffer, Huston, & Egeland, Citation2008). Although retrospective reports capture the totality of participant’s lived experiences, rather than just those associated with reported and reportable events (Hardt & Rutter, Citation2004; Shaffer et al., Citation2008), the current findings would have been bolstered by concurrent administrative data reports, particularly given that both maltreatment and NSSI data were provided by a single informant in this study.

Conclusions

The current findings suggest that the unique skewness characteristics of NSSI may render specific analytic methods inappropriate or less accurate, including the use of skew-t distribution models, even though these approaches have been successful in modeling skewed non-clinical data (e.g., Asparouhov & Muthén, Citation2015). Apparent differences in predictive power across the approaches examined here highlight the importance of using zero-inflated models, rather than transformation or skew-t approaches. Conceptually, zero-inflated approaches offer a critical advantage by modeling symptom initiation (e.g., probability of occurrence) versus maintenance (e.g., frequency of occurrence). Indeed, by offering distinct parameter estimates, these models enable researchers to evaluate predictors of symptom onset as distinct from those accounting for symptom maintenance or exacerbation, which is consistent with broader tenets of developmental psychopathology emphasizing the possibility that distinct factors account for disorder onset versus course (Lewinsohn et al., Citation1999; Yates, Burt, & Troy, Citation2011). Efforts to elucidate distinct developmental pathways toward disorder onset versus recurrence will inform more targeted and efficacious prevention and intervention treatments, respectively. For example, prior work has suggested that social and peer factors are salient for NSSI initiation, whereas intrapersonal factors tend to be more strongly associated with NSSI recurrence (Brausch & Muehlenkamp, Citation2018; Muehlenkamp, Brausch, Quigley, & Whitlock, Citation2013). Moving forward, expanded efforts to evaluate putative mediators of the relation between child maltreatment and NSSI using zero-inflated modeling approaches will further refine extant treatment approaches.

ACKNOWLEDGMENTS

We gratefully acknowledge Dr. Chandra Reynolds who provided generous instruction and consultation throughout the execution of the analyses reported herein.

DISCLOSURE STATEMENT

No potential conflict of interest was reported by the authors.

Notes

1 All model approaches were also evaluated using the raw age variable. Across all approaches, the age-transformed models evidenced better fit. However, there were minimal differences across raw and transformed analyses with regard to predicted relations with the raw NSSI outcome scores.

2 Alternatively, following Tabachnick and Fidell (Citation2001), outliers could be re-assessed after transformations of age to evaluate which transformation recommends the fewest outlier removals. However, to maintain consistency across approaches and to remain conservative, all outliers were removed prior to transformation.

3 Analyses including outliers revealed no major differences in findings. Probability-probability and residual plots for assessing outliers are available upon request.

REFERENCES

  • Abbott, R. D., & Gutgesell, H. P. (1994). Effects of heteroscedasticity and skewness on prediction in regression: Modeling growth of the human heart. In M. L. Johnson (Ed.), Essential numerical computer methods (pp. 67–81). Oxford, UK: Academic Press.
  • Adarabioyo, M. I., & Ipinyomi, R. A. (2019). Comparing zero-inflated poisson, zero-inflated negative binomial and zero-inflated geometric in count data with excess zero. Asian Journal of Probability and Statistics, 1–10. doi:10.9734/ajpas/2019/v4i230113
  • Afifi, A. A., Kotlerman, J. B., Ettner, S. L., & Cowan, M. (2007). Methods for improving regression analysis for skewed continuous or counted responses. Annual Review Public Health, 28, 95–111. doi:10.1146/annurev.publhealth.28.082206.094100
  • Akantziliotou, K., Rigby, R. A., & Stasinopoulos, D. M. (2002). The R implementation of generalized additive models for location, scale and shape. Statistical modelling in Society: Proceedings of the 17th International Workshop on statistical modelling (pp. 75–83). Chania, Greece: Statistical Modelling Society.
  • Allen, K. J., Fox, K. R., Schatten, H. T., & Hooley, J. M. (2019). Frequency of nonsuicidal self-injury is associated with impulsive decision-making during criticism. Psychiatry Research, 271, 68–75. doi:10.1016/j.psychres.2018.11.022
  • Andover, M. S., & Gibb, B. E. (2010). Non-suicidal self-injury, attempted suicide, and suicidal intent among psychiatric inpatients. Psychiatry Research, 178(1), 101–105. doi:10.1016/j.psychres.2010.03.019
  • Asparouhov, T., & Muthén, B. (2015). Structural equation models and mixture models with continuous nonnormal skewed distributions. Structural Equation Modeling: A Multidisciplinary Journal, 23(1), 1–19. doi:10.1080/10705511.2014.947375
  • Auerbach, R. P., Kim, J. C., Chango, J. M., Spiro, W. J., Cha, C., Gold, J., … Nock, M. K. (2014). Adolescent nonsuicidal self-injury: Examining the role of child abuse, comorbidity, and disinhibition. Psychiatry Research, 220(1–2), 579–584. doi:10.1016/j.psychres.2014.07.027
  • Azzalini, A. (1985). A class of distributions which includes the normal ones. Scandinavian Journal of Statistics, 12, 171–178.
  • Benjet, C., González-Herrera, I., Castro-Silva, E., Méndez, E., Borges, G., Casanova, L., & Medina-Mora, M. E. (2017). Non-suicidal self-injury in Mexican young adults: Prevalence, associations with suicidal behavior and psychiatric disorders, and DSM-5 proposed diagnostic criteria. Journal of Affective Disorders, 215, 1–8. doi:10.1016/j.jad.2017.03.025
  • Boone, S. D., & Brausch, A. M. (2016). Physical activity, exercise motivations, depression, and nonsuicidal self-injury in youth. Suicide and Life-Threatening Behavior, 46(5), 625–633. doi:10.1111/sltb.2016.46.issue-5
  • Brausch, A. M., & Muehlenkamp, J. J. (2018). Perceived effectiveness of NSSI in achieving functions on severity and suicide risk. Psychiatry Research, 265, 144–150. doi:10.1016/j.psychres.2018.04.038
  • Buser, T. J., Pertuit, T. L., & Muller, D. L. (2019). Nonsuicidal self‐injury, stress, and self‐differentiation. Adultspan Journal, 18(1), 4–16. doi:10.1002/adsp.2019.18.issue-1
  • Cerutti, R., Zuffianò, A., & Spensieri, V. (2018). The role of difficulty in identifying and describing feelings in non-suicidal self-injury behavior (NSSI): Associations with perceived attachment quality, stressful life events, and suicidal ideation. Frontiers in Psychology, 9, 318. doi:10.3389/fpsyg.2018.00318
  • Cohen, J., Cohen, P., West, S. G., & Aiken, L. S. (2003). Applied multiple regression/correlation analysis for the behavioral sciences (3rd ed.). Mahwah, NJ: Lawerence Associates.
  • Cohen, J. E. (1988). Statistical power analysis for the behavioral sciences. Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
  • Curran, P. J., West, S. G., & Finch, J. F. (1996). The robustness of test statistics to nonnomality and specification error in confirmatory factor analysis. Psychological Methods, 1(1), 16–29. doi: 10.1.1.382.6856
  • DiLillo, D., Foriter, M. A., Hayes, S. A., Trask, E., Perry, A. R., Messman-Moore, T., … Nash, C. (2006). Retrospective assessment of childhood sexual and physical abuse. Assessment, 13, 297–312. doi:10.1177/1073191106288391
  • Driscoll, C. F. B., Murray, C. B., Holbein, C. E., Stiles‐Shields, C., Cuevas, G., & Holmbeck, G. N. (2019). Camp‐based psychosocial intervention dosage and changes in independence in young people with spina bifida. Developmental Medicine & Child Neurology, 61(12), 1392–1399. doi:10.1111/dmcn.v61.12
  • Dube, S. R., Williamson, D. F., Thompson, T., Felitti, V. J., & Anda, R. F. (2004). Assessing the reliability of retrospective reports of adverse childhood experiences among adult HMO members attending a primary care clinic. Child Abuse & Neglect, 28, 729–737. doi:10.1016/j.chiabu.2003.08.009
  • Dufour, J. P., & Zung, N. T. (2005). Poisson structures and their normal forms. Basel, Switzerland: Birkhauser.
  • Esposito, C., Bacchini, D., & Affuso, G. (2019). Adolescent non-suicidal self-injury and its relationships with school bullying and peer rejection. Psychiatry Research, 274, 1–6. doi:10.1016/j.psychres.2019.02.018
  • Feinberg, R. A., & Rubright, J. D. (2016). Conducting simulation studies in psychometrics. Educational Measurement: Issues and Practice, 35(2), 36–49. doi:10.1111/emip.12111
  • Fergusson, D. M., Horwood, L. J., & Woodward, L. J. (2000). The stability of child abuse reports: A longitudinal study of the reporting behaviour of young adults. Psychological Medicine, 30(3), 529–544. doi:10.1017/S0033291799002111
  • Fox, K. R., Huang, X., Linthicum, K. P., Wang, S. B., Franklin, J. C., & Ribeiro, J. D. (2019). Model complexity improves the prediction of nonsuicidal self-injury. Journal of Consulting and Clinical Psychology, 87(8), 684. doi:10.1037/ccp0000421
  • Fox, K. R., Ribeiro, J. D., Kleiman, E. M., Hooley, J. M., Nock, M. K., & Franklin, J. C. (2018). Affect toward the self and self-injury stimuli as potential risk factors for nonsuicidal self-injury. Psychiatry Research, 260, 279–285. doi:10.1016/j.psychres.2017.11.083
  • Gillies, D., Christou, M. A., Dixon, A. C., Featherston, O. J., Rapti, I., Garcia-Anguita, A., … Christou, P. A. (2018). Prevalence and characteristics of self-harm in adolescents: Meta-analyses of community-based studies 1990–2015. Journal of the American Academy of Child & Adolescent Psychiatry, 57(10), 733–741. doi:10.1016/j.jaac.2018.06.018
  • Glenn, C. R., Kleiman, E. M., Cha, C. B., Nock, M. K., & Prinstein, M. J. (2016). Implicit cognition about self-injury predicts actual self-injurious behavior: results from a longitudinal study of adolescents. Journal of Child Psychology and Psychiatry, 57(7), 805–813. doi:10.1111/jcpp.2016.57.issue-7
  • Godfrey, L. G., & Orme, C. D. (1991). Testing for skewness of regression disturbances. Economic Letters, 37, 31–34. doi:10.1016/0165-1765(91)90238-G
  • Gschlößl, S., & Czado, C. (2008). Modeling count data with overdispersion and spatial effects. Statistical Papers, 49, 531–552. doi:10.1007/s00362-006-0031-6
  • Guertin, T., Lloyd-Richardson, E., Spirito, A., Donaldson, D., & Boergers, J. (2001). Self-mutilative behavior in adolescents who attempt suicide by overdose. Journal of the American Academy of Child & Adolescent Psychiatry, 40(9), 1062–1069. doi:10.1097/00004583-200109000-00015
  • Hall, P. (1992). On the removal of skewness by transformation. Journal of the Royal Statistical Society: Series B (Methodological), 54(1), 221–228. doi:10.1111/rssb.1992.54.issue-1
  • Hardt, J., & Rutter, M. (2004). Validity of adult retrospective reports of adverse childhood experiences: Review of the evidence. Journal of Child Psychology and Psychiatry, 45, 260–273. doi:10.1111/jcpp.2004.45.issue-2
  • Hasking, P., Boyes, M. E., Finlay-Jones, A., McEvoy, P. M., & Rees, C. S. (2019). Common pathways to NSSI and suicide ideation: The roles of rumination and self-compassion. Archives of Suicide Research, 23(2), 247–260. doi:10.1080/13811118.2018.1468836
  • Hu, L., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling: A Multidisciplinary Journal, 6(1), 1–55. doi:10.1080/10705519909540118
  • Kaess, M., Parzer, P., Mattern, M., Plener, P. L., Bifulco, A., Resch, F., & Brunner, R. (2013). Adverse childhood experiences and their impact on frequency, severity, and the individual function of nonsuicidal self-injury in youth. Psychiatry Research, 206(2–3), 265–272. doi:10.1016/j.psychres.2012.10.012
  • Klonsky, E. D., & Olino, T. M. (2008). Identifying clinically distinct subgroups of self-injurers among young adults: A latent class analysis. Journal of Consulting and Clinical Psychology, 76(1), 22. doi:10.1037/0022-006X.76.1.22
  • Kranzler, A., Fehling, K. B., Lindqvist, J., Brillante, J., Yuan, F., Gao, X., … Selby, E. A. (2018). An ecological investigation of the emotional context surrounding nonsuicidal self‐injurious thoughts and behaviors in adolescents and young adults. Suicide and Life‐Threatening Behavior, 48(2), 149–159. doi:10.1111/sltb.2018.48.issue-2
  • Lambert, D. (1992). Zero-inflated poisson regression, with an application to defects in manufacturing. Techonometrics, 34(1), 1–14. doi:10.2307/1269547
  • Lang, C. M., & Sharma-Patel, K. (2011). The relation between childhood maltreatment and self-injury: A review of the literature on conceptualization and intervention. Trauma, Violence, & Abuse, 12(1), 23–37. doi:10.1177/1524838010386975
  • Lee, S. M., Karrison, T., Nocon, R. S., & Huang, E. (2018). Weighted zero-inflated poisson mixed model with an application to medicaid utilization data. Communications for Statistical Applications and Methods, 25(2), 173–184. doi:10.29220/CSAM.2018.25.2.173
  • Lee, S. X., & McLachlan, G. J. (2013). On mixtures of skew normal and skew t- distributions. Advances in Data Analysis and Classification, 7(3), 241–266. doi:10.1007/s11634-013-0132-8
  • Lewinsohn, P. M., Allen, N. B., Seeley, J. R., & Gotlib, I. H. (1999). First onset versus recurrance of depression: Differential process of psychological risk. Journal of Abnormal Psychology, 108(3), 483–489. doi:10.1037/0021-843X.108.3.483
  • Little, R. J. A. (1988). A test of missing completely at random for multivariate data with missing values. Journal of the American Statistical Association, 83(404), 1198–1202. doi:10.1080/01621459.1988.10478722
  • Liu, R. T., Scopelliti, K. M., Pittman, S. K., & Zamora, A. S. (2018). Childhood maltreatment and non-suicidal self-injury: A systematic review and meta-analysis. The Lancet Psychiatry, 5(1), 51–64. doi:10.1016/S2215-0366(17)30469-8
  • Lloyd, E. E., Kelley, M. L., & Hope, T. (1997, April). Self-mutilation in a community sample of adolescents: Descriptive characteristics and provisional prevalence rates. Paper presented at the Society for Behavioral Medicine, New Orleans, LA.
  • Long, J. S. (1997). Regression models for categorical and limited dependent variables. Thousand Oaks, CA: Sage Publications.
  • Lüdtke, J., In-Albon, T., Michel, C., & Schmid, M. (2016). Predictors for DSM-5 nonsuicidal self-injury in female adolescent inpatients: The role of childhood maltreatment, alexithymia, and dissociation. Psychiatry Research, 239, 346–352. doi:10.1016/j.psychres.2016.02.026
  • MacGillivray, H. L. (1986). Skewness and asymmetry: Measures and orderings. The Annals of Statistics, 14, 994–1011. doi:10.1214/aos/1176350046
  • MacKinnon, D. P., Lockwood, C. M., & Williams, J. (2004). Confidence limits for the indirect effect: Distribution of the product and resampling methods. Multivariate Behavioral Research, 39(1), 99–128. doi:10.1207/s15327906mbr3901_4
  • Midkiff, M. F., Lindsey, C. R., & Meadows, E. A. (2018). The role of coping self-efficacy in emotion regulation and frequency of NSSI in young adult college students. Cogent Psychology, 5(1), 1520437. doi:10.1080/23311908.2018.1520437
  • Minami, M., Lennert-Cody, C. E., Gao, W., & Román-Verdesoto, M. (2007). Modeling shark bycatch: The zero-inflated negative binomial regression model with smoothing. Fisheries Research, 84(2), 210–221. doi:10.1016/j.fishres.2006.10.019
  • Morse, C. D., Klingman, K. J., Jacob, B. L., & Kodali, L. (2019). Exercise and insomnia risk in middle-aged women. The Journal for Nurse Practitioners, 15(3), 236–240. doi:10.1016/j.nurpra.2018.10.020
  • Muehlenkamp, J., Brausch, A., Quigley, K., & Whitlock, J. (2013). Interpersonal features and functions of nonsuicidal self‐injury. Suicide and Life‐Threatening Behavior, 43(1), 67–80. doi:10.1111/sltb.2013.43.issue-1
  • Muehlenkamp, J. J., Xhunga, N., & Brausch, A. M. (2019). Self-injury age of onset: A risk factor for NSSI severity and suicidal behavior. Archives of Suicide Research, 23(4), 551–563. doi:10.1080/13811118.2018.1486252
  • Muthén, L. K., & Muthén, B. O. (1998–2017). Mplus user’s guide (8th ed.). Los Angeles, CA: Muthén & Muthén.
  • Nielsen, E., Sayal, K., & Townsend, E. (2017). Functional coping dynamics and experiential avoidance in a community sample with no self-injury vs. non-suicidal self-injury only vs. those with both non-suicidal self-injury and suicidal behaviour. International Journal of Environmental Research and Public Health, 14(6), 575. doi:10.3390/ijerph14060575
  • Nock, M. K. (2010). Self-injury. Annual Review of Clinical Psychology, 6, 339–363. doi:10.1146/annurev.clinpsy.121208.131258
  • Nock, M. K., Holmberg, E. B., Photos, V. I., & Michel, B. D. (2007). Self-injurious thoughts and behaviors interview: Development, reliability, and validity in an adolescent sample. Psychological Assessment, 19(3), 309–317.
  • Nock, M. K., & Prinstein, M. J. (2004). A functional approach to the assessment of self-mutilative behavior. Journal of Counseling & Clinical Psychology, 72(5), 885–890. doi:10.1037/0022-006X.72.5.885
  • Nock, M. K., & Prinstein, M. J. (2005). Contextual features and behavioral functions of self-mutilation among adolescents. Journal of Abnormal Psychology, 114(1), 140–146. doi:10.1037/0021-843X.114.1.140
  • Osborne, J. W., & Waters, E. (2002). Four assumptions of multiple regression that researchers should always test. Practical Assessment, Research, and Evaluation, 8(1), 2.
  • Rigby, R. A., & Stasinopoulos, D. M. (2001, July). The GAMLSS project: A flexible approach to statistical modelling. New trends in statistical modelling: Proceedings of the 16th international workshop on statistical modelling (Vol. 337, p. 345). Odense, Denmark: University of Southern Denmark.
  • Rigby, R. A., & Stasinopoulos, D. M. (2005, June). Generalized additive models for location, scale and shape. Journal of the Royal Statistical Society: Series C (Applied Statistics), 54(3), 507–554. doi:10.1111/j.1467-9876.2005.00510.x
  • Robertson, C. D., Miskey, H., Mitchell, J., & Nelson-Gray, R. (2013). Variety of self-injury: Is the number of different methods of non-suicidal self-injury related to personality, psychopathology, or functions of self-injury? Archives of Suicide Research, 17(1), 33–40. doi:10.1080/13811118.2013.748410
  • Sanders, B., & Becker-Lausen, E. (1995). The measurement of psychological maltreatment: Early data on the child abuse and trauma scale. Child Abuse & Neglect, 19(3), 315–323. doi:10.1016/S0145-2134(94)00131-6
  • Schoenleber, M., Berenbaum, H., & Motl, R. (2014). Shame-related functions of and motivations for self-injurious behavior. Personality Disorders: Theory, Research, and Treatment, 5(2), 204. doi:10.1037/per0000035
  • Serafini, G., Canepa, G., Adavastro, G., Nebbia, J., Belvederi Murri, M., Erbuto, D., … Amore, M. (2017). The relationship between childhood maltreatment and non-suicidal self-injury: A systematic review. Frontiers in Psychiatry, 8, 149. doi:10.3389/fpsyt.2017.00149
  • Shaffer, A., Huston, L., & Egeland, B. (2008). Identification of child maltreatment using prospective and self-report methodologies: A comparison of maltreatment incidence and relation to later psychopathology. Child Abuse & Neglect, 32, 682–692. doi:10.1016/j.chiabu.2007.09.010
  • Swannell, S. V., Martin, G. E., Page, A., Hasking, P., & St John, N. J. (2014). Prevalence of nonsuicidal self‐injury in nonclinical samples: Systematic review, meta‐analysis and meta‐regression. Suicide and Life‐Threatening Behavior, 44(3), 273–303. doi:10.1111/sltb.2014.44.issue-3
  • Tabachnick, B. G., & Fidell, L. S. (2001). Using multivariate statistics. Needham Heights, MA: Allyn and Bacon.
  • Thomassin, K., Shaffer, A., Madden, A., & Londino, D. L. (2016). Specificity of childhood maltreatment and emotion deficit in nonsuicidal self-injury in an inpatient sample of youth. Psychiatry Research, 244, 103–108. doi:10.1016/j.psychres.2016.07.050
  • Tüzen, M. F., & Erbaş, S. (2018). A comparison of count data models with an application to daily cigarette consumption of young persons. Communications in Statistics-Theory and Methods, 47(23), 5825–5844. doi:10.1080/03610926.2017.1402050
  • Vergara, G. A., Stewart, J. G., Cosby, E. A., Lincoln, S. H., & Auerbach, R. P. (2019). Non-suicidal self-injury and suicide in depressed adolescents: Impact of peer victimization and bullying. Journal of Affective Disorders, 245, 744–749. doi:10.1016/j.jad.2018.11.084
  • Weisberg, S. (2005). Applied linear regression (Vol. 528). Hoboken, NJ: John Wiley & Sons.
  • Whitlock, J., Muehlenkamp, J., Eckenrode, J., Purington, A., Abrams, G. B., Barreira, P., & Kress, V. (2013). Nonsuicidal self-injury as a gateway to suicide in young adults. Journal of Adolescent Health, 52(4), 486–492. doi:10.1016/j.jadohealth.2012.09.010
  • Whitlock, J. L., Eckenrode, J., & Silverman, D. (2006). Self-injurious behaviors in a college population. Pediatrics, 117(6), 1939–1948. doi:10.1542/peds.2005-2543
  • Widom, C. S., & Morris, S. (1997). Accuracy of adult recollection of childhood victimization: Part 2 childhood sexual abuse. Psychological Assessment, 9, 34–46. doi:10.1037/1040-3590.9.1.34
  • Williams, M. N., Grajales, C. A. G., & Kurkiewicz, D. (2013). Assumptions of multiple regression: Correcting two misconceptions. Practical Assessment, Research, and Evaluation, 18(1), 11.
  • Yates, T. M. (2009). Developmental pathways from child maltreatment to nonsuicidal self-injury. In M. Nock (Ed.), Understanding non-suicidal self-injury: Current science and practice (pp. 117–137). Washington, DC: American Psychological Association.
  • Yates, T. M., Burt, K. B., & Troy, M. F. (2011). A developmental approach to clinical research, classification & practice. In D. Cicchetti & G. I. Roisman (Eds.), The Minnesota symposia on child psychology: The origins and organization of adaptation and maladaptation (Vol. 36, pp. 231–282). New York, NY: Wiley.
  • Yates, T. M., Carlson, E., & Egeland, B. (2008). A prospective study of child maltreatment and self-injurious behavior in a community sample. Development and Psychopathology, 20, 651–671. doi:10.1017/S0954579408000321
  • Yates, T. M., Tracy, A. J., & Luthar, S. S. (2008). Nonsuicidal self-injury among “privileged” youths: Longitudinal and cross-sectional approaches to developmental process. Journal of Consulting and Clinical Psychology, 76(1), 52–62. doi:10.1037/0022-006X.76.1.52
  • You, J., & Leung, F. (2012). The role of depressive symptoms, family invalidation and behavioral impulsivity in the occurrence and repetition of non-suicidal self-injury in Chinese adolescents: A 2-year follow-up study. Journal of Adolescence, 35(2), 389–395. doi:10.1016/j.adolescence.2011.07.020
  • You, J., Zheng, C., Lin, M. P., & Leung, F. (2016). Peer group impulsivity moderated the individual-level relationship between depressive symptoms and adolescent nonsuicidal self-injury. Journal of Adolescence, 47, 90–99. doi:10.1016/j.adolescence.2015.12.008

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.