721
Views
2
CrossRef citations to date
0
Altmetric
Research Article

Revisiting the psychometric properties of a revised Danish version of the McGill ingestive skills assessment

& ORCID Icon | (Reviewing Editor)
Article: 1281558 | Received 01 Nov 2016, Accepted 09 Jan 2017, Published online: 23 Jan 2017

Abstract

Background: During a longstanding validation process of the Danish version of the McGill Ingestive Skills Assessment (MISA2-DK) for measuring mealtime performance in dysphagic clients, extensive revisions have been undertaken. Therefore, this study aimed to determine the psychometric properties of this revised version. Methods: In a cross-sectional study, 328 adults referred to occupational therapy for swallowing evaluation were included. MISA2-DK with 36 items distributed into four subscales (positioning for meals, self-feeding skills, liquid ingestion, and solid ingestion) was administered as observation during a meal. Statistical analysis included item analysis by the Rasch model and exploratory factor analysis (EFA). Results: The initial analysis of MISA2-DK presented misfit to the Rasch model, which was resolved by grouping items within subscales into testlets to adjust for local item dependency. However, when testing the items within each subscale, the subscale structure was not supported. The EFA and further item analysis by the Rasch model suggested a different distribution of items—namely, anticipation, bolus preparation, bolus propulsion, and airway protection. Conclusion: The total MISA2-DK score might provide a unidimensional measure of mealtime performance. However, for detailed information of qualitative aspects of dysphagic clients’ mealtime performance, the four EFA-derived subscale domains are recommended.

Public Interest Statement

Mealtimes are important events in everyday life for everyone, as they are linked to our physical health and are associated with social interactions, fun and emotional support. Adults with disabilities can experience swallowing difficulties and choking during meals with potentially harmful health and psychosocial consequences, which decreases quality of life. To provide proper mealtime support, health care professionals employ mealtime observations as a part of the investigation procedures for the swallowing function. A longstanding research process has resulted in a robust assessment procedure of mealtime performance related to mealtime positioning and self-feeding skills as well as functions related to taking in and manipulating solids and liquids through the mouth and throat into the body. By using this assessment procedure, occupational therapists could validly document impaired skills during meals, which can provide important information for choosing relevant strategies enabling safe and enjoyable meals.

Competing Interests

The authors declare no competing interest.

1. Introduction

Swallowing disorders (dysphagia) in adults are a secondary consequence of several underlying pathologies that affect the ability to safely and efficiently eat and drink during meals (Roden & Altman, Citation2013; Shaker, Easterling, Belafsky, & Postma, Citation2012). Dysphagia is associated with high morbidity and mortality due to increased risk of aspiration pneumonia and malnutrition (Altman, Yu, & Schaefer, Citation2010; Roden & Altman, Citation2013) and negatively influences quality of life for the sufferers and their families (Cichero & Clavé, Citation2012). Feeding, eating and drinking, essential for adequate nutrition and hydration, are forms of social interaction involved in many occupations of everyday life (Clark, Avery-Smith, Wold, Anthony, & Holm, Citation2007). Individuals with dysphagia report social and psychological problems such as anxiety, shame, loss of control and lack of enjoyment while eating and drinking (Martino, Beaton, & Diamant, Citation2009). This may lead to social isolation and the avoidance of participation in meal-related activities (Klinke, Wilson, Hafsteinsdóttir, & Jónsdóttir, Citation2013). Accordingly, the affected individual must cope with the life-threatening and social consequences of dysphagia (Cichero & Clavé, Citation2012). Therefore, comprehensive assessment, which includes systematic observation of actual mealtime performance, is highly recommended (Danish Health Authority, Citation2015; Klinke et al., Citation2013; Scottish Intercollegiate Guidelines Network [SIGN], Citation2010; Shaker et al., Citation2012).

On the multidisciplinary dysphagia rehabilitation team, occupational therapists (OTs) provide information on the specific performance components involved in self-feeding, eating/drinking and swallowing during meals (Clark et al., Citation2007; Hansen, Kjaersgaard, & Faber, Citation2011). Such systematic observation of mealtime performance should preferably be undertaken using rating-scale-based measurements for documentation (de Vet, Terwee, Mokkink, & Knol, Citation2011). A systematic review (Hansen, Kjaersgaard, et al., Citation2011) suggests that the Canadian McGill Ingestive Skills Assessment (MISA) (Lambert, Gisel, Wood-Dauphine, Groher, & Abrahamowicz, Citation2006) provides OTs with adequate valid and reliable observation-based measures of actual mealtime performance.

Observed mealtime performance is a complex and integrated neuromuscular process of the following: anticipation with somatosensory, cognitive, pre-oral motor and psychosocial factors engendered by the meal; bolus preparation with bolus manipulation and formation in the mouth; bolus propulsion through the oral cavity, pharynx and oesophagus into the stomach; and airway protection (Clark et al., Citation2007; Leopold & Kagel, Citation1997). In MISA, this process is expressed as 43 ingestive skill items to be rated on a three-point ordinal scale, which are summed into five subscale scores related to positioning, self-feeding skills, liquid ingestion, solid ingestion and texture management as well as into a total score (Lambert et al., Citation2006). As such, both multidimensional measures and a single unidimensional measure of the latent construct of mealtime performance are provided (Christensen, Kreiner, & Mesbah, Citation2013).

MISA was translated into a Danish version (MISA-DK) (Hansen, Lambert, & Faber, Citation2011) and has been validated using classical test theory (Hansen, Lambert, & Faber, Citation2012a, Citation2012b). The Danish validation process added item analysis by the Rasch model (Hansen, Citation2014; Hansen et al., Citation2012b) within the conventions of modern item-response theory (Christensen et al., Citation2013; de Vet et al., Citation2011). The Rasch methodology involves statistical analysis for investigating whether a set of rating scale data satisfies the assumptions of the model, which is based on principles of fundamental measurement (Christensen et al., Citation2013; Tennant & Conaghan, Citation2007). If ordinal data in a rating scale fit the assumptions in the Rasch model, transformation into an interval scale is permitted, and the summed scores are regarded as specific objective and sufficient (Christensen et al., Citation2013). That is, invariant comparisons of items and persons can be made in terms of a constant unit, and all available information is in the person’s or the item’s total score (Christensen et al., Citation2013). Such features make an outcome measure very useful in clinical practice and research trials (de Vet et al., Citation2011).

The added item analyses by the Rasch model showed that MISA-DK did not satisfy the principles of fundamental measurement (Hansen, Citation2014; Hansen et al., Citation2012b). Accordingly, MISA-DK and MISA have been revised (Canadian Occupational Therapy Association [CAOT], Citation2015; Hansen, Citation2014) and thus require a new large-scale validation process (Christensen et al., Citation2013; de Vet et al., Citation2011). Therefore, the aim of this study was to revisit the psychometric properties of MISA2-DK to determine whether the MISA2-DK total and subscale scores satisfy the principle of fundamental measurements. Since the exact number of dimensions and the factor structure of the original MISA have never been established (Lambert et al., Citation2006), this study also addressed whether the MISA2-DK items should be distributed into alternative subscales.

2. Method

2.1. Design and setting

The study used a cross-sectional design and involved OT departments at four regional hospitals and three community-based rehabilitation (CBR) units in Denmark (ClinicalTrials.gov, identifier No: NCT02328196).

2.2. Participants

Adults ≥18 years referred to an OT for swallowing evaluation because of known or suspected dysphagia of neurogenic or peripheral mechanical origin (Shaker et al., Citation2012) were included. All participants provided written informed consent that their data were used for the study. Participants with severe cognitive impairment, poorly controlled psychosis or known contraindications to ingest any liquids and foods were excluded. The study was approved by the Institutional Review Board at the Metropolitan University College in Copenhagen, Denmark and by the Danish Data Protection Authority (Reg. No: 2014-41-3230). According to Danish legislation, the study did not need approval by the local ethical committee in the Capital Region (Reg. No: H-2-2014-044).

2.3. Measurement

MISA2-DK consists of 36 ingestive skill items distributed into four subscales: positioning for meals (four items); self-feeding skills (seven items); liquid ingestion (11 items) and solid ingestion (14 items). All items are scored on a three-point ordinal scale (1 = absent ingestive skill; 2 = insufficient ingestive skill performance; 3 = adequate ingestive skill performance), which are summated into subscale scores and a total score. This format is a result of the revisions that included clarifications of five items, development of three new items and replication of three items from the solid ingestion subscale to the liquid ingestion subscale. The texture management items from the original version are regarded as single items not to be summarised. Instead, these items reflect the test-meal of four liquid texture categories (thin, mildly thick, moderately thick, and extremely thick) and four main solid texture categories (regular, soft, minced and moist, and pureed). Based on the assigned scores to the 36 ingestive skill items, the OT judges the participant’s ability to efficiently and safely ingestion of the textures (CAOT, Citation2015; Hansen, Citation2014).

2.4. Procedure

MISA2-DK was part of the initial routine assessment procedures for dysphagia at the included sites. Specially trained OTs (Hansen, Madsen, & Sørensen, Citation2016) administered the MISA2-DK to the participants for observation during a test meal at breakfast or lunchtime from September 2014 to June 2015. For inpatient participants, the assessments occurred at the hospital ward. For participants in CBR, the assessment occurred at the CBR unit or in the home environment.

2.5. Data analysis

Descriptive statistics were conducted using IBM Statistics SPSS® version 22 for Windows (SPSS Inc., Chicago, IL) to describe the characteristics of the sample and perform missing values analysis of the score records on the MISA2-DK items. An acceptable percentage of missing values was less than 15% per item (de Vet et al., Citation2011).

The psychometric properties were analysed in three stages. The first stage used item analysis by the Rasch model to test whether the MISA2-DK total scale and the proposed subscales met the model expectations. The second stage used exploratory factor analysis (EFA) to determine the dimensions underpinning MISA2-DK, and the third stage tested whether these new subscales met the assumptions in the Rasch model.

Item analysis by the Rasch model was conducted using RUMM2030 (Andrich, Lyne, Sheridan, & Luo, Citation2010), which integrates a pairwise conditional maximum likelihood algorithm in the estimation of item and person parameters that are placed on the same logit-scale centred by a mean item location of zero. Positive values reflect difficult items and high ability levels, and negative values reflect easy items and low ability levels (Andrich & Luo, Citation2002; Christensen et al., Citation2013). A significant likelihood ratio test (p < 0.001) indicated adoption of the Partial Credit model that allows each item to have a unique rating scale structure (Christensen et al., Citation2013). Because of the specific objectivity property of the Rasch model (Christensen et al., Citation2013), missing values were handled without any imputation.

The analysis followed established procedures (Christensen et al., Citation2013; Hagquist, Bruce, & Gustavsson, Citation2009; Pallant & Tennant, Citation2007) and was initiated as follows:

The overall fit of the scale to the model was investigated by an item-trait interaction using the chi-square (χ2) statistic. A non-significant probability value indicates that there is no substantial deviation from the model and that the scale items function hierarchically from easy to difficult in the same pattern across all ability groups (i.e. are invariant). In addition, the overall person and item fit was addressed by inspecting the mean item and person standardised fit residuals (FR), which should be close to zero with a SD of 1.4 or less (Hagquist et al., Citation2009; Pallant & Tennant, Citation2007). The unidimensionality of the scale (i.e. all items in the rating scale measure a single construct) was assessed using principal component analysis conducted on the residual correlations to identify subsets of items loading positively and negatively on the first principal component. Person estimates from these two subsets of items were compared through a series of independent t-tests. If less than 5% of these tests are significant or if the lower bound of an exact binomial confidence interval is below 5%, the scale can be considered unidimensional (Hagell, Citation2014; Smith, Citation2002). The Person Separation Index (PSI) and Cronbach’s alpha assessed the reliability of the scale, and values of 0.7 and 0.9 indicate sufficient reliability for group and individual use (Fisher, Citation1992).

Next, sources of deviation from model expectations were addressed. Misfitting items and persons were identified by standardised FR values outside the range ±2.5 and a significant χ2 statistic (Hagquist et al., Citation2009; Pallant & Tennant, Citation2007). Misfitting items or persons were removed in a stepwise procedure to obtain an overall fit to the model. The structure of the score categories was investigated by studying the thresholds of each item. Thresholds refer to the point between two adjacent score categories where both scores are equally probable and monotonicity is expected. That is, there is a monotonic relationship between the increasing item score and increasing level of the latent construct (Andrich, de Jong, & Sheridan, Citation1997). Disordered thresholds were resolved by combining adjacent categories (Andrich et al., Citation1997). Local item independency, which means that a person’s score on one item does not depend on another item’s score (Christensen et al., Citation2013), was evaluated using the residual correlation matrix. Residual correlation values > 0.2 above the average residual correlation was taken as an indicator for local item dependency (LID) (Christensen, Makransky, & Horton, Citation2015). LID was adjusted by combining the dependent items to form a larger item, called a testlet (Marais & Andrich, Citation2008). If the pattern of the residual correlations suggests that LID were between clusters of items within subscales, the items were grouped and treated as a testlet (Andrich, Citation2015; Nilsson & Tennant, Citation2011; Reise, Morizot, & Hays, Citation2007). When testlets of subscale domains are added to form a total score, RUMM2030 allows the variance components of a multidimensional construct to be separated into the common variance component (responsible for the correlation of the dimensions) and the dimension-specific variance component (Andrich, Citation2015). The latent estimate (i.e. mealtime performance) was the assessment associated with this common variance of the items in the testlets after having discarded the unique variance related to multidimensionality (i.e. the different aspects covered by each testlet) (Andrich, Citation2015). The absence of a differential item function (DIF) was also addressed. No DIF implies that a particular item’s score does not differ due to exogenous factors for persons with equal ability levels (Christensen et al., Citation2013; Tennant & Pallant, Citation2007). DIF was evaluated for three age groups with similar sample sizes (32–73 years, 74–84 years, 85–108 years), sex (female, male), and setting (hospital or CBR) and was tested by conducting an analysis of variance of the standardised residuals. If there is a consistent systematic difference in the records to an item across the latent construct (i.e. uniform DIF), it was resolved by splitting the item into the number of compared groups (Tennant & Pallant, Citation2007).

For all final solutions, targeting was evaluated. To be acceptable, mean person locations should approximate the mean item threshold location (i.e. 0.0 logits), and item thresholds should cover approximately the same range of the logit scale as the person locations (Hagquist et al., Citation2009; Pallant & Tennant, Citation2007). All analysis was significant at the 5% level with a Bonferroni correction applied.

EFA was based on a polychoric correlation matrix for ordinal data (Holgado-Tello, Chacon-Moscoso, Barbero-Garcia, & Vila-Abad, Citation2010), using the FACTOR software program version 10.3.01 (Lorenzo-Seva & Ferrando, Citation2006). Observations with missing values were excluded. Bartlett’s Test of specificity and the Kaiser–Meyer–Olkin (KMO) measure of sampling adequacy tested suitability of the data for EFA (Beavers et al., Citation2013). The method of factor extraction was unweighted least squares (ULS) followed by oblique (direct oblimin) rotation of factors, which allow factors to be correlated. The scree plot and parallel analysis (PA) with optimal implementation guided the number of retained factors (Baglin, Citation2014). Items with factor loadings <0.35 were not retained. Items with significant loadings ≥0.35 on more than one factor were assigned to the factor with the highest loading value.

Sample size: For item analysis by the Rasch model, a sample size of more than 250 allows for accurate and stable person and item estimates (Chen et al., Citation2014). For EFA, the subject-to-item ratio should to be more than five (Beavers et al., Citation2013).

3. Results

In total, 328 participants were included. Table displays the characteristics of the sample. The number of missing values was <15% for all MISA2-DK items with a mean of 0.75% (range 0–11%).

Table 1. Demographic characteristics (N = 328)

3.1. Stage 1: Analysis by the Rasch mode: the original MISA2-DK total scale and subscales

Table (Analyses 1–9) displays the overall summary fit statistics, and Table shows the item fit statistics of the initial analysis. As seen in Table (Analysis 1), the initial analysis of all 36 MISA2-DK items showed overall model misfit. Three items and 20 participants showed misfit, three items had disordered thresholds, and DIF by setting was present for item 3 (F(df) = 16.71(1), p < 0.001) with inpatient participants more likely to obtain a higher score. LID was identified between items within and between all four subscales (see Table ). The best fitting solution was to use testlets to adjust LID between items within the four subscales, which provided excellent overall fit (Table , Analysis 2). The correlation between the testlets was high (r = 0.80), and 88% of the total variance was found to be common.

Table 2. Summary of the overall fit statics from the analysis of MISA2-DK by the Rasch model

Table 3. Item level fit statistics from the initial analyses of MISA2-DK by the Rasch model

Separate analysis of the four subscales revealed that the positioning subscale was initially consistent with the Rasch model expectations (Table , Analysis 3). For the self-feeding skills subscale, three items showed misfit (item 5: FR = −3.3, χ2(df) = 14.57(3), p = 0.002; item 6: FR = −4.5, χ2(df) = 27.81(3), p < 0.001; and item 7 FR = −3.6, χ2(df) = 19.67(4), p < 0.001). Disordered thresholds were seen for items 5 and 7, and LID was found between items 6 and 7 (r = 0.32). The best fitting solution was removal of the three misfitting items. However, the item-trait interaction remained significant and the reliability became very low (Table , Analyses 4 and 5). For the liquid ingestion subscale, it was necessary to remove two misfitting items (item 19: FR = 4.0, χ2(df) = 14.57(2), p < 0.001 and item 20: FR = 2.5, χ2(df) = 17.40(3), p < 0.001) and to adjust LID between items 12 and 13 (r = 0.43) (Table , Analyses 6 and 7). To improve the overall model fit for the solid ingestion subscale, it was necessary to remove 25 misfitting participants and one misfitting item (item 32: FR = 3.4, χ2(df) = 20.04(3), p < 0.001) and to adjust LID between items 26 and 27 (r = 0.48) (Table , Analyses 8 and 9). No items displayed DIF.

3.2. Stage 2: EFA of 36 MISA2-DK items

The EFA of the 36 MISA2-DK items included 263 participants resulting in a subject-to-item ratio of seven. Bartlett’s test of specificity was highly significant (χ2(df) = 5,073.4 (630), p < 0.0001), and the KMO measure was 0.86, which supports the factorability of the matrix. Both the scree plot and the parallel analysis suggested extraction of five factors, which explained 53% of the total variance. As seen in Table , the Oblimin rotation of the five-latent-factor solution revealed that most of the 36 items loaded significantly on only one factor. Item 20, Drinks with a sequence of sips, failed to load significantly on any factors. The five-factor solution could be interpreted as representing different aspects of ingestion: anticipation (items 1–13 and item 23), bolus formation (item 24 and items 26–28), bolus manipulation (items 14, 25, 33 and 34), bolus propulsion in the oral cavity (items 15, 16, and 29) and the pharyngeal cavity (items 17, 18, 30, and 31), and airway protection (items 19, 21, 22, 35, 32, and 36).

Table 4. Pattern matrix of a five-factor solution from ULS with oblimin rotation (N = 263)

3.3. Stage 3: Analysis by the Rasch model: the EFA-derived MISA2-DK subscales

Table (Analyses 10–21) displays the item analysis by the Rasch model of the EFA-derived structure of MISA2-DK. The anticipation subscale did not fit the model. It was necessary to adjust LID between items 5, 6 and 7 (item 5/6: r = 0.33 and item 6/7: r = 0.41) and between items 12 and 13 (r = 0.44). Furthermore, it was necessary to split two items with DIF by settings that were biased towards inpatient participants (item 2: F(df) = 10.88(1), p = 0.001 and item 3: F(df) = 25.12(1), p < 0.001) (Table , Analyses 10 and 11). The bolus formation subscale showed model misfit. The removal of one misfitting item (item 27: FR = −1.59, χ2(df) = 18.54(2), p < 0.001) increased the overall model fit for items, but decreased the PSI markedly (Table , Analysis 12 and 13). The bolus manipulation subscale showed misfit to the model. Removal of one misfitting item (item 33: FR = −2.88, χ2(df) = 3.55(2), p = 0.169) improved the fit residual SD for items. However, the χ2-statistic was highly significant (p < 0.001), and the reliability was too low (Table , Analyses 14 and 15). Given the misfit of these two subscales, and since their items relate to bolus preparation in the mouth, they were combined. This bolus preparation subscale showed slight deviations from the Rasch model, which improved after adjusting LID between items 26 and 27 (r = 0.36) (Table , Analyses 16 and 17). The bolus propulsion subscale showed adequate fit to the model. This improved after splitting item 29, which displayed DIF by sex that were biased towards the male sex (F(df) = 13.44(1), p < 0.001) (Table , Analyses 18 and 19). The airway protection subscale showed adequate overall fit statistics (Table , Analysis 20). Analysis using testlets of the four subscales anticipation, bolus preparation, bolus propulsion and airway protection resulted in approximately good model fit (Table , Analysis 21). The correlation between the testlets was moderate (r = 0.65) and 81% of the total variance was found to be common.

For all final analyses in stages 1 and 3, targeting was suboptimal (Table ).

Table 5. Targeting of the original and EFA-derived MISA2-DK subscales and total scale

4. Discussion

As a part of the ongoing process of scale validation, this study revisited the psychometric properties of MISA2-DK. The number of missing values was acceptable (de Vet et al., Citation2011), and the item analysis by the Rasch model in stage 1 supported the psychometric properties of a total score using testlets of the original four subscales as previously published (Hansen, Citation2014). For the four subscale sum scores, only the positioning subscale satisfied the model expectations. For the remaining three subscales, it was necessary to remove misfitting items, and it was possible to achieve reasonable fit only for liquid and solid ingestion. None of the new developed items in MISA2-DK showed misfit, and disordered thresholds were present for as few as three of the original items, namely items 5, 6 and 7. This is an improvement compared to Hansen et al. (Citation2012b). However, the category ordering is not operating as intended for these items, which assess the ability to choose, grasp and bring a cup/glass or utensils to the mouth. Therefore, the content of the three items might benefit from revisions (Andrich et al., Citation1997).

The EFA in stage 2 indicated a factor solution that did not match the original four subscales but suggested five factors interpreted as anticipation, bolus formation, bolus manipulation, bolus propulsion, and airway protection. Item 20 (Drinks with a sequence of sips) failed to load on any factors during the EFA, and it might be speculated whether drinking with a sequence of sips is a part of normal mealtime performance. However, since the physiology of multiple swallows is different from a single swallow, it is a clinically important item (SIGN, Citation2010; Shaker et al., Citation2012) but must be regarded as a single item.

The subsequent item analysis by the Rasch model in stage 3 resulted in four subscales—anticipation, bolus preparation, bolus propulsion, and airway protection -, which obtained satisfactory model fit after adjustments of DIF and LID for some items.

The initial analysis by the Rasch model in stage 1 suggested that the main reason for model misfit for the total score was LID between items within subscales. However, LID was also present between items across subscales, which largely reflected the factor solution in the subsequent EFA in stage 2. LID can be caused by response dependency or multidimensionality (Marais & Andrich, Citation2008). Since the items in MISA2-DK reflect a sequence of skills necessary for successful ingestion (Clark et al., Citation2007; Leopold & Kagel, Citation1997), it is likely that response dependency might have caused the observed LID. However, response dependency inflates the reliability of a scale (Marais & Andrich, Citation2008). When the items within the four original subscales as well as within the four EFA-derived subscales were grouped and treated as a testlet to adjust for the LID, the reliability decreased. This might also reflect response dependency (Marais & Andrich, Citation2008). Applying the testlet solution for the original four subscales resulted in a very good fit to the Rasch model and with 88% of the total non-error variance as common. This supports MISA2-DK’s ability to summarise the participants’ ingestive skill ability profiles by a single measure (Andrich, Citation2015). Thus, it is possible to maintain the integrity of the original MISA2-DK scale (Nilsson & Tennant, Citation2011). This solution was in many ways more persuasive than the testlet solution for the EFA-derived subscales, which obtained less satisfactory overall fit to the Rasch model. This suggests that the total score based on the original four subscales might better reflect the common underlying construct of mealtime performance. A total score offers advantages in clinical identification of clients with deficient ingestive skills and may simplify research analysis for example, on the effectiveness of dysphagia rehabilitation. However, it is possible that real gains within one or two subscale domains will be obscured within a large scale assessing overall mealtime performance. Thus, grouping items into subscales will allow detailed information of a dysphagic client’s ingestion profile and detailed evaluation at more specific levels. This is particularly relevant for OTs in dysphagia rehabilitation, which typically addresses several aspects of ingestive skills (Clark et al., Citation2007; SIGN, Citation2010; Shaker et al., Citation2012). Accordingly, this study might suggest that the four subscales related to anticipation, bolus preparation, bolus propulsion, and airway protection, which was derived in stages 2 and 3, should be used rather than the original subscales. In fact, this structure seems to be clinically and conceptual meaningful (Clark et al., Citation2007; Leopold & Kagel, Citation1997).

During the item analysis by the Rasch model in stages 1 and 3, the test for DIF revealed that items 2 and 3, which reflect basic mobility functions (Hansen, Lambert, et al., Citation2011), were biased towards inpatient participants, who were more likely to obtain a higher score than participants in CBR. For the participants in CBR, approximately 84% of the MISA2-DK assessments occurred in the home environment. Therefore, environmental factors, such as inappropriate furniture, might have influenced the sitting positioning. However, this can be only speculative since the data collection did not include detailed records on the mealtime environment. DIF was also found for item 29, which reflects the strength of the tongue, and was biased towards males who were more likely to obtain a higher score than females. The reason for the DIF might be that tongue strength is significantly greater in males compared to females (Adams, Mathisen, Baines, Lazarus, & Callister, Citation2013). Since all DIF was uniform, it was possible to split the items into group-specific items (Tennant & Pallant, Citation2007).

4.1. Methodological considerations

This study has certain limitations. All analyses are based upon a single sample, and it would have been desirable to replicate the analysis by the Rasch model in stage 3 with a validation sample, making the results more robust. However, although the current sample size was sufficiently large (Beavers et al., Citation2013; Chen et al., Citation2014), it did not allow division into two samples. In addition, the sample tended to be at the ceiling of the scale. For the subscales, high percentages of extreme scores were present, and the coverage of items to persons was reduced. This indicates suboptimal targeting, which might have influenced the PSI negatively (Christensen et al., Citation2013). Additionally, confirmatory factor analysis to test the proposed subscale structure could have been used (de Vet et al., Citation2011). This was performed as a part of the analyses for this study using the Bayesian approach for ordinal data. Because of the high number of items and the presence of LID, it was not possible to draw any conclusions. Finally, since a new validation process has not been applied to the Canadian MISA2, the cross-cultural validity of MISA2-DK is not yet fully established. For this, DIF analysis of merged data from both versions is needed (Christensen et al., Citation2013; de Vet et al., Citation2011).

5. Conclusion

Revisiting the psychometric properties of MISA2-DK using item analyses by the Rasch model could support a total score used to provide a unidimensional measure of mealtime performance, thus enabling identification of clients with deficient ingestive skills in clinical practice and research studies. However, given that not all four original MISA2-DK subscales—positioning, self-feeding skills, liquid ingestion and solid ingestion—satisfied the requirements in the Rasch model, they cannot be recommended to obtain detailed information and identification of qualitative aspects of mealtime performance. Rather, it is recommended to use the four EFA-derived subscales: anticipation, bolus preparation, bolus propulsion, and airway protection. However, further testing is required to confirm the new structure.

Author contributions

Tina Hansen, conception and design, translation of the protocol into practice, acquisition of data, statistical analyses, lead authorship by drafting the article, final approval of the version to be published; Dorte Melgaard Kristiansen, translation of the protocol into practice, acquisition of data, interpretation of the data analyses, reading, editing, and final approval of the version to be published.

Funding

The work was supported by the Research Council, Metropolitan University College, Copenhagen under [grant number: 13-4705].

Acknowledgement

Thanks to all participants who took part in the study. Special thanks go to the following OTs for coordinating the data collection at the included sites: Gitte Østergaard Poulsen, Lene Engelbrecht, Miriam Wolf Olsen and Lina S. Fink. In addition, thanks to Senior lecturer Anne-Le Morville, PhD at the Division of Physical and Occupational Therapy, Metropolitan University College in Copenhagen for thorough and critical reading of the manuscript. Finally, special thanks go to Dr Heather Lambert at the School of Physical and Occupational Therapy, McGill University, Montreal, Quebec, Canada for letting us become involved in the further development of MISA.

Additional information

Notes on contributors

Tina Hansen

Dr Tina Hansen is Senior lecturer at the Division of Physical and Occupational Therapy at the Metropolitan University College in Denmark. Dr Hansen’s work is focused on the optimization of activity-based measurement tools for rehabilitation planning as well as motor learning principles and activity-based interventions targeting the swallowing function among adults suffering from dysphagia.

Dorte Melgaard Kristiansen

Dorte Melgaard Kristiansen is a PhD student at the Aalborg University in Denmark, and has an interest in dysphagia and its implications among frail elders with community acquired pneumonia. The current article supports the continuing collaboration between the authors, which aims to improve the application of evidence-based and activity-based principles in dysphagia management.

References

  • Adams, V., Mathisen, B., Baines, S., Lazarus, C., & Callister, R. (2013). A systematic review and meta-analysis of measurements of tongue and hand strength and endurance using the Iowa oral performance instrument (IOPI). Dysphagia, 28, 350–369.10.1007/s00455-013-9451-3
  • Altman, K. W., Yu, G. P., & Schaefer, S. D. (2010). Consequence of dysphagia in the hospitalized patient: Impact on prognosis and hospital resources. Archives of Otolaryngology-Head & Neck Surgery, 136, 784–789.10.1001/archoto.2010.129
  • Andrich, D., de Jong, J. H. A. L., & Sheridan, B. E. (1997). Diagnostic opportunities with the Rasch model for ordered response categories. In J. Rost & R. Langeheine (Eds.), Applications of latent trait and latent class models in the social sciences (pp. 59–70). Münster: Waxmann.
  • Andrich, D., & Luo, G. (2002). Conditional pairwise estimation in the Rasch model for ordered response categories using principal components. Journal of Applied Measurement, 4, 205–221.
  • Andrich, D., Lyne, A., Sheridan, B., & Luo, G. (2010). RUMM2030. Perth: RUMM Laboratory Pty.
  • Andrich, D. (2015). Components of variance of scales with a subscale structure using two calculations of coefficient alpha. (Pensamiento Educativo) Journal of Latin American Educational Research, 52, 6–33.
  • Baglin, J. (2014). Improving your exploratory factor analysis for ordinal data: A demonstration using FACTOR. Practical Assessment, Research & Evaluation, 19(5), 1–14. Retrieved from http://pareonline.net/getvn.asp?v=19&n=5
  • Beavers, A. S., Lounsbury, J. W., Richards, J. K., Huck, S. W., Skolits, G. J., & Esquivel, S. L. (2013). Practical considerations for using exploratory factor analysis in educational research. Practical Assessment, Research & Evaluation, 18(6), 1–13. Retrieved from http://pareonline.net/pdf/v18n6.pdf
  • Canadian Occupational Therapy Association. (2015). The McGill Ingestive Skills Assessment 2: Improvements in bedside dysphagia assessment. Retrieved from https://www.caot.ca/store/SearchResults.aspx?searchterm=MISA+2&searchoption=ALL
  • Chen, W. H., Lenderking, W., Jin, Y., Wyrwich, K. W., Gelhorn, H., & Revicki, D. A. (2014). Is Rasch model analysis applicable in small sample size pilot studies for assessing item characteristics? An example using PROMIS pain behavior item bank data. Quality of Life Research, 23, 485–493.10.1007/s11136-013-0487-5
  • Christensen, K. B., Kreiner, S., & Mesbah, M. (Eds.). (2013). Rasch models in health. Hoboken, NJ: Wiley.
  • Christensen, K. B., Makransky, G., & Horton, M. (2015). Critical Values for Yen’s Q3: Identification of local dependence in the Rasch model using residual correlations [Research report 15/5]. Department of Biostatistics, University of Copenhagen. Retrieved from https://ifsv.sund.ku.dk/biostat/biostat_annualreport/index.php5/Research_reports
  • Cichero, J., & Clavé, P. (2012). Stepping stones to living well with dysphagia. Gerontology, 58, 481–570.
  • Clark, G., Avery-Smith, W., Wold, L., Anthony, P., & Holm, S. (2007). Specialized knowledge and skills in feeding, eating, and swallowing for occupational therapy practice. The American Journal of Occupational Therapy, 61, 686–700.
  • Danish Health Authority. (2015). National clinical guideline for oropharyngeal dysphagia: Screening, assessment and selected initiatives. Retrieved from https://www.sst.dk/en/publications/2015/~/media/3BBE1F1B05354CC3A7080032E7D6F1A8.ashx
  • de Vet, H. C., Terwee, C. B., Mokkink, L. B., & Knol, D. L. (2011). Measurement in medicine. Cambridge: Cambridge University Press.10.1017/CBO9780511996214
  • Fisher, W. P. (1992). Reliability statistics. Rasch measurement transactions, 6, 238.
  • Hagell, P. (2014). Testing rating scale unidimensionality using the principal component analysis (PCA)/t-test protocol with the Rasch model: The primary of theory over statistics. Open Journal of Statistics, 4, 456–465.10.4236/ojs.2014.46044
  • Hagquist, C., Bruce, M., & Gustavsson, J. P. (2009). Using the Rasch model in nursing research: An introduction and illustrative example. International Journal of Nursing Studies, 46, 380–393.10.1016/j.ijnurstu.2008.10.007
  • Hansen, T., Kjaersgaard, A., & Faber, J. (2011). Measuring elderly dysphagic patients’ performance in eating—A review. Disability and Rehabilitation, 33, 1931–1940.10.3109/09638288.2011.553706
  • Hansen, T., Lambert, H. C., & Faber, J. (2011). Content validation of a Danish version of “The McGill Ingestive Skills Assessment” for dysphagia management. Scandinavian Journal of Occupational Therapy, 18, 282–293.10.3109/11038128.2010.521949
  • Hansen, T., Lambert, H. C., & Faber, J. (2012a). Reliability of the Danish version of the McGill Ingestive Skills Assessment for observation-based measures during meals. Scandinavian Journal of Occupational Therapy, 19, 488–496.10.3109/11038128.2012.674552
  • Hansen, T., Lambert, H. C., & Faber, J. (2012b). Validation of the Danish version of the McGill Ingestive Skills Assessment using classical test theory and the Rasch model. Disability and Rehabilitation, 34, 859–868.10.3109/09638288.2011.624249
  • Hansen, T. (2014). Further validation of the Danish version of the McGill Ingestive Skills Assessment (MISA-DK): Rasch analysis. International Journal of Therapy & Rehabilitation, 21, 26–34.
  • Hansen, T., Madsen, E. E., & Sørensen, A. (2016). The effect of rater training on scoring performance and scale-specific expertise amongst occupational therapists participating in a multicentre study: A single-group pre–post-test study. Disability and Rehabilitation, 38, 1216–1226.10.3109/09638288.2015.1076069
  • Holgado-Tello, F. P., Chacon-Moscoso, S., Barbero-Garcia, I., & Vila-Abad, E. (2010). Polychoric versus Pearson correlations in exploratory and confirmatory factor analysis of ordinal variables. Quality & Quantity, 44, 153–166.10.1007/s11135-008-9190-y
  • Klinke, M. E., Wilson, M. E., Hafsteinsdóttir, T. B., & Jónsdóttir, H. (2013). Recognizing new perspectives in eating difficulties following stroke: A concept analysis. Disability and Rehabilitation, 35, 1491–1500.10.3109/09638288.2012.736012
  • Lambert, H. C., Gisel, E. G., Wood-Dauphine, S., Groher, M. E., & Abrahamowicz, M. (2006). McGill Ingestive Skills Assessment: User’s manual. Ottawa: Canadian Association of Occupational Therapists.
  • Leopold, N. A., & Kagel, M. C. (1997). Dysphagia—Ingestion or deglutition?: A proposed paradigm. Dysphagia, 12, 202–206.10.1007/PL00009537
  • Lorenzo-Seva, U., & Ferrando, P. J. (2006). FACTOR: A computer program to fit the exploratory factor analysis model. Behavior Research Methods, 38, 88–91.10.3758/BF03192753
  • Marais, I., & Andrich, D. (2008). Formalizing dimension and response violations of local independence in the unidimensional Rasch model. Journal of Applied Measurement, 9, 200–215.
  • Martino, R., Beaton, D., & Diamant, N. E. (2009). Using different perspectives to generate items for a new scale measuring medical outcomes of dysphagia (MOD). Journal of Clinical Epidemiology, 62, 518–526.10.1016/j.jclinepi.2008.05.007
  • Nilsson, Å. L., & Tennant, A. (2011). Past and present issues in Rasch analysis: The functional independence measure (FIMTM) revisited. Journal of Rehabilitation Medicine, 43, 884–891.10.2340/16501977-0871
  • Pallant, J. F., & Tennant, A. (2007). An introduction to the Rasch measurement model: An example using the Hospital Anxiety and Depression Scale (HADS). British Journal of Clinical Psychology, 46(1), 1–18.10.1348/014466506X96931
  • Reise, S. P., Morizot, J., & Hays, R. D. (2007). The role of the bifactor model in resolving dimensionality issues in health outcomes measures. Quality of Life Research, 16, 19–31.10.1007/s11136-007-9183-7
  • Roden, D. F., & Altman, K. W. (2013). Causes of dysphagia among different age groups. Otolaryngologic Clinics of North America, 46, 965–987.10.1016/j.otc.2013.08.008
  • Scottish Intercollegiate Guidelines Network. (2010). Management of Patients with Stroke: Identification and Management of Dysphagia. A National Clinical Guideline (No.119). Edinburgh. Retrieved from http://www.sign.ac.uk/pdf/sign119.pdf
  • Shaker, R., Easterling, C., Belafsky, P. C., & Postma, G. N. (Eds.). (2012). Manual of diagnostic and therapeutic techniques for disorders of deglutition. New York, NY: Springer Science & Business Media.
  • Smith, E. V. (2002). Detecting and evaluating the impact of multidimensionality using item fit statistics and principal component analysis of residuals. Journal of Applied Measurement, 3, 205–231.
  • Tennant, A., & Conaghan, P. G. (2007). The Rasch measurement model in rheumatology: What is it and why use it? When should it be applied, and what should one look for in a Rasch paper? Arthritis & Rheumatism, 57, 1358–1362.10.1002/(ISSN)1529-0131
  • Tennant, A., & Pallant, J. F. (2007). DIF matters: A practical approach to test if differential item functioning makes a difference. Rasch Measurement Transactions, 20, 1082–1084.