5,744
Views
53
CrossRef citations to date
0
Altmetric
Articles

Using Item Response Theory to Develop a 60-Item Representation of the NEO PI–R Using the International Personality Item Pool: Development of the IPIP–NEO–60

, , , , &
Pages 4-15 | Received 19 Apr 2017, Published online: 31 Oct 2017
 

ABSTRACT

Given advantages of freely available and modifiable measures, an increase in the use of measures developed from the International Personality Item Pool (IPIP), including the 300-item representation of the Revised NEO Personality Inventory (NEO PI–R; Costa & McCrae, Citation1992a) has occurred. The focus of this study was to use item response theory to develop a 60-item, IPIP-based measure of the Five-Factor Model (FFM) that provides equal representation of the FFM facets and to test the reliability and convergent and criterion validity of this measure compared to the NEO Five Factor Inventory (NEO-FFI). In an undergraduate sample (n = 359), scores from the NEO-FFI and IPIP–NEO–60 demonstrated good reliability and convergent validity with the NEO PI–R and IPIP–NEO–300. Additionally, across criterion variables in the undergraduate sample as well as a community-based sample (n = 757), the NEO-FFI and IPIP–NEO–60 demonstrated similar nomological networks across a wide range of external variables (rICC = .96). Finally, as expected, in an MTurk sample the IPIP–NEO–60 demonstrated advantages over the Big Five Inventory–2 (Soto & John, Citation 2017; n = 342) with regard to the Agreeableness domain content. The results suggest strong reliability and validity of the IPIP–NEO–60 scores.

Notes

1 The marginal reliability coefficient is calculated by transforming values of test information at a given level of θ, or Iθ, given by the formula ρθ=1 – (1/Iθ2). This coefficient is consistent with the classical definition of reliability but refers to the reliability of IRT-estimated θ (see Raju, Price, Oshima, & Nering, Citation2006). With regard to fit indexes other than root mean square error of approximation (RMSEA), we note that all null models (i.e., scales if we constrain all of the a parameters to 0) were all between .08 and .11. It is suggested that when the null model has an RMSEA of less than .158, a measure of Tucker–Lewis Index (TLI) or comparative fit index (CFI) is not informative, as the value will always be less than .90 and thus never appear acceptable, even though it might be (e.g., White et al., Citation2015). Consistent with this, TLI for the IPIP–NEO–60 ranged from .60 to .77, and CFI ranged from .60 to .77, which are below limits typically noted as acceptable, but likely due to mathematical requirements regarding our null model.

2 A series of regression analyses using the minifacets suggests that the IPIP–NEO–60 agreeableness statistical predictive advantage relative to BFI–2 was due to content from the facets of immodesty and low straightforwardness. Conversely, the BFI–2's advantage in relation to reactive aggression and Factor 2 psychopathy traits was primarily due to its low trust facet.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 344.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.