REFERENCES
- Bhola , D. S. , Impara , J. C. and Buckendahl , C. W. 2003 . Aligning tests with states’ content standards: Methods and issues . Educational Measurement: Issues and Practice , 22 ( 3 ) : 21 – 29 .
- Cohen , J. 1960 . A coefficient of agreement for nominal scales . Educational and Psychological Measurement , 20 : 37 – 46 .
- Crocker , L. , Miller , M. D. and Franks , E. A. 1989 . Quantitative methods for assessing the fit between test and curriculum . Applied Measurement in Education , 2 : 179 – 194 .
- Deville , C. 1996 . An empirical link of content and construct validity evidence . Applied Psychological Measurement , 20 : 127 – 139 .
- Hambleton , R. K. 1984 . “ Validating the test scores ” . In A guide to criterion-referenced test construction , Edited by: Berk , R. A. 199 – 230 . Baltimore, MD : The Johns Hopkins University Press .
- Harcourt Assessment . 2004 . Arizona's Instrument to Measure Standards Technical Report, 2004 , Phoenix, AZ : Arizona Department of Education .
- Hu , L. and Bentler , P. M. 1999 . Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives . Structural Equation Modeling , 6 ( 1 ) : 1 – 55 .
- Hubert , L. and Arabie , P. 1985 . Comparing partitions . Journal of Classification , 2 : 193 – 218 .
- Milligan , G. W. and Cooper , M. C. 1985 . An examination of procedures for determining the number of clusters in a data set . Psychometrika , 50 : 159 – 179 .
- Martone , A. and Sireci , S. G. 2009 . Evaluating alignment between curriculum, assessment, and instruction . Review of Educational Research , 79 : 1332 – 1361 .
- Rand , W. M. 1971 . Objective criteria for the evaluation of clustering methods . Journal of the American Statistical Association , 66 : 846 – 850 .
- Schaefer , L. , Raymond , M. and Stamps White , A. 1992 . A comparison of two methods for structuring performance domains . Applied Measurement in Education , 5 : 321 – 335 .
- Sireci , S. 1998a . The construct of content validity . Social Indicators Research , 45 : 83 – 117 .
- Sireci , S. 1998b . Gathering and analyzing content validity data . Educational Assessment , 5 : 299 – 321 .
- Sireci , S. and Geisinger , K. 1992 . Analyzing test content using cluster analysis and multidimensional scaling . Applied Psychological Measurement , 16 : 17 – 31 .
- Sireci , S. and Geisinger , K. 1995 . Using subject-matter experts to assess content representation: An MDS analysis . Applied Psychological Measurement , 19 : 241 – 255 .
- Stalans , L. J. 2001 . “ Multidimensional scaling ” . In Reading and understanding multivariate statistics , Edited by: Grimm , L. G. and Yarnold , P. R. 137 – 168 . Washington, DC : American Psychological Association .
- Trochim , W. 1989 . An introduction to concept mapping for planning and evaluation . Evaluation and Program Planning , 12 : 1 – 16 .
- Warrens , M. J. 2008 . On the equivalence of Cohen's kappa and the Hubert-Arabie adjusted Rand index . Journal of Classification , 25 : 177 – 183 .
- Webb , N. L. 2007 . Issues related to judging the alignment of curriculum standards and assessments . Applied Measurement in Education , 20 : 7 – 25 .