References
- Akyol, Ş. P., Krishna, K., & Wang, J. (2018). Taking PISA seriously, how accurate are low stakes exams (Working Paper No. 24930). http://www.nber.org/papers/w24930
- Bandalos, D. L., & Finney, S. J. (2010). Factor analysis: Exploratory and confirmatory. In G. R. Hancock & R. O. Mueller (Eds.), The reviewer's guide to quantitative methods in the social sciences (pp. 93–114). Routledge.
- Baumert, J., & Demmrich, A. (2001). Test motivation in the assessment of student skills: The effects of incentives on motivation and performance. European Journal of Psychology of Education, 16(3), 441–462. https://doi.org/https://doi.org/10.1007/BF03173192
- Bentler, P. M. (2005). EQS 6 structural equations program manual. Multivariate Software.
- Bollen, K. A. (1989). Structural equations with latent variables. John Wiley & Sons.
- Borgonovi, F., & Biecek, P. (2016). An international comparison of students’ ability to endure fatigue and maintain motivation during a low-stakes test. Learning and Individual Differences, 49, 128–137. https://doi.org/https://doi.org/10.1016/j.lindif.2016.06.001
- Bradbury-Jones, C., Taylor, J., & Herber, O. R. (2014). Vignette development and administration: A framework for protecting research participants. International Journal of Social Research Methodology, 17(4), 427–440. https://doi.org/https://doi.org/10.1080/13645579.2012.750833
- Brown, G. T. L. (2008). Students’ conceptions of assessment inventory (SCoA Version VI) [Measurement instrument]. University of Auckland. https://doi.org/https://doi.org/10.17608/k6.auckland.4596820.v1
- Brown, G. T. L. (2011). Self-regulation of assessment beliefs and attitudes: A review of the students’ conceptions of assessment inventory. Educational Psychology, 31(6), 731–748. https://doi.org/https://doi.org/10.1080/01443410.2011.599836
- Brown, G. T. L. (2013). Student conceptions of assessment across cultural and contextual differences. In G. A. D. Liem & A. B. I. Bernardo (Eds.), Advancing cross-cultural perspectives on educational psychology (pp. 143–167). Information Age Publishing.
- Brown, G. T. L., Harris, L. R., O’Quin, C., & Lane, K. E. (2017). Using multi-group confirmatory factor analysis to evaluate cross-cultural research: Identifying and understanding non-invariance. International Journal of Research & Method in Education, 40(1), 66–90. https://doi.org/https://doi.org/10.1080/1743727X.2015.1070823
- Brown, G. T. L., & Hirschfeld, G. H. F. (2008). Students’ conceptions of assessment: Links to outcomes. Assessment in Education: Principles, Policy & Practice, 15(1), 3–17. https://doi.org/https://doi.org/10.1080/09695940701876003
- Brown, G. T. L., Hui, S. K. F., Yu, F. W. M., & Kennedy, K. J. (2011). Teachers’ conceptions of assessment in Chinese contexts: A tripartite model of accountability, improvement, and irrelevance. International Journal of Educational Research, 50(5–6), 307–320. https://doi.org/https://doi.org/10.1016/j.ijer.2011.10.003
- Brown, G. T. L., Irving, S. E., Peterson, E. R., & Hirschfeld, G. H. F. (2009). Use of interactive–informal assessment practices: New Zealand secondary students’ conceptions of assessment. Learning and Instruction, 19(2), 97–111. https://doi.org/https://doi.org/10.1016/j.learninstruc.2008.02.003
- Brown, G. T. L., Peterson, E. R., & Irving, S. E. (2009). Beliefs that make a difference: Adaptive and maladaptive self-regulation in students’ conceptions of assessment. In D. M. McInerney, G. T. L. Brown, & G. A. D. Liem (Eds.), Student perspectives on assessment: What students can tell us about assessment for learning (pp. 159–186). Information Age Publishing.
- Brown, G. T. L., Peterson, E. R., & Yao, E. S. (2016). Student conceptions of feedback: Impact on self-regulation, self-efficacy, and academic achievement. British Journal of Educational Psychology, 86(4), 606–629. https://doi.org/https://doi.org/10.1111/bjep.12126
- Brown, G. T. L., & Walton, K. F. (2017). The effect of conceptions of assessment upon reading achievement: An evaluation of the influence of self-efficacy and interest. Interdisciplinary Education and Psychology, 1(1): 3. https://doi.org/https://doi.org/10.31532/InterdiscipEducPsychol.1.1.003
- Burnham, K. P., & Anderson, D. R. (2004). Multimodel inference: Understanding AIC and BIC in model selection. Sociological Methods & Research, 33(2), 261–304. https://doi.org/https://doi.org/10.1177/0049124104268644
- Byrne, B. M. (2013). Structural equation modelling with AMOS: Basic concepts, applications, and programming (2nd ed.). Routledge.
- Chen, J., & Brown, G. T. L. (2018). Chinese secondary school students’ conceptions of assessment and achievement emotions: Endorsed purposes lead to positive and negative feelings. Asia Pacific Journal of Education, 38(1), 91–109. https://doi.org/https://doi.org/10.1080/02188791.2018.1423951
- Cheung, G. W., & Rensvold, R. B. (2002). Evaluating goodness-of-fit indexes for testing measurement invariance. Structural Equation Modeling: A Multidisciplinary Journal, 9(2), 233–255. https://doi.org/https://doi.org/10.1207/S15328007SEM0902_5
- Cheung, T. K. Y. (2008). An assessment blueprint in curriculum reform. Journal of Quality School Education, 5, 23–37.
- Cronbach, L. J. (1960). Essentials of psychological testing (2nd ed.). Harper & Row.
- Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood estimation from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological), 39(1), 1–22. https://doi.org/https://doi.org/10.1111/j.2517-6161.1977.tb01600.x
- Deneen, C. C., Brown, G. T. L., Bond, T. G., & Shroff, R. H. (2013). Understanding outcome-based education changes in teacher education: Evaluation of a new instrument with preliminary findings. Asia-Pacific Journal of Teacher Education, 41(4), 441–456. https://doi.org/https://doi.org/10.1080/1359866X.2013.787392
- Dorans, N. J. (2012). The contestant perspective on taking tests: Emanations from the statue within. Educational Measurement: Issues and Practice, 31(4), 20–37. https://doi.org/https://doi.org/10.1111/j.1745-3992.2012.00250.x
- Dorgan, M. (2000, July 9). In China, examinations determine fate. Chicago Tribune. https://www.chicagotribune.com/news/ct-xpm-2000-07-09-0007090206-story.html
- Duckworth, A. L., Quinn, P. D., Lynam, D. R., Loeber, R., & Stouthamer-Loeber, M. (2011). Role of test motivation in intelligence testing. Proceedings of the National Academy of Sciences, 108(19), 7716–7720. https://doi.org/https://doi.org/10.1073/pnas.1018601108
- Eccles, J. S., & Wigfield, A. (2002). Motivational beliefs, values, and goals. Annual Review of Psychology, 53, 109–132. https://doi.org/https://doi.org/10.1146/annurev.psych.53.100901.135153
- Eklöf, H. (2007). Test-taking motivation and mathematics performance in TIMSS 2003. International Journal of Testing, 7(3), 311–326. https://doi.org/https://doi.org/10.1080/15305050701438074
- Eklöf, H. (2008). Test-taking motivation on low-stakes tests: A Swedish TIMSS 2003 example. In M. von Davier & D. Hastedt (Eds.), IERI monograph series: Vol. 1. Issues and methodologies in large-scale assessments (pp. 9–21). IEA-ETS Research Institute.
- Eklöf, H. (2010). Skill and will: Test-taking motivation and assessment quality. Assessment in Education: Principles, Policy and Practice, 17(4), 345–356. https://doi.org/https://doi.org/10.1080/0969594X.2010.516569
- Eklöf, H., & Knekta, E. (2017). Using large-scale educational data to test motivation theories: A synthesis of findings from Swedish studies on test-taking motivation. International Journal of Quantitative Research in Education, 4(1–2), 52–71. https://doi.org/https://doi.org/10.1504/IJQRE.2017.086499
- Eklöf, H., & Nyroos, M. (2013). Pupil perceptions of national tests in science: Perceived importance, invested effort, and test anxiety. European Journal of Psychology of Education, 28(2), 497–510. https://doi.org/https://doi.org/10.1007/s10212-012-0125-6
- Eklöf, H., Pavešič, B. J., & Grønmo, L. S. (2014). A cross-national comparison of reported effort and mathematics performance in TIMSS advanced. Applied Measurement in Education, 27(1), 31–45. https://doi.org/https://doi.org/10.1080/08957347.2013.853070
- Fan, X., & Sivo, S. A. (2007). Sensitivity of fit indices to model misspecification and model types. Multivariate Behavioral Research, 42(3), 509–529. https://doi.org/https://doi.org/10.1080/00273170701382864
- Finney, S. J., & DiStefano, C. (2013). Non-normal and categorical data in structural equation modeling. In G. R. Hancock & R. O. Mueller (Eds.), Structural equation modeling: A second course (pp. 439–492). Information Age Publishing.
- Finney, S. J., Mathers, C. E., & Myers, A. J. (2016). Investigating the dimensionality of examinee motivation across instruction conditions in low-stakes testing contexts. Research & Practice in Assessment, 11, 5–17.
- Finney, S. J., Myers, A. J., & Mathers, C. E. (2018). Test instructions do not moderate the indirect effect of perceived test importance on test performance in low-stakes testing contexts. International Journal of Testing, 18(4), 297–322. https://doi.org/https://doi.org/10.1080/15305058.2017.1396466
- Flores, M. A., Brown, G., Pereira, D., Coutinho, C., Santos, P., & Pinheiro, C. (2019). Portuguese university students’ conceptions of assessment: Taking responsibility for achievement. Higher Education, 79(3), 377–394. https://doi.org/https://doi.org/10.1007/s10734-019-00415-2
- Gao, L., & Watkins, D. A. (2002). Conceptions of teaching held by school science teachers in P.R. China: Identification and cross-cultural comparisons. International Journal of Science Education, 24(1), 61–79. https://doi.org/https://doi.org/10.1080/09500690110066926
- Gao, L., & Watkins, D. (2001). Identifying and assessing the conceptions of teaching of secondary school physics teachers in China. British Journal of Educational Psychology, 71(3), 443–469. https://doi.org/https://doi.org/10.1348/000709901158613
- Gao, M. (2010). Student perspectives of school-based assessment [Unpublished doctoral dissertation]. University of Hong Kong.
- Gneezy, U., List, J. A., Livingston, J. A., Qin, X., Sadoff, S., & Xu, Y. (2019). Measuring success in education: The role of effort on the test itself. American Economic Review: Insights, 1(3), 291–308. https://doi.org/https://doi.org/10.1257/aeri.20180633
- Hancock, G. R. (2001). Effect size, power, and sample size determination for structured means modeling and mimic approaches to between-groups hypothesis testing of means on a single latent construct. Psychometrika, 66(3), 373–388. https://doi.org/https://doi.org/10.1007/BF02294440
- Hawthorne, K. A., Bol, L., Pribesh, S., & Suh, Y. (2015). Effects of motivational prompts on motivation, effort, and performance on a low-stakes standardized test. Research Practice in Assessment, 10(1), 30–38.
- Hirschfeld, G. H. F., & Brown, G. T. L. (2009). Students’ conceptions of assessment: Factorial and structural invariance of the SCoA across sex, age, and ethnicity. European Journal of Psychological Assessment, 25(1), 30–38. https://doi.org/https://doi.org/10.1027/1015-5759.25.1.30
- Hong, S., Malik, M. L., & Lee, M.-K. (2003). Testing configural, metric, scalar, and latent mean invariance across genders in sociotropy and autonomy using a non-western sample. Educational and Psychological Measurement, 63(4), 636–654. https://doi.org/https://doi.org/10.1177/0013164403251332
- Hopfenbeck, T. N., & Kjærnsli, M. (2016). Students’ test motivation in PISA: The case of Norway. The Curriculum Journal, 27(3), 406–422. https://doi.org/https://doi.org/10.1080/09585176.2016.1156004
- Hu, L., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling: A Multidisciplinary Journal, 6(1), 1–55. https://doi.org/https://doi.org/10.1080/10705519909540118
- Hughes, R., & Huby, M. (2004). The construction and interpretation of vignettes in social research. Social Work and Social Sciences Review, 11(1), 36–51. https://doi.org/https://doi.org/10.1921/17466105.11.1.36
- Jenkins, N., Bloor, M., Fischer, J., Berney, L., & Neale, J. (2010). Putting it in context: The use of vignettes in qualitative interviewing. Qualitative Research, 10(2), 175–198. https://doi.org/https://doi.org/10.1177/1468794109356737
- Jin, D., & Nida, E. A. (2006). On translation: An expanded edition. City University of Hong Kong Press.
- Kline, R. B. (2016). Methodology in the social sciences. Principles and practice of structural equation modeling (4th ed.). The Guilford Press.
- Knekta, E. (2017). Are all pupils equally motivated to do their best on all tests? Differences in reported test-taking motivation within and between tests with different stakes. Scandinavian Journal of Educational Research, 61(1), 95–111. https://doi.org/https://doi.org/10.1080/00313831.2015.1119723
- Knekta, E., & Eklöf, H. (2015). Modeling the test-taking motivation construct through investigation of psychometric properties of an expectancy-value-based questionnaire. Journal of Psychoeducational Assessment, 33(7), 662–673. https://doi.org/https://doi.org/10.1177/0734282914551956
- Lam, T. C. M., & Klockars, A. J. (1982). Anchor point effects on the equivalence of questionnaire items. Journal of Educational Measurement, 19(4), 317–322. https://doi.org/https://doi.org/10.1111/j.1745-3984.1982.tb00137.x
- Lau, A. R., Swerdzewski, P. J., Jones, A. T., Anderson, R. D., & Markle, R. E. (2009). Proctors matter: Strategies for increasing examinee effort on general education program assessments. The Journal of General Education, 58(3), 196–217. https://doi.org/https://doi.org/10.1353/jge.0.0045
- Li, J. (2002). A cultural model of learning: Chinese “heart and mind for wanting to learn”. Journal of Cross-Cultural Psychology, 33(3), 248–269. https://doi.org/https://doi.org/10.1177/0022022102033003003
- Liu, O. L., Bridgeman, B., & Adler, R. M. (2012). Measuring learning outcomes in higher education: Motivation matters. Educational Researcher, 41(9), 352–362. https://doi.org/https://doi.org/10.3102/0013189X12459679
- Liu, O. L., Rios, J. A., & Borden, V. (2015). The effects of motivational instruction on college students’ performance on low-stakes assessment. Educational Assessment, 20(2), 79–94. https://doi.org/https://doi.org/10.1080/10627197.2015.1028618
- MacCallum, R. C., Roznowski, M., & Necowitz, L. B. (1992). Model modifications in covariance structure analysis: The problem of capitalization on chance. Psychological Bulletin, 111(3), 490–504. https://doi.org/https://doi.org/10.1037/0033-2909.111.3.490
- Marsh, H. W., Hau, K.-T., & Wen, Z. (2004). In search of golden rules: Comment on hypothesis-testing approaches to setting cutoff values for fit indexes and dangers in overgeneralizing Hu and Bentler’s (1999) findings. Structural Equation Modeling, 11(3), 320–341. https://doi.org/https://doi.org/10.1207/s15328007sem1103_2
- Matos, D. A. S., & Brown, G. T. L. (2015). Comparing university student conceptions of assessment: Brazilian and New Zealand beliefs. In C. Carvalho & J. Conboy (Eds.), Feedback, identidade, trajetórias escolares: Dinâmicas e consequências (pp. 177–194). Universidade de Lisboa, Instituto de Educação.
- Matos, D. A. S., Brown, G. T. L., & Gomes, C. (2019). Bifactor invariance analysis of Student Conceptions of Assessment Inventory. Psico-USF, 24(4), 737–750. https://doi.org/https://doi.org/10.1590/1413-82712019240411
- Myers, A. J., & Finney, S. J. (2019). Does it matter if examinee motivation is measured before or after a low-stakes test? A moderated mediation analysis. Educational Assessment, 26(1), 1–19. https://doi.org/https://doi.org/10.1080/10627197.2019.1645591
- Myers, A. J., & Finney, S. J. (2021). Change in self-reported motivation before to after test completion: Relation with performance. The Journal of Experimental Education, 89(1), 74–94. https://doi.org/https://doi.org/10.1080/00220973.2019.1680942
- Organisation for Economic Co-operation and Development. (2011). Strong performers and successful reformers in education: Lessons from PISA for the United States. https://doi.org/https://doi.org/10.1787/2220363x
- Organisation for Economic Co-operation and Development. (2014). PISA 2012 technical report. https://www.oecd.org/pisa/pisaproducts/PISA-2012-technical-report-final.pdf
- Paddam, A., Barnes, D., & Langdon, D. (2010). Constructing vignettes to investigate anger in multiple sclerosis. Nurse Researcher, 17(2), 60–73. https://doi.org/https://doi.org/10.7748/nr2010.01.17.2.60.c7463
- Pekrun, R., Goetz, T., Titz, W., & Perry, R. P. (2002). Academic emotions in students’ self-regulated learning and achievement: A program of qualitative and quantitative research. Educational Psychologist, 37(2), 91–105. https://doi.org/https://doi.org/10.1207/S15326985EP3702_4
- Penk, C., & Richter, D. (2017). Change in test-taking motivation and its relationship to test performance in low-stakes assessments. Educational Assessment, Evaluation and Accountability, 29(1), 55–79. https://doi.org/https://doi.org/10.1007/s11092-016-9248-7
- Penk, C., & Schipolowski, S. (2015). Is it all about value? Bringing back the expectancy component to the assessment of test-taking motivation. Learning and Individual Differences, 42, 27–35. https://doi.org/https://doi.org/10.1016/j.lindif.2015.08.002
- Peterson, E. R., Brown, G. T. L., & Hamilton, R. J. (2013). Cultural differences in tertiary students’ conceptions of learning as a duty and student achievement. International Journal of Quantitative Research in Education, 1(2), 167–181. https://doi.org/https://doi.org/10.1504/IJQRE.2013.056462
- Putwain, D. (2008). Do examinations stakes moderate the test anxiety–examination performance relationship? Educational Psychology, 28(2), 109–118. https://doi.org/https://doi.org/10.1080/01443410701452264
- Rosseel, Y. (2012). lavaan : An R package for structural equation modeling. Journal of Statistical Software, 48(2), 1–36. https://doi.org/https://doi.org/10.18637/jss.v048.i02
- Rutkowski, D., & Wild, J. (2015). Stakes matter: Student motivation and the validity of student assessments for teacher evaluation. Educational Assessment, 20(3), 165–179. https://doi.org/https://doi.org/10.1080/10627197.2015.1059273
- Satorra, A., & Bentler, P. M. (2001). A scaled difference chi-square test statistic for moment structure analysis. Psychometrika, 66(4), 507–514. https://doi.org/https://doi.org/10.1007/BF02296192
- Shanghai Government. (2013). Shanghai statistical yearbook 2013. https://tjj.sh.gov.cn/tjnj/zgsh/tjnj2013en.html
- Silm, G., Must, O., & Täht, K. (2013). Test-taking effort as a predictor of performance in low-stakes tests. TRAMES: A Journal of the Humanities and Social Sciences, 17(4), 433–488. https://doi.org/https://doi.org/10.3176/tr.2013.4.08
- Skilling, K., & Stylianides, G. J. (2020). Using vignettes in educational research: A framework for vignette construction. International Journal of Research & Method in Education, 43(5), 541–556. https://doi.org/https://doi.org/10.1080/1743727X.2019.1704243
- Smith, L. F., & Smith, J. K. (2004). The influence of test consequence on national examinations. North American Journal of Psychology, 6(1), 13–25. http://scholar.google.com/scholar?hl=en&btnG=Search&q=intitle:The+Influence+of+Test+Consequence+on+National+Examinations#0
- Teo, T., & Kam, C. (2014). A measurement invariance analysis of the General Self-Efficacy Scale on two different cultures. Journal of Psychoeducational Assessment, 32(8), 762–767. https://doi.org/https://doi.org/10.1177/0734282914531707
- Thelk, A. D., Sundre, D. L., Horst, S. J., & Finney, S. J. (2009). Motivation matters: Using the Student Opinion Scale to make valid inferences about student performance. The Journal of General Education, 58(3), 129–151. https://doi.org/https://doi.org/10.1353/jge.0.0047
- Torres, S. (2009). Vignette methodology and culture-relevance: Lessons learned through a project on successful aging with Iranian immigrants to Sweden. Journal of Cross-Cultural Gerontology, 24, 93–114. https://doi.org/https://doi.org/10.1007/s10823-009-9095-9
- Tran, T. V., Nguyen, T., & Chan, K. (2017). Adopting or adapting existing instruments. In T. V. Tran, T. H. Nguyen, & K. T. Chan (Eds.), Developing cross-cultural measurement in social work research and evaluation (2nd ed.). Oxford University Press. https://doi.org/https://doi.org/10.1093/acprof:oso/9780190496470.003.0003
- Vandenberg, R. J., & Lance, C. E. (2000). A review and synthesis of the measurement invariance literature: Suggestions, practices, and recommendations for organizational research. Organizational Research Methods, 3(1), 4–70. https://doi.org/https://doi.org/10.1177/109442810031002
- Wang, Z., & Brown, G. T. L. (2014). Hong Kong tertiary students’ conceptions of assessment of academic ability. Higher Education Research and Development, 33(5), 1063–1077. https://doi.org/https://doi.org/10.1080/07294360.2014.890565
- Weekers, A. M., Brown, G. T. L., & Veldkamp, B. P. (2009). Analyzing the dimensionality of the Students’ Conceptions of Assessment (ScoA) inventory. In D. M. McInerney, G. T. L. Brown, & G. A. D. Liem (Eds.), Student perspectives on assessment: What students can tell us about assessment for learning (pp. 133–157). Information Age Publishing.
- Wise, S. L. (2015). Effort analysis: Individual score validation of achievement test data. Applied Measurement in Education, 28(3), 237–252. https://doi.org/https://doi.org/10.1080/08957347.2015.1042155
- Wise, S. L., & Cotten, M. R. (2009). Test-taking effort and score validity: The influence of student conceptions of assessment. In D. M. McInerney, G. T. L. Brown, & G. A. D. Liem (Eds.), Student perspectives on assessment: What students can tell us about assessment for learning (pp. 187–205). Information Age Publishing.
- Wise, S. L., & DeMars, C. E. (2005). Low examinee effort in low-stakes assessment: Problems and potential solutions. Educational Assessment, 10(1), 1–17. https://doi.org/https://doi.org/10.1207/s15326977ea1001_1
- Wise, S. L., & DeMars, C. E. (2010). Examinee noneffort and the validity of program assessment results. Educational Assessment, 15(1), 27–41. https://doi.org/https://doi.org/10.1080/10627191003673216
- Wise, S. L., Soland, J., & Bo, Y. (2019). The (non)impact of differential test taker engagement on aggregated scores. International Journal of Testing, 20(1), 57–77. https://doi.org/https://doi.org/10.1080/15305058.2019.1605999
- Wolf, L. F., & Smith, J. K. (1995). The consequence of consequence: Motivation, anxiety, and test performance. Applied Measurement in Education, 8(3), 227–242. https://doi.org/https://doi.org/10.1207/s15324818ame0803_3
- Zamarro, G., Hitt, C., & Mendez, I. (2019). When students don’t care: Reexamining international differences in achievement and student effort. Journal of Human Capital, 13(4), 519–552. https://doi.org/https://doi.org/10.1086/705799
- Zhao, Y. (2014). Who’s afraid of the big bad dragon: Why China has the best (and worst) education system in the world. Jossey-Bass.
- Zhou, L. (2017). Gao zhong sheng shuxue cuotiji jianli yu shiyong de tansuo [The exploration of establishment and use in mathematical errors of senior high school students] [Master’s thesis, Hunan Institute of Science and Technology]. CNKI. http://gb.oversea.cnki.net/KCMS/detail/detail.aspx?filename=1017257688.nh&dbcode=CMFD&dbname=CMFDTEMP
- Zilberberg, A., Finney, S. J., Marsh, K. R., & Anderson, R. D. (2014). The role of students’ attitudes and test-taking motivation on the validity of college institutional accountability tests: A path analytic model. International Journal of Testing, 14(4), 360–384. https://doi.org/https://doi.org/10.1080/15305058.2014.928301