Abstract
I studied rater effects in the writing and speaking sections of the Test of German as a Foreign Language (TestDaF). Using many-facet Rasch measurement, I examined rater main effects as well as two- and three-way interactions between raters and the other facets involved, that is, examinees, rating criteria (in the writing section), and tasks (in the speaking section). A further goal was to investigate differential rater functioning related to examinee gender. Results showed that raters (a) differed strongly in the severity with which they rated examinees; (b) were fairly consistent in their overall ratings; (c) were substantially less consistent in relation to rating criteria (or speaking tasks, respectively) than in relation to examinees; and (d) as a group, were not subject to gender bias. These findings have implications for quality control and assurance in the TestDaF rater-mediated assessment system.
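The many-facet Rasch model underlying analyses of this kind can be sketched as follows. This is the standard rating-scale formulation (Linacre's many-facet Rasch model), not an equation quoted from the study itself; the symbols are illustrative:

```latex
% Log-odds of examinee n receiving category k rather than k-1
% from rater j on criterion (or task) i:
\log\!\left(\frac{P_{nijk}}{P_{nij(k-1)}}\right)
  = \theta_n - \beta_i - \alpha_j - \tau_k
% \theta_n : proficiency of examinee n
% \beta_i  : difficulty of rating criterion or task i
% \alpha_j : severity of rater j
% \tau_k  : threshold for rating category k
```

Rater main effects correspond to the severity parameters \(\alpha_j\); interaction (bias) analyses test whether a rater's severity shifts systematically for particular examinees, criteria, or tasks beyond what this additive model predicts.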