A simulation study of rater agreement measures with 2x2 contingency tables

Authors

  • Manuel Ato Universidad de Murcia (Spain)
  • Juan José López Universidad de Murcia (Spain)
  • Ana Benavente Universidad de Murcia (Spain)

Abstract

A comparison between six rater agreement measures obtained using three different approaches was achieved by means of a simulation study. Rater coefficients suggested by Bennet’s (1954), Scott’s (1955), Cohen’s (1960) and Gwet’s (2008) were selected to represent the classical, descriptive approach, agreement parameter from Aickin (1990) to represent loglinear and mixture model approaches and measure from Martín and Femia (2004) to represent multiple-choice test. Main results confirm that and descriptive measures present high levels of mean bias in presence of extreme values of prevalence and rater bias but small to null levels with moderate values. The best behavior was observed with Bennet and Martín and Femia agreement measures for all levels of prevalence.

Downloads

Published

2011-06-16

Issue

Section

Methodology Section