Agreement between
Categorical Measurements
Kappa Statistics: an index to correct chance agreement.
Hypothetical Example: There are two doctors to diagnosis the same 29 patients.
| |
Doctor A |
|
No |
Yes |
Total |
| Doctor B |
No |
10 (34.5%) |
7 (24.1%) |
17 (58.6%) |
Yes |
0 (0.0%) |
12 (41.4%) |
12 (41.4%) |
Total |
10 (34.5%) |
19 (65.5%) |
29 |
Definition:
Kappa = (Observed agreement - Chance agreement)/(Maximum possible agreement - Chance
agreement)
Agreement = (10 + 12)/29 = 0.76 (observed agreement)
Chance agreement = 0.586 * 0.345 + 0.655 * 0.414 = 0.474
Kappa = (0.76 - 0.474)/(1 - 0.474) = 0.54
What is good agreement? (from Landis and Koch, 1977)
A rough guide is provided, but this should not be taken too seriously.
| Kappa |
Strength of agreement |
| 0.00 |
Poor |
| 0.01-0.20 |
Slight |
| 0.21-0.40 |
Fair |
| 0.41-0.60 |
Moderate |
| 0.61-0.80 |
Substantial |
| 0.81-1.00 |
Almost perfect |
The above Kappa of 0.54 is estimated from a fairly small sample of patients. What is
the standard error of the kappa coefficient? The required formula is rather complicated
and will not be presented here. Just let you know the standard error of the above
kappa is 0.134. The 95% confidence interval of kappa value is (0.279, 0.805). So in
terms of the statements in the above guide, agreement here is somewhere between fair and
almost perfect.
Calculator: Please fill out the following four required non-negative integers.
SAS Program:
data A;
input a b c;
cards;
1 1 12
1 0 7
0 1 0
0 0 10
;
proc freq;
tables a*b/agree;
weight c;
run;
SAS Output:
Simple Kappa Coefficient
------------------------
95% Confidence Bounds
Kappa = 0.542 ASE = 0.134 0.279 0.805
Sample Size = 29
Reference:
Dunn G, Everitt B. Clinical Biostatistics: An Introduction to
Evidence-Based Medicine. London: Edward Arnold 1995.
Everitt BS. Statistical Methods for Medical Investigations, 2nd ed.
London: Edward Arnold 1994.
By Jen-Hsiang Chuang, M.D., M.S.
Department of Medical Informatics
Columbia University
Last modified: 10/22/99