求教各位,当两个raters分别评估同一批病人,一个问卷有10道题目,每个题目的评级分数都为0,1,2,3,4.此外还有8,代表不知如何选择,9,代表评估员认为该题不符合该患者情况。求 interrater agreement for nominal scales
我用的是stata软件,SAS我不会,运算出现的问题是:
例1:
kap b114 bb114, tab
| bb114
b114 | 0 1 2 3 4 9 | Total
-----------+------------------------------------------------------------------+----------
0 | 29 2 1 0 0 0 | 32
1 | 1 13 0 0 0 0 | 14
2 | 0 2 9 2 0 0 | 13
3 | 0 0 0 9 0 0 | 9
4 | 0 0 0 0 2 0 | 2
9 | 0 0 0 0 0 2 | 2
-----------+------------------------------------------------------------------+----------
Total | 30 17 10 11 2 2 | 72
Expected
Agreement Agreement Kappa Std. Err. Z Prob>Z
-----------------------------------------------------------------
88.89% 27.68% 0.8464 0.0653 12.95 0.0000
例2:
kap s110a ss110a,tab
| ss110a
s110a | 1 2 3 4 | Total
-----------+--------------------------------------------+----------
1 | 10 0 0 0 | 10
2 | 2 40 1 1 | 44
3 | 0 3 15 0 | 18
4 | 0 0 0 0 | 0
-----------+--------------------------------------------+----------
Total | 12 43 16 1 | 72
Expected
Agreement Agreement Kappa Std. Err. Z Prob>Z
-----------------------------------------------------------------
90.28% 44.37% 0.8252 0.0864 9.56 0.0000
总之我很困惑,.我的这个算法貌似不对;另一个就是明明例1的
kappa= 0.8464 ,比例2的kappa= 0.8252大,可是为什么agreement的值反而比例2的小????