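I'm not a statistics major, and I'm preparing for a data analyst exam. I just finished two textbooks and tried the A00-240 practice questions, only to find that my grasp of a lot of the material isn't solid enough. I made mistakes all over the place...
This is question 8 from the exam: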
Suppose training data are oversampled in the event group to make the number of events and non-events roughly equal. A logistic regression is run and the probabilities are output to a data set NEW and given the variable name PE. A decision rule considered is, "Classify data as an event if probability is greater than 0.5." Also, the data set NEW contains a variable TG that indicates whether there is an event (1 = Event, 0 = No event).
The following SAS program was used:
Data NEW;
   set NEW;
   Solicit = PE > .5;
run;
proc means data=NEW(where = (TG = 1)) mean;
   var Solicit;
run;
What does this program calculate?
A. Depth
B. Sensitivity
C. Specificity
D. Positive predictive value
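I know how Depth, Sensitivity, Specificity, and PV+ are supposed to be calculated, but I'm confused about the three variables in the NEW data set (PE, Solicit, TG), so I don't know how to plug them into the formulas. Any help would be appreciated. Thanks!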
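
For reference, here is a minimal sketch of how the three variables could line up with the four statistics. It assumes PE is the predicted probability, Solicit is the predicted class under the 0.5 cutoff, and TG is the actual class. The data set name TOY, the variable NoSolicit, and all data values are made up for illustration and are not part of the exam question.

```sas
/* Toy data: 4 events and 4 non-events, values invented for illustration */
data TOY;
   input PE TG;
   Solicit   = PE > .5;      /* 1 if classified as an event, else 0 */
   NoSolicit = 1 - Solicit;  /* hypothetical helper: 1 if classified as a non-event */
   datalines;
0.9 1
0.7 1
0.4 1
0.6 1
0.8 0
0.6 0
0.2 0
0.1 0
;
run;

/* Depth: share of ALL cases classified as events (5/8 = 0.625 here) */
proc means data=TOY mean;
   var Solicit;
run;

/* Sensitivity: share of ACTUAL events (TG = 1) classified as events (3/4 = 0.75 here) */
proc means data=TOY(where = (TG = 1)) mean;
   var Solicit;
run;

/* Specificity: share of ACTUAL non-events (TG = 0) classified as non-events (2/4 = 0.5 here) */
proc means data=TOY(where = (TG = 0)) mean;
   var NoSolicit;
run;

/* PV+: share of PREDICTED events (Solicit = 1) that are actual events (3/5 = 0.6 here) */
proc means data=TOY(where = (Solicit = 1)) mean;
   var TG;
run;
```

In each step, the where= filter picks the denominator and the var statement picks the numerator: because the variable is coded 0/1, its mean is just the proportion of 1s among the rows that pass the filter. Comparing the filter and variable in the exam program against these four patterns should show which statistic it computes.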