proc logistic: how to build 2 X 2 classification table - Statistics board - 未名 archive

S******y (posts: 1123) #1
I am using PROC LOGISTIC to model a binary outcome. I have the observed Y (1 or 0) from the original data. I also have a predicted probability for each observation (i.e. the predicted probability of the event Y=1) from PROC LOGISTIC. Let us call it p_hat. For example, I would have two columns:

Y  p_hat
1  0.6
0  0.3
1  0.45
...

I would like to build a 2X2 classification table (Y=1, Y=0 vs. Y_hat=1, Y_hat=0) to evaluate my classification accuracy. The challenge is: how to derive Y_hat from p_hat? what

b******1 (posts: 367) #2
There is no universal standard for setting the cutoff point.

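To make the dependence on the cutoff concrete, here is a minimal Python sketch (a language-neutral illustration, not PROC LOGISTIC syntax). It derives Y_hat as 1 when p_hat is at or above a chosen cutoff and tallies the 2x2 table. The function name `classification_table` and the last three data rows are hypothetical; only the first three rows come from the original post.

```python
def classification_table(y, p_hat, cutoff=0.5):
    """Count observations by (observed Y, predicted Y_hat) for one cutoff."""
    table = {(1, 1): 0, (1, 0): 0, (0, 1): 0, (0, 0): 0}
    for obs, p in zip(y, p_hat):
        pred = 1 if p >= cutoff else 0  # Y_hat = 1 when p_hat reaches the cutoff
        table[(obs, pred)] += 1
    return table


# First three rows are from the post; the last three are made up for illustration.
y = [1, 0, 1, 0, 1, 0]
p_hat = [0.6, 0.3, 0.45, 0.55, 0.8, 0.1]

for cutoff in (0.5, 0.4):
    t = classification_table(y, p_hat, cutoff)
    correct = t[(1, 1)] + t[(0, 0)]
    print(f"cutoff={cutoff}: table={t}, correct={correct} of {len(y)}")
```

On this toy data the cutoff 0.5 classifies 4 of 6 observations correctly and the cutoff 0.4 classifies 5 of 6, which is the point of the reply above: the table is a function of the cutoff you pick.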
o****o (posts: 8077) #3
This really depends on your business context, which in turn determines how the threshold is defined. For example, if you want to maximize the Class=1 outcomes captured in the top 30%, then you can label whoever falls in the top 30% by p_hat as Y_hat=1, ... . This is one reason I think stats jobs that are integrated with the business can't be outsourced.

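The top-30% rule described above can be sketched in Python as follows. This is one possible reading of the rule, under the assumptions that the count is rounded to the nearest integer and that ties are broken by original row order; the function name `top_fraction_labels` and all data rows after the first three are hypothetical.

```python
def top_fraction_labels(p_hat, fraction=0.3):
    """Label the highest-scoring `fraction` of observations as Y_hat = 1."""
    n_pos = int(round(fraction * len(p_hat)))  # how many rows to flag (assumed rounding)
    # Indices ordered from highest to lowest p_hat; sorted() is stable, so ties keep row order.
    order = sorted(range(len(p_hat)), key=lambda i: p_hat[i], reverse=True)
    y_hat = [0] * len(p_hat)
    for i in order[:n_pos]:
        y_hat[i] = 1
    return y_hat


# First three scores are from the original post; the rest are made up for illustration.
p_hat = [0.6, 0.3, 0.45, 0.55, 0.8, 0.1, 0.7, 0.2, 0.35, 0.05]
y_hat = top_fraction_labels(p_hat, fraction=0.3)

# The cutoff implied by the rule is the smallest p_hat that still got Y_hat = 1.
implied_cutoff = min(p for p, label in zip(p_hat, y_hat) if label == 1)
print(y_hat, implied_cutoff)
```

Here the three rows with p_hat 0.8, 0.7 and 0.6 are flagged, so the business rule implies a cutoff of 0.6 on this data. The cutoff is an output of the rule, not something fixed in advance.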
S******y (posts: 1123) #4
Thanks to both of you for the prompt help! Happy Friday!

S******y (posts: 1123) #5
Do you also have to come up with a cutoff point for KNN, neural net, RandomForest and rpart predictions? Isn't it true that if you treat the binary response as a factor, you can get predicted 0/1 without making up a cutoff point? Thanks.

o****o (posts: 8077) #6
KNN uses some form of majority vote, right? So there is a natural step function. RF also relies on a rule similar to majority vote, right?

[In reply to S******y's post:]
: Do you have to come up with cutting point also for KNN, Neural Net,
: RandomForest and rpart prediction?
: Isn't it true that if you treat binary response as factor, you can get
: predicted 0,1 without making up cutting point?
: Thanks.

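The majority-vote point can be illustrated with a toy one-dimensional KNN written from scratch in Python. This is a sketch under simplifying assumptions (a single numeric feature, absolute distance, odd k so votes cannot tie), not the kknn or randomForest implementation; the function names and the data are hypothetical. It shows that the 0/1 label is the neighbors' vote share passed through a step at 0.5, so a cutoff is still there, just chosen implicitly.

```python
def knn_vote_fraction(x_train, y_train, x_new, k=3):
    """Share of the k nearest training points (by absolute distance) with y = 1."""
    nearest = sorted(range(len(x_train)), key=lambda i: abs(x_train[i] - x_new))[:k]
    return sum(y_train[i] for i in nearest) / k


def knn_predict(x_train, y_train, x_new, k=3, cutoff=0.5):
    """Majority vote is the step function: 1 if the vote share exceeds the cutoff."""
    return 1 if knn_vote_fraction(x_train, y_train, x_new, k) > cutoff else 0


# Hypothetical training data, made up for illustration.
x_train = [1.0, 2.0, 3.0, 6.0, 7.0, 8.0]
y_train = [0, 0, 1, 1, 1, 0]

for x_new in (4.0, 1.5):
    share = knn_vote_fraction(x_train, y_train, x_new)
    print(x_new, round(share, 3), knn_predict(x_train, y_train, x_new))
```

For x_new = 1.5 the vote share is 1/3, so the default majority rule predicts 0, while calling `knn_predict(..., cutoff=0.3)` on the same point predicts 1. Treating the response as a factor hides the cutoff rather than removing it.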
D******n (posts: 2836) #7
Check out ROC and AUC.

[In reply to S******y's post:]
: I am using PROC LOGISTIC to model binary outcomes.
: I have observed Y (1 or 0) from original data.
: I also have got predicted probability for each observation (i.e. predicted
: probability of event Y=1) from PROC LOGISTIC. Let us call it - p_hat.
: for example, I would have two columns -
: Y p_hat
: 1 0.6
: 0 0.3
: 1 0.45
: ...

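For readers unfamiliar with the suggestion, here is a small from-scratch Python sketch of what ROC and AUC summarize: the ROC curve traces (false positive rate, true positive rate) as the cutoff sweeps over every observed p_hat, and AUC can be computed as the share of (event, non-event) pairs in which the event has the higher p_hat, counting ties as one half. The function names are hypothetical, and only the first three data rows come from the original post.

```python
def roc_points(y, p_hat):
    """(FPR, TPR) pairs obtained by using each distinct p_hat as the cutoff."""
    n_pos = sum(y)
    n_neg = len(y) - n_pos
    points = [(0.0, 0.0)]  # cutoff above every score: nothing is predicted 1
    for cutoff in sorted(set(p_hat), reverse=True):
        tp = sum(1 for obs, p in zip(y, p_hat) if obs == 1 and p >= cutoff)
        fp = sum(1 for obs, p in zip(y, p_hat) if obs == 0 and p >= cutoff)
        points.append((fp / n_neg, tp / n_pos))
    return points


def auc(y, p_hat):
    """Share of (event, non-event) pairs ranked correctly; ties count as 0.5."""
    pos = [p for obs, p in zip(y, p_hat) if obs == 1]
    neg = [p for obs, p in zip(y, p_hat) if obs == 0]
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in pos for b in neg)
    return wins / (len(pos) * len(neg))


# First three rows are from the post; the last three are made up for illustration.
y = [1, 0, 1, 0, 1, 0]
p_hat = [0.6, 0.3, 0.45, 0.55, 0.8, 0.1]

print(roc_points(y, p_hat))
print(auc(y, p_hat))
```

On this toy data 8 of the 9 event/non-event pairs are ordered correctly, so the AUC is 8/9. Because it is computed over all cutoffs at once, AUC evaluates the ranking quality of p_hat without committing to any single Y_hat rule.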
S******y (posts: 1123) #8
Thanks to both of you! I tried factor(out_come) ~ . in both kknn and logistic, and it seems to be working!
