该怎么分析“所用的颜色的数量”和“诊断结果”之间的关系？ - Statistics版 - 未名存档

本页内容为未名空间相应帖子的节选和存档，一周内的贴子最多显示50字，超过一周显示500字访问原贴

Statistics版 - 该怎么分析“所用的颜色的数量”和“诊断结果”之间的关系？

相关主题
● 再问multinomial logit	● R package e1071的一个问题：怎么读进一个svm model
● Question related to neural network package of R	● 还是run SVM的问题
● classification, 如果category有近上千个，大家一般用什么办法？	● 急问有关SVM，randomforest的问题（gene expression data）
● 面试：nonlinear regression, predictive modeling, machining learning问什么？	● svm的变量选择和shrinking选项什么关系？
● 请教：怎么能把Logistic regression的OR转化成probablity	● Re: reliability coefficient and its CI
● 做分类除了 logit reg 和svm 还有什么别的方法比较好	● 网上看到一道题
● Linear Regression	● 一个关于multicollinearity的问题
● how to do permutation in R?	● how to interpret these regression coefficients?

相关话题的讨论汇总
话题: color话题: diagnosis话题: used话题: age话题: sex

进入Statistics版参与讨论

1

(共1页)

w******a 发帖数: 25	1 Patient_Code Sex Age Color_used DIAGNOSIS 1 22 1 29 4 0 2 1 1 25 10 0 3 9 1 48 1 0 4 13 1 27 3 0 5 18 1 29 6 0 6 20 1 24 6 0 7 23 0 57 3 0 8 25 1 62 1 0 9 27 0 56 1 0 10 5 1 39 1 0 11
w******a 发帖数: 25	2 自己顶一下
S******y 发帖数: 1123	3 supervised learning Binary classifier -
d*******o 发帖数: 493	4 proc logistic data=one; model diagnose = sex Age Color_used ; run;
w******a 发帖数: 25	5 > mylogit<-glm(DIAGNOSIS~Color_used,family=binomial(link="logit"), na.action=na.pass) > summary(mylogit) Coefficients: Estimate Std. Error z value Pr(>\|z\|) (Intercept) -1.57289 1.13605 -1.385 0.166 Color_used 0.09416 0.18558 0.507 0.612 The table of coefficients shows that Color_used is statistically non- significant.
w******a 发帖数: 25	6 如果用classification tree或是Support Vector Machine或是KNN分析的话，training set and test set的data size太小，怎么分都是不够的. 请问statsguy，该怎么分呢？要bootstrap吗？
w******a 发帖数: 25	7 谢谢dapangmao 的sas code!热心人啊
S******y 发帖数: 1123	8 naiveBayes - R package e1071 Neural Net - R package nnet Collaborative Filtering (?) - Matlab??
w******a 发帖数: 25	9 e1071 only has one method called e1071.e1071-deprecated(). i don't know how to use it. > e1071-deprecated() Error: object 'e1071' not found nnet >train<-read.csv("dataset_nee.csv",header=T) > nnet(x=train[,2:4],y=train[,5]) Error in nnet.default(x = train[, 2:4], y = train[, 5]) : element 1 is empty; the part of the args list of 'c' being evaluated was: (size, dim(y)[2])
D******n 发帖数: 2836	10 then u have to accept the truth.... action=na.pass) "logit"), na.action=na.pass) 【在 w******a 的大作中提到】 : > mylogit<-glm(DIAGNOSIS~Color_used,family=binomial(link="logit"), na.action=na.pass) : > summary(mylogit) : Coefficients: : Estimate Std. Error z value Pr(>\|z\|) : (Intercept) -1.57289 1.13605 -1.385 0.166 : Color_used 0.09416 0.18558 0.507 0.612 : The table of coefficients shows that Color_used is statistically non- : significant.
w******a 发帖数: 25	11 in pilot study, only "color_used" stands out. We wish we can prove it and apply the result to save cost.I believe the data size is the issue and keep recruiting now. It is still really nice to be able to learn different analysis methods from your guys! Can anyone suggest something to solve the training/test dataset splitting problem?

1

(共1页)

进入Statistics版参与讨论

相关主题
● how to interpret these regression coefficients?	● 请教：怎么能把Logistic regression的OR转化成probablity
● R question	● 做分类除了 logit reg 和svm 还有什么别的方法比较好
● 请教比较两个regression coefficient	● Linear Regression
● 请教一个correlation coefficient的test的问题	● how to do permutation in R?
● 再问multinomial logit	● R package e1071的一个问题：怎么读进一个svm model
● Question related to neural network package of R	● 还是run SVM的问题
● classification, 如果category有近上千个，大家一般用什么办法？	● 急问有关SVM，randomforest的问题（gene expression data）
● 面试：nonlinear regression, predictive modeling, machining learning问什么？	● svm的变量选择和shrinking选项什么关系？

相关话题的讨论汇总
话题: color话题: diagnosis话题: used话题: age话题: sex

未名新帖统计// 7月16日

#	版面	帖数(主题数)
-	全站	4871 (796)
1	Military	3777 (569)
2	Stock	341 (51)
3	Joke	117 (17)
4	History	116 (3)
5	Automobile	100 (9)
6	USANews	55 (9)
7	Midlife	45 (1)
8	Headline	41 (41)
9	Dreamer	33 (13)
10	FleaMarket	32 (20)
11	Living	30 (7)

* 这里只显示发帖超过25的版面，努力灌水吧:-)