由买买提看人间百态

boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
Statistics版 - 该怎么分析“所用的颜色的数量”和“诊断结果”之间的关系?
相关主题
再问multinomial logitR package e1071的一个问题:怎么读进一个svm model
Question related to neural network package of R还是run SVM的问题
classification, 如果category有近上千个,大家一般用什么办法?急问有关SVM,randomforest的问题(gene expression data)
面试:nonlinear regression, predictive modeling, machining learning问什么?svm的变量选择和shrinking选项什么关系?
请教:怎么能把Logistic regression的OR转化成probablityRe: reliability coefficient and its CI
做分类 除了 logit reg 和svm 还有什么别的方法比较好网上看到一道题
Linear Regression一个关于multicollinearity的问题
how to do permutation in R?how to interpret these regression coefficients?
相关话题的讨论汇总
话题: color话题: diagnosis话题: used话题: age话题: sex
进入Statistics版参与讨论
1 (共1页)
w******a
发帖数: 25
1
Patient_Code Sex Age Color_used DIAGNOSIS
1 22 1 29 4 0
2 1 1 25 10 0
3 9 1 48 1 0
4 13 1 27 3 0
5 18 1 29 6 0
6 20 1 24 6 0
7 23 0 57 3 0
8 25 1 62 1 0
9 27 0 56 1 0
10 5 1 39 1 0
11
w******a
发帖数: 25
2
自己顶一下
S******y
发帖数: 1123
3
supervised learning
Binary classifier -
d*******o
发帖数: 493
4
proc logistic data=one;
model diagnose = sex Age Color_used ;
run;
w******a
发帖数: 25
5
> mylogit<-glm(DIAGNOSIS~Color_used,family=binomial(link="logit"), na.action=na.pass)
> summary(mylogit)
Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) -1.57289 1.13605 -1.385 0.166
Color_used 0.09416 0.18558 0.507 0.612
The table of coefficients shows that Color_used is statistically non-
significant.
w******a
发帖数: 25
6
如果用classification tree或是Support Vector Machine或是KNN分析的话,training set and test set的data size太小,怎么分都是不够的.
请问statsguy,该怎么分呢?要bootstrap吗?
w******a
发帖数: 25
7
谢谢dapangmao 的sas code!热心人啊
S******y
发帖数: 1123
8
naiveBayes - R package e1071
Neural Net - R package nnet
Collaborative Filtering (?) - Matlab??
w******a
发帖数: 25
9

e1071 only has one method called e1071.e1071-deprecated().
i don't know how to use it.
> e1071-deprecated()
Error: object 'e1071' not found

nnet
>train<-read.csv("dataset_nee.csv",header=T)
> nnet(x=train[,2:4],y=train[,5])
Error in nnet.default(x = train[, 2:4], y = train[, 5]) :
element 1 is empty;
the part of the args list of 'c' being evaluated was:
(size, dim(y)[2])
D******n
发帖数: 2836
10
then u have to accept the truth....

action=na.pass)
"logit"), na.action=na.pass)

【在 w******a 的大作中提到】
: > mylogit<-glm(DIAGNOSIS~Color_used,family=binomial(link="logit"), na.action=na.pass)
: > summary(mylogit)
: Coefficients:
: Estimate Std. Error z value Pr(>|z|)
: (Intercept) -1.57289 1.13605 -1.385 0.166
: Color_used 0.09416 0.18558 0.507 0.612
: The table of coefficients shows that Color_used is statistically non-
: significant.

w******a
发帖数: 25
11
in pilot study, only "color_used" stands out. We wish we can prove it and
apply the result to save cost.I believe the data size is the issue and keep
recruiting now.
It is still really nice to be able to learn different analysis methods from
your guys!
Can anyone suggest something to solve the training/test dataset splitting
problem?
1 (共1页)
进入Statistics版参与讨论
相关主题
how to interpret these regression coefficients?请教:怎么能把Logistic regression的OR转化成probablity
R question做分类 除了 logit reg 和svm 还有什么别的方法比较好
请教比较两个regression coefficientLinear Regression
请教一个correlation coefficient的test的问题how to do permutation in R?
再问multinomial logitR package e1071的一个问题:怎么读进一个svm model
Question related to neural network package of R还是run SVM的问题
classification, 如果category有近上千个,大家一般用什么办法?急问有关SVM,randomforest的问题(gene expression data)
面试:nonlinear regression, predictive modeling, machining learning问什么?svm的变量选择和shrinking选项什么关系?
相关话题的讨论汇总
话题: color话题: diagnosis话题: used话题: age话题: sex