由买买提看人间百态

boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
Statistics版 - 请问:想fit gamma 并同时用lasso的方法做variable selection
相关主题
model的predictors之间有multi-colinearity怎么办?##面试过了,请教问题##
有80个候选Predictors,怎么从中选<10个model和variables都sig.但每个category都不sig
关于lasso的variable selection问题急问:用stata或R算predicted probabiltiy (logistic regressi
电话面试完了,肯定没戏,大家帮我看看题目,就算学习吧包子求解释,为什么anova解释variance这么少
[合集] 电话面试完了,肯定没戏,大家帮我看看题目,就算学习吧求 imputation 后 出来的iteration 的数据作用
how to convert a categorical variable into a continuous variableone question about variable selection in SAS
抓狂!为啥选出来的predictor都这么差请教如何用R做Cox model的k-fold cross-validation
求问~做大数据时怎样知道哪些predictor应该构造interaction term??新手请教logistic regression
相关话题的讨论汇总
话题: lasso话题: gamma话题: fit话题: variables话题: lar
进入Statistics版参与讨论
1 (共1页)
c****s
发帖数: 63
1
我的问题是:
我现在有cost为outcome的数据,要用gamma distribution来fit,
但是难题是predictors有1000个,所以又要同时选择predictor.
我的想法:
1.如果用stepwise方法来选,就可以既fit gamma 又同时select variables. 在R中可
以实现,但老板所stepwise不好,让用lasso来select variables.
2.如果直接先用lasso选变量,再去fit model,好像也不太对。因为lasso fit的
是least Angle regression, 不是基于gamma distribution. 应该不能适用于cost
data.
不知道大家遇到这种问题该怎么办,SAS, Stata or R, 那个能解决这个问题呢?
或者有什么好的建议,先谢谢了!
c****s
发帖数: 63
2
是不是我说的不清楚啊,我修改了一下,还望大家帮帮忙!!
s*********e
发帖数: 1051
3
i think it is ok to use lasso and here is why.
there are 2 parameters in gamma, scale and shape parameters. when shape
parameter is large, gamma converges to gaussian. so if you are working on
large sample, you should be fine.
i*******n
发帖数: 227
4
confused by your question.
LAR is only a solution for lasso, and LAR is nothing to do with statistical
assumptions. Why do you want to use lasso but worry about LAR?
I guess what you want is a gamma fitting model with L1-norm constraint, am I
right?
c****s
发帖数: 63
5
Thanks for your reply!
Yes, you are right.
So according what you said, can I use lasso in 'proc glmselect' in SAS to
find the best variables and then put those variables in 'proc genmod'?
Or do you have any suggestions? Thanks!
o***o
发帖数: 43
6
你应该把model的penalized likelihood写出来,看看自己能不能optimize.
有篇文章或许有用:L1-regularization path algorithm for generalized
linear models。
http://www-stat.stanford.edu/~hastie/Papers/JRSSB.69.4%20%282007%29%20659-677%20Park.pdf
1 (共1页)
进入Statistics版参与讨论
相关主题
新手请教logistic regression[合集] 电话面试完了,肯定没戏,大家帮我看看题目,就算学习吧
cross validation选择 lasso的 参数how to convert a categorical variable into a continuous variable
【大包子】Factor data analysis抓狂!为啥选出来的predictor都这么差
Gene expression =?= Variable selection求问~做大数据时怎样知道哪些predictor应该构造interaction term??
model的predictors之间有multi-colinearity怎么办?##面试过了,请教问题##
有80个候选Predictors,怎么从中选<10个model和variables都sig.但每个category都不sig
关于lasso的variable selection问题急问:用stata或R算predicted probabiltiy (logistic regressi
电话面试完了,肯定没戏,大家帮我看看题目,就算学习吧包子求解释,为什么anova解释variance这么少
相关话题的讨论汇总
话题: lasso话题: gamma话题: fit话题: variables话题: lar