A naive statistics question - Statistics board - 未名 archive

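This page is an excerpt and archive of the corresponding thread on 未名空间. Posts less than a week old show at most 50 characters; older posts show up to 500 characters.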

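w**s Posts: 26	1 If a regression on a variable's continuous observations gives a weak result, will a regression on that variable's rank observations be weaker or stronger? For example, x has 100 obs. Rank them into ten groups and use the rank 0, 1, ..., 9 as the independent variable. Does the significance of the coefficient increase or decrease?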
s*******n Posts: 901	2 I don't know the answer to this one.
c*****n Posts: 46	3 Does doing this have any practical meaning? (In reply to the original post by w**s.)
I*****a Posts: 5425	4 This question is potentially useful, though I don't know the answer. A much simplified and somewhat similar problem is as follows. We have a single continuous predictor x from some uniform distribution and a response variable with EY = b0 + b1 x. Instead of ranking x into bins, we round x to w, which is what usually happens in real situations where exact measurement is difficult. a) We estimate b1-hat from EY = b0 + b1 x. b) We estimate d1-hat from EY = d0 + d1 w. In this case, I think the predictor effect is less significant in b) than in a), especially if you round too coarsely (a small number of bins). Write x = w + gamma. The t stat for b1-hat is proportional to Cov(x, Y) / sqrt(S(x) * mse_a), where S() is the sample variance and mse_a is the estimated MSE from model a); the t stat for d1-hat has the same form with w and mse_b. The denominator for a) is smaller than for b), so the slope effect is more significant in the original problem than after rounding. Again, I have simplified the problem a lot here: simple linear regression, uniform x (not too weird a distribution of x), and rounding instead of ranking. I'm not sure how relevant this is to the original question. (In reply to the original post by w**s.)
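
One way to probe both the original question and the simplified version in post 4 is a small simulation. The sketch below is not from the thread. It assumes x is uniform on [0, 1), Y = b0 + b1 x + normal noise, and 100 observations per sample; the parameter values and the helper names (`slope_t_stat`, `rank_into_groups`, `mean_t_stats`) are made up for illustration. Under these assumptions the population correlation satisfies corr(w, Y) = corr(x, Y) * corr(w, x) for any coarsened version w of x, so on average the slope should look no more significant after grouping, although a single sample can go either way.

```python
import numpy as np


def slope_t_stat(x, y):
    """t statistic of the slope in a simple linear regression of y on x."""
    n = len(x)
    xc = x - x.mean()
    sxx = np.sum(xc ** 2)
    b1 = np.sum(xc * (y - y.mean())) / sxx
    b0 = y.mean() - b1 * x.mean()
    resid = y - (b0 + b1 * x)
    mse = np.sum(resid ** 2) / (n - 2)  # estimated error variance
    return b1 / np.sqrt(mse / sxx)


def rank_into_groups(x, n_groups):
    """Replace x by group ranks 0..n_groups-1 of (nearly) equal size."""
    order = np.argsort(np.argsort(x))  # 0-based rank of each observation
    return (order * n_groups // len(x)).astype(float)


def mean_t_stats(n_obs=100, n_reps=2000, b0=1.0, b1=1.0, sigma=1.0, seed=0):
    """Average slope t statistic over repeated samples (illustrative values)."""
    rng = np.random.default_rng(seed)
    totals = {"continuous": 0.0, "10 groups": 0.0, "2 groups": 0.0}
    for _ in range(n_reps):
        x = rng.uniform(0.0, 1.0, n_obs)
        y = b0 + b1 * x + rng.normal(0.0, sigma, n_obs)
        totals["continuous"] += slope_t_stat(x, y)
        totals["10 groups"] += slope_t_stat(rank_into_groups(x, 10), y)
        totals["2 groups"] += slope_t_stat(rank_into_groups(x, 2), y)
    return {name: total / n_reps for name, total in totals.items()}


if __name__ == "__main__":
    for name, t in mean_t_stats().items():
        print(f"{name:>10}: mean slope t = {t:.3f}")
```

The two-group case is included because the loss from ten equal-sized groups of a uniform x is expected to be small, while a median split makes the attenuation easier to see. A skewed x or a nonlinear relationship between x and Y could behave differently, so this only speaks to the simplified linear, uniform setting.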
