Statistics board - test for randomness
c***z
Posts: 6348
1
Hi all,
I am working on something new to me and would like to hear your suggestions. :)
I have a record of how often some page_visit data is missing, per day, for
100 million users over 15 days. I need to check whether the data is missing
at random, to make sure my analysis is not biased. We already know that, on
average, we receive less data than we should, and that sometimes we receive
more than we should.
Any clue on how to do this?
Thanks a lot! :)
I*****a
Posts: 5425
2
If you have a sample of the missing data, you can compare its distribution
with that of the data that is not missing, e.g. with a KS test.
Choosing which random variable to compare may be tricky. Probably the ones
you are most interested in.
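
A minimal sketch of the KS comparison suggested here, using only the Python standard library. The variable names and the toy numbers (`visits_missing`, `visits_complete`) are hypothetical, not from the original poster's data, and the critical value is the usual large-sample approximation, so treat it as rough for small samples.

```python
import math
from bisect import bisect_right

def ks_statistic(a, b):
    """Two-sample KS statistic: largest gap between the two empirical CDFs."""
    a, b = sorted(a), sorted(b)
    n, m = len(a), len(b)
    # The largest gap is attained at one of the observed values.
    return max(abs(bisect_right(a, x) / n - bisect_right(b, x) / m) for x in a + b)

def ks_critical(n, m, alpha=0.05):
    """Large-sample critical value for the two-sample KS statistic."""
    return math.sqrt(-0.5 * math.log(alpha / 2)) * math.sqrt((n + m) / (n * m))

# Hypothetical data: daily page visits for users with and without missing records.
visits_missing = [3, 5, 2, 8, 4, 6, 1, 7]
visits_complete = [9, 12, 7, 15, 10, 11, 8, 14]

d = ks_statistic(visits_missing, visits_complete)
reject = d > ks_critical(len(visits_missing), len(visits_complete))
print(f"D = {d:.3f}, reject equal distributions at 5%: {reject}")
```

With sample sizes in the millions, even a tiny gap between the two distributions will come out as significant, so it is probably worth looking at the size of D itself and not only at whether the test rejects.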

I*****a
Posts: 5425
3
Also, depending on your actual problem and the reasons for the missingness,
some signals may have a different mean/variance if the data is not missing at
random. In that case, comparing the mean/variance directly should give you
more power.
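
A rough sketch of the direct mean/variance comparison, again standard library only. Welch's t statistic is my choice here (the post does not name a specific test), and `signal_missing` / `signal_complete` are made-up placeholder values for whatever signal you care about.

```python
import math
import statistics

def welch_t(a, b):
    """Welch's t statistic and its approximate degrees of freedom."""
    va = statistics.variance(a) / len(a)
    vb = statistics.variance(b) / len(b)
    t = (statistics.mean(a) - statistics.mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df

def variance_ratio(a, b):
    """Ratio of sample variances; values far from 1 suggest unequal spread."""
    return statistics.variance(a) / statistics.variance(b)

# Hypothetical signal values for users with and without missing records.
signal_missing = [1.0, 2.0, 3.0, 4.0, 5.0]
signal_complete = [2.0, 3.0, 4.0, 5.0, 6.0]

t, df = welch_t(signal_missing, signal_complete)
ratio = variance_ratio(signal_missing, signal_complete)
print(f"t = {t:.3f}, df = {df:.1f}, variance ratio = {ratio:.3f}")
```

For very large groups the t statistic is approximately standard normal, so |t| above about 1.96 would point to a difference in means at the 5% level.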

c***z
Posts: 6348
4
thanks a lot! :)