由买买提看人间百态

boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
Statistics版 - A question in splitting dataset
相关主题
which route in SAS is faster?one quick question about concatenating data in SAS
菜鸟问个sas得问题,关于分数组为什么合并这两个数据不能用sas merge 而要用SAS SQL
有谁熟习用SAS做RANDOM SAMPLING, 请教!请教sas adv的题
如何比较两个proc contents的结果?问个效率问题 SQL vs data step,大数据量
[提问]怎样sort这个dataset?SAS菜鸟请教如果使SAS的output的结果放到一个文件内?
[SAS] data set options (obs=) in output tablesPROC SQL join data help
SAS dataset 中,怎么把数据往上移一行?SAS help : Proc dataset
求教一个简单的data step 牛肉包请教如何用SAS处理这个RANDOM SAMPLING的问题
相关话题的讨论汇总
话题: dataset话题: policy话题: 77821178话题: obs话题: 67
进入Statistics版参与讨论
1 (共1页)
g******7
发帖数: 1433
1
I have a dataset, and I want to randomly split it into two datasets.
For example,
Obs Policy #
1 67
2 67
3 67
4 78
5 78
....
10000000 77821178
10000001 77821178
10000002 77821178
I want all the same (unique) policy# in one splitted dataset (like all the
67 in first dataset), not in both of splitted datasets(like obs 1 in
dataset1, obs 2 in dataset2), how would I do that in SAS?
Sorry that i cant type chinese,
Thanks!
a****g
发帖数: 8131
2
proc sql/freq to find unique policy numbers
then proc surveyselect

【在 g******7 的大作中提到】
: I have a dataset, and I want to randomly split it into two datasets.
: For example,
: Obs Policy #
: 1 67
: 2 67
: 3 67
: 4 78
: 5 78
: ....
: 10000000 77821178

g******7
发帖数: 1433
3
could you write me the sample code on how to do this?
never write this kind of codes..
I have a large amount(millions) of unique policy #, will this gonna work?
baozi xie!

【在 a****g 的大作中提到】
: proc sql/freq to find unique policy numbers
: then proc surveyselect

a****g
发帖数: 8131
4
it is somethinglike the following
proc sql;
create table a as
select distinct policy
from data;
quit;
proc surveyselect data=data method=random out=out1;
where policy in (proc sql; select policy from a;)
....
then
proc sql;
create data out2
as
select * from
data
where policy not in (select policy from out1);
quit;
something like this. haven't used it for a long time don't remember some
minute details

【在 g******7 的大作中提到】
: could you write me the sample code on how to do this?
: never write this kind of codes..
: I have a large amount(millions) of unique policy #, will this gonna work?
: baozi xie!

g******7
发帖数: 1433
5
不是很清楚第二步
proc surveyselect data=data method=random out=out1;
where policy in (proc sql; select policy from a;)
这里的method是random? 还有syntax具体是怎么样的?

【在 a****g 的大作中提到】
: it is somethinglike the following
: proc sql;
: create table a as
: select distinct policy
: from data;
: quit;
: proc surveyselect data=data method=random out=out1;
: where policy in (proc sql; select policy from a;)
: ....
: then

D******n
发帖数: 2836
6
proc surveyselect;cluster;

【在 g******7 的大作中提到】
: I have a dataset, and I want to randomly split it into two datasets.
: For example,
: Obs Policy #
: 1 67
: 2 67
: 3 67
: 4 78
: 5 78
: ....
: 10000000 77821178

g******7
发帖数: 1433
7
cluster? 能否具体说下code怎么写,多谢!

【在 D******n 的大作中提到】
: proc surveyselect;cluster;
1 (共1页)
进入Statistics版参与讨论
相关主题
请教如何用SAS处理这个RANDOM SAMPLING的问题[提问]怎样sort这个dataset?
如何在1,2,3,4,5中随机选出2个数来?[SAS] data set options (obs=) in output tables
用SAS sampling的一个问题SAS dataset 中,怎么把数据往上移一行?
[合集] SAS里如何实现LOCF(LAST OBS CARRIED FORWARD)?求教一个简单的data step 牛肉包
which route in SAS is faster?one quick question about concatenating data in SAS
菜鸟问个sas得问题,关于分数组为什么合并这两个数据不能用sas merge 而要用SAS SQL
有谁熟习用SAS做RANDOM SAMPLING, 请教!请教sas adv的题
如何比较两个proc contents的结果?问个效率问题 SQL vs data step,大数据量
相关话题的讨论汇总
话题: dataset话题: policy话题: 77821178话题: obs话题: 67