由买买提看人间百态

boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
Statistics版 - 请教一个关与SAS data的问题
相关主题
a question about SAS codeWhat's the wrong with this SAS code?
请教: 关于面试 被问到 large data sets问个SAS问题
logistic regression question求 imputation 后 出来的iteration 的数据作用
How to concatenate these two varaibles in SAS: name||Birt[SAS] Efficient way for subsetting data?
SAS Help! How to read variables with sepcial character.which route in SAS is faster?
从大data 产生多个小data 的方法a R question
[合集] 电话面试完了,肯定没戏,大家帮我看看题目,就算学习吧SAS 奇数行 和 偶数行 数据怎么合并
新人求问SAS简单问题~~双包子求教:SAS问题
相关话题的讨论汇总
话题: data话题: a1话题: a2话题: records话题: sets
进入Statistics版参与讨论
1 (共1页)
e***w
发帖数: 1160
1
I am spliting a large data sets into two data sets, below is my code.
data a1 a2;
set a;
if _n_<=5000 then output a1;
else output a2;
run;
Suppose data A has 10000 records, I end up get two data sets A1 and A2, each
contains 5000 records and same varaibles as a.
But the problem is the filesize of A1 and A2 is much larger than A.
Does anybody know what is going on here?
Thanks
t*****w
发帖数: 254
2
your code is ok. Please check the file size in data A;

each

【在 e***w 的大作中提到】
: I am spliting a large data sets into two data sets, below is my code.
: data a1 a2;
: set a;
: if _n_<=5000 then output a1;
: else output a2;
: run;
: Suppose data A has 10000 records, I end up get two data sets A1 and A2, each
: contains 5000 records and same varaibles as a.
: But the problem is the filesize of A1 and A2 is much larger than A.
: Does anybody know what is going on here?

e***w
发帖数: 1160
3
Here is the true result I have.
Data A, 15672 records, 63 variables, file size 6465K
Subset data A1: 5000 records, 63 variables, file size 40017K
Subset data A2: 10672 records, 63 variables, file size 85393K
w****r
发帖数: 28
4
试试 compress option在你的data step
1 (共1页)
进入Statistics版参与讨论
相关主题
双包子求教:SAS问题SAS Help! How to read variables with sepcial character.
logistic regression issue从大data 产生多个小data 的方法
SAS Code question? How to understand this output?[合集] 电话面试完了,肯定没戏,大家帮我看看题目,就算学习吧
请教SQL 要学多长时间新人求问SAS简单问题~~
a question about SAS codeWhat's the wrong with this SAS code?
请教: 关于面试 被问到 large data sets问个SAS问题
logistic regression question求 imputation 后 出来的iteration 的数据作用
How to concatenate these two varaibles in SAS: name||Birt[SAS] Efficient way for subsetting data?
相关话题的讨论汇总
话题: data话题: a1话题: a2话题: records话题: sets