e***w 发帖数: 1160 | 1 I am spliting a large data sets into two data sets, below is my code.
data a1 a2;
set a;
if _n_<=5000 then output a1;
else output a2;
run;
Suppose data A has 10000 records, I end up get two data sets A1 and A2, each
contains 5000 records and same varaibles as a.
But the problem is the filesize of A1 and A2 is much larger than A.
Does anybody know what is going on here?
Thanks | t*****w 发帖数: 254 | 2 your code is ok. Please check the file size in data A;
each
【在 e***w 的大作中提到】 : I am spliting a large data sets into two data sets, below is my code. : data a1 a2; : set a; : if _n_<=5000 then output a1; : else output a2; : run; : Suppose data A has 10000 records, I end up get two data sets A1 and A2, each : contains 5000 records and same varaibles as a. : But the problem is the filesize of A1 and A2 is much larger than A. : Does anybody know what is going on here?
| e***w 发帖数: 1160 | 3 Here is the true result I have.
Data A, 15672 records, 63 variables, file size 6465K
Subset data A1: 5000 records, 63 variables, file size 40017K
Subset data A2: 10672 records, 63 variables, file size 85393K | w****r 发帖数: 28 | 4 试试 compress option在你的data step |
|