由买买提看人间百态

boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
Statistics版 - Decision Tree in Python or C++
相关主题
WHAT IS CART?怎么从rpart 的output 求 overall的MSE ?
R有package能做id3 decision tree么?R classification tree model 请教
这里有考过SAS predictive modeling认证的吗?R 里 encoding 提问;包子答谢!
[求教] Classfication and regression tree in SAScorrelation clustering用spss, stata, 或 r怎么做
[R] ada & rpart packages求教一个模型/预测问题
Size of R object for models is so big求教R中做CART tree 报错
相关话题的讨论汇总
话题: c++话题: decision话题: tree话题: python话题: sas
进入Statistics版参与讨论
1 (共1页)
S******y
发帖数: 1123
1
Have anybody used Decision Tree in Python or C++? (or written their own
decision tree implementation in Python or C++)? My goal is to run decision
tree on 8 million obs as training set and score 7 million in test set.
I am testing 'rpart' package on a 64-bit-Linux + 64-bit-R environment. But
it seems that rpart is either not stable or running out of memory very
quickly. (Is it because R is passing everything as copy instead of as object
reference?)
(PS. I would love to use SAS EM. but no licen
l*********s
发帖数: 5409
2
R is notorious for bad memory management;
Python is much better, C++ coding is too much a headache.
S******y
发帖数: 1123
3
Thanks. I won't mind re-writing it in C++ if it is 10X faster and better-
resource-managed than R.
A*******s
发帖数: 3942
4
i know there is a SAS macro for CHAID. Not sure if it can handle large
dataset.

decision
object

【在 S******y 的大作中提到】
: Have anybody used Decision Tree in Python or C++? (or written their own
: decision tree implementation in Python or C++)? My goal is to run decision
: tree on 8 million obs as training set and score 7 million in test set.
: I am testing 'rpart' package on a 64-bit-Linux + 64-bit-R environment. But
: it seems that rpart is either not stable or running out of memory very
: quickly. (Is it because R is passing everything as copy instead of as object
: reference?)
: (PS. I would love to use SAS EM. but no licen

d*******o
发帖数: 493
5
SAS is the good choice for large scale data. A 64bit SAS has excellent
memory management and basically can handle any size of datasets.
SAS 9.2 licenses PROC ARBORETUM, which is the foundation of SAS EM Decision
Tree Node. It supports AID, CHAID, XAID and CRT and code generation. It may
be used to train data and produce rules.
1 (共1页)
进入Statistics版参与讨论
相关主题
这里有考过SAS predictive modeling认证的吗?R 里 encoding 提问;包子答谢!
[求教] Classfication and regression tree in SAScorrelation clustering用spss, stata, 或 r怎么做
[R] ada & rpart packages求教一个模型/预测问题
Size of R object for models is so big求教R中做CART tree 报错
怎么从rpart 的output 求 overall的MSE ?WHAT IS CART?
R classification tree model 请教R有package能做id3 decision tree么?
相关话题的讨论汇总
话题: c++话题: decision话题: tree话题: python话题: sas