boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
DataSciences版 - 分享一篇有意思的文章Machine Learning: The High-Interest Credit Card of Technical Debt
相关主题
问问面试如何出题
Leetcode?
刷kaggle对找工作有用么?
[内推] NetBrain tech (software engineer等)职位内部推荐机会
Machine learning in action by Peter Harrington
请问machine learning入门级的书哪一本比较好?
攒人品,求bless~ 新鲜面经 - Machine Learning Engineer
Stanford 免费的 Machine Learning online course 开始了
求推荐machine learning和data mining的书 (转载)
非csee的engineer做machine learning engineer,办身份会有问题吗
相关话题的讨论汇总
话题: learning话题: machine话题: debt话题: card话题: technical
进入DataSciences版参与讨论
1 (共1页)
b*****o
发帖数: 715
1
Machine Learning:
The High-Interest Credit Card of Technical Debt
http://static.googleusercontent.com/media/research.google.com/e
其中我很感慨的就是下面这段:自己重写一个专用的工具比开源的通用工具靠谱。
It may be surprising to the academic community to know that only a tiny
fraction of the code in
many machine learning systems is actually doing “machine learning”. When
we recognize that a
mature system might end up being (at most) 5% machine learning code and (at
least) 95% glue code,
reimplementation rather than reuse of a clumsy API looks like a much better
strategy.
我自己加的推论就是,production的ML系统,千万不能用python来写,虽然刚上手的时
候很爽,但是太难维护和扩展了。
d******e
发帖数: 7844
2
有用Python来做产品的?

at

【在 b*****o 的大作中提到】
: Machine Learning:
: The High-Interest Credit Card of Technical Debt
: http://static.googleusercontent.com/media/research.google.com/e
: 其中我很感慨的就是下面这段:自己重写一个专用的工具比开源的通用工具靠谱。
: It may be surprising to the academic community to know that only a tiny
: fraction of the code in
: many machine learning systems is actually doing “machine learning”. When
: we recognize that a
: mature system might end up being (at most) 5% machine learning code and (at
: least) 95% glue code,

Z**0
发帖数: 1119
3
Oh my god. 95% glue code.
要么是里边有很多machine learning module,需要glue together。更加可能是,采集
的数据,需要很多步骤处理,才可以用在ML的module上面,最后还有presentation
result部分。
是不是大部分DS在公司很多工作都是和clean data和present final result有关了?

at

【在 b*****o 的大作中提到】
: Machine Learning:
: The High-Interest Credit Card of Technical Debt
: http://static.googleusercontent.com/media/research.google.com/e
: 其中我很感慨的就是下面这段:自己重写一个专用的工具比开源的通用工具靠谱。
: It may be surprising to the academic community to know that only a tiny
: fraction of the code in
: many machine learning systems is actually doing “machine learning”. When
: we recognize that a
: mature system might end up being (at most) 5% machine learning code and (at
: least) 95% glue code,

b*****o
发帖数: 715
4
Machine Learning:
The High-Interest Credit Card of Technical Debt
http://static.googleusercontent.com/media/research.google.com/e
其中我很感慨的就是下面这段:自己重写一个专用的工具比开源的通用工具靠谱。
It may be surprising to the academic community to know that only a tiny
fraction of the code in
many machine learning systems is actually doing “machine learning”. When
we recognize that a
mature system might end up being (at most) 5% machine learning code and (at
least) 95% glue code,
reimplementation rather than reuse of a clumsy API looks like a much better
strategy.
我自己加的推论就是,production的ML系统,千万不能用python来写,虽然刚上手的时
候很爽,但是太难维护和扩展了。
d******e
发帖数: 7844
5
有用Python来做产品的?

at

【在 b*****o 的大作中提到】
: Machine Learning:
: The High-Interest Credit Card of Technical Debt
: http://static.googleusercontent.com/media/research.google.com/e
: 其中我很感慨的就是下面这段:自己重写一个专用的工具比开源的通用工具靠谱。
: It may be surprising to the academic community to know that only a tiny
: fraction of the code in
: many machine learning systems is actually doing “machine learning”. When
: we recognize that a
: mature system might end up being (at most) 5% machine learning code and (at
: least) 95% glue code,

Z**0
发帖数: 1119
6
Oh my god. 95% glue code.
要么是里边有很多machine learning module,需要glue together。更加可能是,采集
的数据,需要很多步骤处理,才可以用在ML的module上面,最后还有presentation
result部分。
是不是大部分DS在公司很多工作都是和clean data和present final result有关了?

at

【在 b*****o 的大作中提到】
: Machine Learning:
: The High-Interest Credit Card of Technical Debt
: http://static.googleusercontent.com/media/research.google.com/e
: 其中我很感慨的就是下面这段:自己重写一个专用的工具比开源的通用工具靠谱。
: It may be surprising to the academic community to know that only a tiny
: fraction of the code in
: many machine learning systems is actually doing “machine learning”. When
: we recognize that a
: mature system might end up being (at most) 5% machine learning code and (at
: least) 95% glue code,

l********e
发帖数: 220
7
为啥不能用python?看不出来你这个推论怎么来的?不用python 用啥?C++, R?有什么
区别?R的ML API也很多把?

at

【在 b*****o 的大作中提到】
: Machine Learning:
: The High-Interest Credit Card of Technical Debt
: http://static.googleusercontent.com/media/research.google.com/e
: 其中我很感慨的就是下面这段:自己重写一个专用的工具比开源的通用工具靠谱。
: It may be surprising to the academic community to know that only a tiny
: fraction of the code in
: many machine learning systems is actually doing “machine learning”. When
: we recognize that a
: mature system might end up being (at most) 5% machine learning code and (at
: least) 95% glue code,

1 (共1页)
进入DataSciences版参与讨论
相关主题
非csee的engineer做machine learning engineer,办身份会有问题吗
machine learning startup内推 (转)
学习 使用 spark需要会machine learning理论么
Machine Learning硕士求推荐
请教各位大牛
又要开始找工作了, 想找 machine learning方面的工作机会
Azure Machine Learning视频
找DS工作, 一定要懂(甚至精通)machine learning或者statistics
[请教] machine learning 面试
求审稿机会-Machine Learning, Bioinformatics方向
相关话题的讨论汇总
话题: learning话题: machine话题: debt话题: card话题: technical