[Archived excerpt of the corresponding 未名空间 (mitbbs) thread; long posts are truncated by the archive.]
Military board - AI modeling is statistics; fast computation is CS
Topics: search, go, game, policy, value
w********2
Posts: 632
1
Add the two together and you get AI.
d*****u
Posts: 17243
2
Deep learning these days does little explicit modeling. It doesn't care about the statistical distribution of the data and so on; you just throw a neural network at it, since in theory it can fit any function.
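The "fit any function" remark is the universal-approximation idea. A minimal sketch of it, with no modeling of the data's distribution at all: a one-hidden-layer network trained by plain gradient descent to fit sin(πx). The network width, learning rate, step count, and target function are all illustrative assumptions, not anything from the thread.

```python
import numpy as np

# Toy illustration: a one-hidden-layer tanh network trained by plain
# full-batch gradient descent to approximate f(x) = sin(pi*x).
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(256, 1))
y = np.sin(np.pi * X)

H, lr = 32, 0.05                                 # width and step size (illustrative)
W1 = rng.normal(0, 1.0, (1, H)); b1 = np.zeros(H)
W2 = rng.normal(0, 1.0 / np.sqrt(H), (H, 1)); b2 = np.zeros(1)

for _ in range(30000):
    h = np.tanh(X @ W1 + b1)                     # hidden activations
    pred = h @ W2 + b2                           # network output
    err = pred - y
    loss = float(np.mean(err ** 2))
    g_pred = 2 * err / len(X)                    # d(loss)/d(pred)
    g_h = (g_pred @ W2.T) * (1 - h ** 2)         # backprop through tanh
    W2 -= lr * (h.T @ g_pred); b2 -= lr * g_pred.sum(0)
    W1 -= lr * (X.T @ g_h);    b1 -= lr * g_h.sum(0)

print(loss)  # mean-squared error after training; should be small
```

No statistics of the input are used anywhere; the network just fits the mapping, which is exactly the point being made.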
w********2
Posts: 632
3
Without that image database, Fei-Fei Li couldn't have done anything, so it's still supervised: there are labeled categories, and similarity scores are computed.
w********2
Posts: 632
4
There is modeling, at the bottom layer. You build hundreds, thousands, or even more candidate models, then pick the one that best fits the training set.

[Quoting d*****u]
: Deep learning these days does little explicit modeling. It doesn't care about
: the statistical distribution of the data and so on; you just throw a neural
: network at it, since in theory it can fit any function.
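That "build many candidates, keep the best fit" procedure can be sketched with polynomial models standing in for the candidates. The data, noise level, and degree range are made up for illustration; note that the training set alone tends to favor the most flexible candidate, which connects to the overfitting point raised a few posts below.

```python
import numpy as np

# Candidate "models" are polynomials of increasing degree; selection keeps
# whichever one best fits the training set.
rng = np.random.default_rng(1)
x = rng.uniform(-1, 1, 20)
y = x + 0.3 * rng.normal(size=20)          # noisy linear ground truth

def train_error(deg):
    coeffs = np.polyfit(x, y, deg)
    return float(np.mean((np.polyval(coeffs, x) - y) ** 2))

errors = {d: train_error(d) for d in range(1, 13)}
best = min(errors, key=errors.get)
print(best)  # training error alone typically picks the highest degree
```

Training-set fit cannot penalize complexity by itself; a held-out set is what exposes the problem.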

d*****u
Posts: 17243
5
Apart from the network structure there isn't much of a model. Deep learning models are all basically similar; at most you try a handful of network architectures, then tune the hyperparameters. Everything else comes down to training.

[Quoting w********2]
: There is modeling, at the bottom layer. You build hundreds, thousands, or even
: more candidate models, then pick the one that best fits the training set.
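The "tune the hyperparameter, let training do the rest" loop in miniature: ridge regression stands in for the network, and the only modeling decision is the penalty strength, chosen by validation error. The λ grid, data shapes, and split are all invented for this sketch.

```python
import numpy as np

# The single hyperparameter is the ridge penalty lam; everything else is fitting.
rng = np.random.default_rng(2)
X = rng.normal(size=(60, 10))
w_true = np.zeros(10); w_true[:3] = 1.0
y = X @ w_true + 0.5 * rng.normal(size=60)
Xtr, ytr, Xva, yva = X[:40], y[:40], X[40:], y[40:]

def fit_ridge(lam):
    # Closed-form ridge solution: (X'X + lam*I)^{-1} X'y
    return np.linalg.solve(Xtr.T @ Xtr + lam * np.eye(10), Xtr.T @ ytr)

grid = [0.01, 0.1, 1.0, 10.0, 100.0]
val_err = {lam: float(np.mean((Xva @ fit_ridge(lam) - yva) ** 2))
           for lam in grid}
best_lam = min(val_err, key=val_err.get)
print(best_lam)
```

Selecting λ on a held-out split rather than the training set is what keeps the procedure honest.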
w********2
Posts: 632
6
Exactly: it comes down to tuning parameters, but the optimization is driven by the training set, so it overfits easily. The key to the current AI boom is having large databases with fast read and write, which makes it practical; similar techniques already existed in the 1990s.

[Quoting d*****u]
: Apart from the network structure there isn't much of a model. Deep learning
: models are all basically similar; at most you try a handful of network
: architectures, then tune the hyperparameters. Everything else comes down to
: training.
w********2
Posts: 632
7
An example: matching an NMR sample spectrum against a standard library and ranking the hits by score. Same idea.

[Quoting w********2]
: Exactly: it comes down to tuning parameters, but the optimization is driven by
: the training set, so it overfits easily. The key to the current AI boom is
: having large databases with fast read and write; similar techniques already
: existed in the 1990s.
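The NMR analogy, sketched: score a query "spectrum" against a reference library and rank by similarity. The compound names and intensity vectors below are invented; real spectra and libraries are of course far larger.

```python
import numpy as np

# Rank library entries by cosine similarity to the query spectrum.
def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

library = {
    "compound_A": np.array([0.0, 1.0, 0.2, 0.0, 0.8]),
    "compound_B": np.array([0.9, 0.1, 0.0, 0.7, 0.0]),
    "compound_C": np.array([0.0, 0.9, 0.3, 0.1, 0.7]),
}
sample = np.array([0.02, 0.98, 0.21, 0.01, 0.79])   # noisy copy of A's pattern

scores = sorted(((cosine(sample, s), name) for name, s in library.items()),
                reverse=True)
print(scores[0][1])  # best match: compound_A
```

The point of the analogy: "recognition" here is nothing more than scoring against stored references and sorting.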

w********2
Posts: 632
8
"[a] Go program, based on a combination of deep neural networks and tree search"
http://web.iitd.ac.in/~sumeet/Silver16.pdf
w********2
Posts: 632
9
"We have introduced a new search algorithm that successfully combines neural
network evaluations with Monte Carlo rollouts. Our program AlphaGo integrates
these components together, at scale, in a high-performance tree search engine."
"a new search algorithm": this is the key.
w********2
Posts: 632
10
"During the match against Fan Hui, AlphaGo evaluated thousands of times fewer
positions than Deep Blue did in its chess match against Kasparov [4];
compensating by selecting those positions more intelligently, using the policy
network, and evaluating them more precisely, using the value network — an
approach that is perhaps closer to how humans play."
w********2
Posts: 632
11
"Go is exemplary in many ways of the difficulties faced by artificial
intelligence [33, 34]: a challenging decision-making task, an intractable
search space, and an optimal solution so complex it appears infeasible to
directly approximate using a policy or value function."
w********2
Posts: 632
12
"By combining tree search with policy and value networks, AlphaGo has finally
reached a professional level in Go."
w********2
Posts: 632
13
an intractable search space: CS
a challenging decision-making task: tree search, statistics
a policy or value function: statistics
w********2
Posts: 632
14
"All games of perfect information have an optimal value function, v*(s), which
determines the outcome of the game, from every board position or state s, under
perfect play by all players. These games may be solved by recursively computing
the optimal value function in a search tree containing approximately b^d
possible sequences of moves, where b is the game's breadth (number of legal
moves per position) and d is its depth (game length)."
This is brute force applied to the rules of the game; basically, the game is
cracked.
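The recursive computation of v*(s) from the quote can be shown on a game tiny enough to search exhaustively. Single-pile Nim (remove 1-3 stones, taking the last stone wins) stands in here; the choice of game is mine, not the paper's.

```python
from functools import lru_cache

# Exhaustive minimax: the tree has on the order of b^d nodes, which is why
# this only works for tiny games.
@lru_cache(maxsize=None)
def value(stones, player):
    """Optimal value v*(s) from player +1's perspective; `player` is to move."""
    if stones == 0:
        return -player          # the previous player took the last stone and won
    moves = [value(stones - k, -player) for k in range(1, min(3, stones) + 1)]
    return max(moves) if player == 1 else min(moves)

# Under optimal play, piles with stones % 4 == 0 are losses for the mover.
print([value(n, 1) for n in range(1, 9)])  # -> [1, 1, 1, -1, 1, 1, 1, -1]
```

Once `value` is computed for every state, the game is "cracked" in exactly the sense the post means.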
w********2
Posts: 632
15
I think ML is more mathematical optimization than statistics or CS.
w********2
Posts: 632
16
"In large games, such as chess (b ≈ 35, d ≈ 80) [1] and especially Go (b ≈ 250,
d ≈ 150) [1], exhaustive search is infeasible [2, 3], but the effective search
space can be reduced by two general principles. First, the depth of the search
may be reduced by position evaluation: truncating the search tree at state s
and replacing the subtree below s by an approximate value function v(s) ≈ v*(s)
that predicts the outcome from state s. This approach has led to superhuman
performance in chess [4], checkers [5] and Othello [6], but it was believed to
be intractable in Go due to the complexity of the game [7]. Second, the breadth
of the search may be reduced by sampling actions from a policy p(a|s) that is a
probability distribution over possible moves a in position s. For example,
Monte Carlo rollouts [8] search to maximum depth without branching at all, by
sampling long sequences of actions for both players from a policy p. Averaging
over such rollouts can provide an effective position evaluation, achieving
superhuman performance in backgammon [8] and Scrabble [9], and weak amateur
level play in Go [10]."
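The Monte Carlo rollout idea from the quote, in miniature: sample complete games for both players from a policy, without branching, and average the outcomes as a position evaluation. The toy game (single-pile Nim, remove 1-3 stones, last stone wins) and the uniform random policy are both illustrative choices of mine.

```python
import random

# Rollout evaluation: play whole games from the current state by sampling
# moves from a policy (here uniform random), then average the results.
def rollout(stones, player=1):
    while stones > 0:
        stones -= random.randint(1, min(3, stones))
        player = -player
    return -player                    # +1 if the first player took the last stone

def mc_value(stones, n=5000, seed=0):
    random.seed(seed)
    return sum(rollout(stones) for _ in range(n)) / n

# Piles with stones % 4 == 0 lose under optimal play; even random rollouts
# give them a visibly lower average value than winning piles.
print(mc_value(5), mc_value(4))
```

The averages only approximate v*(s), but as the quote says, that approximation is often good enough to guide play, and it needs no tree at all.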
w********2
Posts: 632
17
In other words, if computer speed increases a million-fold in the future,
AlphaGo could go straight to brute-force exhaustive search without any of these
simplifications, and would be more accurate.
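For a sense of scale, here is a back-of-envelope size of those search spaces, using the b and d estimates quoted above. These are orders of magnitude of b^d, not exact counts of legal positions.

```python
import math

# log10(b^d) = d * log10(b), with the paper's rough estimates:
# chess b ~ 35, d ~ 80; Go b ~ 250, d ~ 150.
chess = 80 * math.log10(35)
go = 150 * math.log10(250)
print(round(chess), round(go))  # -> 124 360
```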
w********2
Posts: 632
18
So the only way to beat AlphaGo is for the human player to break the assumed
rules of play, so that AlphaGo has to recalculate every time.
w********2
Posts: 632
19
Don't be a perfect player when playing against AlphaGo. That is the key.

[Quoting w********2]
: "All games of perfect information have an optimal value function, v*(s), which
: determines the outcome of the game, from every board position or state s, under
: perfect play by all players. These games may be solved by recursively computing
: the optimal value function in a search tree containing approximately b^d
: possible sequences of moves, where b is the game's breadth (number of legal
: moves per position) and d is its depth (game length)."
: This is brute force applied to the rules of the game; basically, the game is
: cracked.

w********2
Posts: 632
20
In other words, try to beat the algorithm by playing oddly rather than by
trying to win the game. You may deadlock the machine (very long calculation at
each move), and then win.

[Quoting w********2]
: Don't be a perfect player when playing against AlphaGo. That is the key.

w********2
Posts: 632
21
More precisely: break its assumptions.

[Quoting w********2]
: In other words, try to beat the algorithm by playing oddly rather than by
: trying to win the game. You may deadlock the machine (very long calculation
: at each move), and then win.