请教一个关于k-means的问题。 - CS版 - 未名存档

本页内容为未名空间相应帖子的节选和存档，一周内的贴子最多显示50字，超过一周显示500字访问原贴

CS版 - 请教一个关于k-means的问题。

相关主题
● 问个 gaussian mixture的问题	● [转载] How to minimize this variance?
● probit regression一问 (转载)	● [转载] 求救，optimization问题
● 欢迎大家积极讨论一个ms简单的算法面试题 (转载)	● Mapquest面试题，大伙儿看看
● EM 算法	● 请教一道算法题
● [合集] 问个 EM 的问题	● [合集] 问个人工智能的问题
● [合集] 问个 gaussian distribution distance的问题	● 一个算法求助
● 请问板上有人对gaussian process熟吗	● shortest path algorithm(dijkstra)的变形
● Need Help on Facility Location problem	● 问个算法题，给个简单的思路就好。

相关话题的讨论汇总
话题: means话题: distance话题: use话题: total话题: mixture

进入CS版参与讨论

1

(共1页)

d******e 发帖数: 7844	1 【以下文字转载自 Statistics 讨论区】发信人: drburnie (专门爆料), 信区: Statistics 标题: 请教一个关于k-means的问题。发信站: BBS 未名空间站 (Tue Aug 25 16:37:40 2009, 美东) 我现在在比较Gaussian Mixture Model和K-means。虽然Gaussian Mixture用EM算法只能获得local optimal,但可以随机执行若干次，取 likelihood最大的结果。对于K-means，每次也只能获得局部最优，虽然也可以随机执行若干次，但是无法比较哪次更好。一般来讲，这个应该怎么衡量？
T**********n 发帖数: 480	2 kmeans之后不是要跑个最近邻测准确率么？【在 d******e 的大作中提到】 : 【以下文字转载自 Statistics 讨论区】 : 发信人: drburnie (专门爆料), 信区: Statistics : 标题: 请教一个关于k-means的问题。 : 发信站: BBS 未名空间站 (Tue Aug 25 16:37:40 2009, 美东) : 我现在在比较Gaussian Mixture Model和K-means。 : 虽然Gaussian Mixture用EM算法只能获得local optimal,但可以随机执行若干次，取 : likelihood最大的结果。 : 对于K-means，每次也只能获得局部最优，虽然也可以随机执行若干次，但是无法比较 : 哪次更好。一般来讲，这个应该怎么衡量？
z*****e 发帖数: 231	3 You cannot use the predictive accuracy to measure the convergence of the algorithm. Instead, you should use the criteria you are trying to maximize/ minimize. In k-means, I think you can use the total distance of each data point from the centroids.
d******e 发帖数: 7844	4 K-means can always minimize the total distance to 0. Some other criterion is required to evaluate the convergence. 【在 z*****e 的大作中提到】 : You cannot use the predictive accuracy to measure the convergence of the : algorithm. Instead, you should use the criteria you are trying to maximize/ : minimize. In k-means, I think you can use the total distance of each data : point from the centroids.
l******e 发帖数: 470	5 k-means minimizes the SQUARED distance. 【在 z*****e 的大作中提到】 : You cannot use the predictive accuracy to measure the convergence of the : algorithm. Instead, you should use the criteria you are trying to maximize/ : minimize. In k-means, I think you can use the total distance of each data : point from the centroids.
l******e 发帖数: 470	6 ???? 【在 d******e 的大作中提到】 : K-means can always minimize the total distance to 0. : Some other criterion is required to evaluate the convergence.
d******e 发帖数: 7844	7 我看错了... ... 我把total distance算错了... ... 【在 l******e 的大作中提到】 : : ????
N**D 发帖数: 10322	8 they are equivalaent under assumptions 【在 d******e 的大作中提到】 : 【以下文字转载自 Statistics 讨论区】 : 发信人: drburnie (专门爆料), 信区: Statistics : 标题: 请教一个关于k-means的问题。 : 发信站: BBS 未名空间站 (Tue Aug 25 16:37:40 2009, 美东) : 我现在在比较Gaussian Mixture Model和K-means。 : 虽然Gaussian Mixture用EM算法只能获得local optimal,但可以随机执行若干次，取 : likelihood最大的结果。 : 对于K-means，每次也只能获得局部最优，虽然也可以随机执行若干次，但是无法比较 : 哪次更好。一般来讲，这个应该怎么衡量？
K****n 发帖数: 5970	9 要是本来就没label咋办【在 T**********n 的大作中提到】 : kmeans之后不是要跑个最近邻测准确率么？
K****n 发帖数: 5970	10 。。。嗯，我觉得人家说的total square error挺好的，和maximum likelihood多搭呀我看就用k-mean吧，既不用log又不用矩阵，写起来啥numerical issue都没有【在 d******e 的大作中提到】 : 我看错了... ... : 我把total distance算错了... ...
w***s 发帖数: 424	11 Exactly, when K-means use squared loss. 【在 N**D 的大作中提到】 : they are equivalaent under assumptions

1

(共1页)

进入CS版参与讨论

相关主题
● 问个算法题，给个简单的思路就好。	● [合集] 问个 EM 的问题
● 请教一个找最短的闭合曲线的问题	● [合集] 问个 gaussian distribution distance的问题
● 急问个优化的问题	● 请问板上有人对gaussian process熟吗
● 【包子贴】请教非线性优化问题有哪些算法不错	● Need Help on Facility Location problem
● 问个 gaussian mixture的问题	● [转载] How to minimize this variance?
● probit regression一问 (转载)	● [转载] 求救，optimization问题
● 欢迎大家积极讨论一个ms简单的算法面试题 (转载)	● Mapquest面试题，大伙儿看看
● EM 算法	● 请教一道算法题

相关话题的讨论汇总
话题: means话题: distance话题: use话题: total话题: mixture

未名新帖统计// 7月16日

#	版面	帖数(主题数)
-	全站	4871 (796)
1	Military	3777 (569)
2	Stock	341 (51)
3	Joke	117 (17)
4	History	116 (3)
5	Automobile	100 (9)
6	USANews	55 (9)
7	Midlife	45 (1)
8	Headline	41 (41)
9	Dreamer	33 (13)
10	FleaMarket	32 (20)
11	Living	30 (7)

* 这里只显示发帖超过25的版面，努力灌水吧:-)