SAS: find the first 100 mostly frequently used words in a large database - Statistics版 - 未名存档

本页内容为未名空间相应帖子的节选和存档，一周内的贴子最多显示50字，超过一周显示500字访问原贴

Statistics版 - SAS: find the first 100 mostly frequently used words in a large database

相关主题
● 急问：医院里都用什么database啊？面试求助！	● 两个简单的SAS问题
● pro freq +table	● SAS EXCEL的问题
● SAS/S-plus/R 作图问题请教	● ASK FOR ONE SAS QUESTION
● ods rtf startpage=never does not work anymore in SAS9.2?	● R重要还是SAS重要？
● a SAS question	● non inferiority test for the difference of two proportions in "proc freq" ?
● 新人求问SAS简单问题~~	● SAS code - help needed. 8 个包子酬谢
● 求教proc sql 问题	● 求助：data manipulation的一个问题
● SAS能不能批处理变量？	● 中级SAS问题

相关话题的讨论汇总
话题: sas话题: mostly话题: words话题: bio话题: database

进入Statistics版参与讨论

1

(共1页)

t**********r 发帖数: 182	1 Here I have a large database. One key variable is BIO (CEO/CFO's bio description). I want to find the words that are mostly frequently appearned; say, the first 100 words that are mostly used. How can I do this using SAS? many thanks.
A*******s 发帖数: 3942	2 SAS EM有个text miner。虽然我没用过，不过顾名思义+看图识字，应该是它了吧。 appearned; SAS? 【在 t**********r 的大作中提到】 : Here I have a large database. One key variable is BIO (CEO/CFO's bio : description). I want to find the words that are mostly frequently appearned; : say, the first 100 words that are mostly used. How can I do this using SAS? : many thanks.
d*******1 发帖数: 854	3 data bioword; set database; i=1; do while (scan(bio,i,' ') ne '') ; word=scan(bio,i,' '); i=i+1; output; end; keep word; run; proc freq data=bioword; table word/out=freq; run; proc sort data=freq; by descending count; run; appearned; SAS? 【在 t**********r 的大作中提到】 : Here I have a large database. One key variable is BIO (CEO/CFO's bio : description). I want to find the words that are mostly frequently appearned; : say, the first 100 words that are mostly used. How can I do this using SAS? : many thanks.
A*******s 发帖数: 3942	4 鸡蛋里挑挑骨头--还得考虑uppercase/lowercase, single/plural,tenses for verbs... 【在 d*******1 的大作中提到】 : data bioword; : set database; : i=1; : do while (scan(bio,i,' ') ne '') ; : word=scan(bio,i,' '); : i=i+1; : output; : end; : keep word; : run;

1

(共1页)

进入Statistics版参与讨论

相关主题
● 中级SAS问题	● a SAS question
● 新人拜山，请教做SAS programmer主要用哪些procedure？	● 新人求问SAS简单问题~~
● SAS 问题请教	● 求教proc sql 问题
● SAS problem ask for help!	● SAS能不能批处理变量？
● 急问：医院里都用什么database啊？面试求助！	● 两个简单的SAS问题
● pro freq +table	● SAS EXCEL的问题
● SAS/S-plus/R 作图问题请教	● ASK FOR ONE SAS QUESTION
● ods rtf startpage=never does not work anymore in SAS9.2?	● R重要还是SAS重要？

相关话题的讨论汇总
话题: sas话题: mostly话题: words话题: bio话题: database

未名新帖统计// 7月16日

#	版面	帖数(主题数)
-	全站	4871 (796)
1	Military	3777 (569)
2	Stock	341 (51)
3	Joke	117 (17)
4	History	116 (3)
5	Automobile	100 (9)
6	USANews	55 (9)
7	Midlife	45 (1)
8	Headline	41 (41)
9	Dreamer	33 (13)
10	FleaMarket	32 (20)
11	Living	30 (7)

* 这里只显示发帖超过25的版面，努力灌水吧:-)