This page is an excerpt and archive of the corresponding post on 未名空间 (MITBBS).
JobHunting board - A question
s******s
Posts: 84
1
I have a question I'd like to ask everyone. Thanks.
You have a 200 GB text file and a Linux box with 8GB of RAM and 4 cores.
Write a program/script that outputs a file listing the frequency of all
words in the file (i.e. a TSV file with two columns). Note
that the set of words in the file may not fit in memory.
f**********t
Posts: 1001
2
mlock an 8 GB buffer.
Run 4 threads: the 1st processes the 0-2G slice of the buffer, the 2nd the
2-4G slice, the 3rd the 4-6G slice, and so on. Each thread produces its own
unordered_map.
Each round, mmap the next 8 GB of the file into memory.
At the end, merge the unordered_maps.
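
A rough sketch of the chunk-count-merge idea above, in Python rather than with mlock/mmap and unordered_map. Everything named here (chunk_ranges, count_range, word_frequencies, the 1 MiB block size) is made up for illustration and is not from the post. It assumes words are whitespace-separated bytes. It streams each slice in small blocks instead of pinning 8 GB, since locking all of RAM would leave nothing for the maps. The final merge still assumes the distinct words fit in memory, which the question says may not hold.

```python
import os
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def chunk_ranges(path, n_chunks):
    """Split the file into byte ranges whose boundaries fall after whitespace."""
    size = os.path.getsize(path)
    cuts = [0]
    with open(path, "rb") as f:
        for i in range(1, n_chunks):
            pos = max(size * i // n_chunks, cuts[-1])
            f.seek(pos)
            # Advance past the next whitespace byte so no word straddles a cut.
            while True:
                b = f.read(1)
                if not b or b.isspace():
                    break
            cuts.append(f.tell())
    cuts.append(size)
    return list(zip(cuts[:-1], cuts[1:]))


def count_range(job):
    """Count words in bytes [start, end) of the file, reading block by block."""
    path, start, end, block = job
    counts = Counter()
    tail = b""
    with open(path, "rb") as f:
        f.seek(start)
        remaining = end - start
        while remaining > 0:
            data = f.read(min(block, remaining))
            if not data:
                break
            remaining -= len(data)
            data = tail + data
            words = data.split()
            # If the block ends mid-word, hold the last piece for the next read.
            if words and not data[-1:].isspace():
                tail = words.pop()
            else:
                tail = b""
            counts.update(words)
    if tail:
        counts[tail] += 1
    return counts


def word_frequencies(path, out_path, workers=4, block=1 << 20):
    """Count each slice in its own worker, merge, and write a two-column TSV."""
    jobs = [(path, s, e, block) for s, e in chunk_ranges(path, workers)]
    total = Counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for partial in pool.map(count_range, jobs):
            total.update(partial)
    with open(out_path, "wb") as out:
        for word, n in sorted(total.items()):
            out.write(word + b"\t" + str(n).encode() + b"\n")
    return total
```

Threads mirror the "4 threads" in the post. In CPython the GIL keeps them from using 4 cores for the counting itself, so a process pool would be the likely swap if speed matters.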

[s******s wrote:]
: I have a question I'd like to ask everyone. Thanks.
: You have a 200 GB text file and a Linux box with 8GB of RAM and 4 cores.
: Write a program/script that outputs a file listing the frequency of all
: words in the file (i.e. a TSV file with two columns). Note
: that the set of words in the file may not fit in memory.
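
On the quoted note that the set of words may not fit in memory: one common way around it, not something proposed in this thread, is to hash-partition the words into bucket files on disk and then count one bucket at a time. A minimal sketch follows. The names partition_words and count_partitions and the part-NNNN.txt file naming are hypothetical. It assumes whitespace-separated words, lines short enough to read one at a time, and enough scratch disk for roughly another copy of the input.

```python
import os
import zlib
from collections import Counter


def partition_words(path, tmp_dir, n_parts):
    """Pass 1: route each word to one of n_parts bucket files by hash."""
    names = [os.path.join(tmp_dir, f"part-{i:04d}.txt") for i in range(n_parts)]
    buckets = [open(name, "wb") for name in names]
    try:
        with open(path, "rb") as f:
            for line in f:
                for word in line.split():
                    # crc32 is stable across runs, unlike the built-in hash().
                    buckets[zlib.crc32(word) % n_parts].write(word + b"\n")
    finally:
        for b in buckets:
            b.close()
    return names


def count_partitions(names, out_path):
    """Pass 2: count one bucket at a time and append its rows to the TSV."""
    with open(out_path, "wb") as out:
        for name in names:
            counts = Counter()
            with open(name, "rb") as f:
                for line in f:
                    counts[line.rstrip(b"\n")] += 1
            for word, n in counts.items():
                out.write(word + b"\t" + str(n).encode() + b"\n")
```

Every occurrence of a given word lands in the same bucket, so each bucket can be counted on its own and no cross-bucket merge is needed. n_parts would be picked so that one bucket's distinct words fit comfortably in RAM. The output rows come out grouped by bucket rather than sorted.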
