分享Imo电面题 - JobHunting版

本页内容为未名空间相应帖子的节选和存档，一周内的贴子最多显示50字，超过一周显示500字访问原贴

JobHunting版 - 分享Imo电面题

相关主题
● 这段LIS为啥崩溃？	● 发一个fb面经
● java没有指针真麻烦	● leetcode一题没看明白
● 发个Twitter的面试题	● 求解这个动态规划题
● 刚刚FB电面试完	● 大家帮忙解释一个 LeetCode DP (distinct subsequences)
● 寻找下一个回文数	● Question on leetcode Distinct Subsequences
● Maximum Sum of Increasing Sequence	● GOOG intern interview 题目
● 面试题count # of increasing subsequences of String求解	● google题
● Interleave Strings那个题目有O(n)时间　O(1)空间算法么？	● 最长递增子array的算法

相关话题的讨论汇总
话题: dp话题: string话题: si话题: int话题: boolean

进入JobHunting版参与讨论

(共1页)

r****m
发帖数: 70

一面
You are given two words A and B of the same length from a dictionary D. You
can only access this dictionary through a function boolean isInDictionary(
string word). We are going to make a word ladder. We start at A, we end with
B, and change one letter at every step.
All words are over the alphabet [a-z]. |A| <= 10 characters. |D| <= 10 000
000 words.
We are looking for a shortest word ladder, if any exists. If many exist,
return any one of them.
A=dog, B=let
D={dog, let, log, leg, puzzle, bicycle}
dog
log
leg
let
二面
1. given a cactus graph, determine the number of different spanning trees of
this graph.
2. Given a very large string T, |T| = 10 000 000 chars
a stream of small strings Si
check if Si is a subsequence of T ? return true/false
number of Si ~ 100 000 000 (strings)
|Si| ~ 100 chars
'a' to 'z'
T = abcdefg
S1 = abc yes
S2 = ag yes
S3 = ga no
S4 = aa no

j*****y
发帖数: 1071

cactus graph那道题是要写 code 吗？

You
with

【在 r****m 的大作中提到】

: 一面
: You are given two words A and B of the same length from a dictionary D. You
: can only access this dictionary through a function boolean isInDictionary(
: string word). We are going to make a word ladder. We start at A, we end with
: B, and change one letter at every step.
: All words are over the alphabet [a-z]. |A| <= 10 characters. |D| <= 10 000
: 000 words.
: We are looking for a shortest word ladder, if any exists. If many exist,
: return any one of them.
: A=dog, B=let

r****m
发帖数: 70

所有题都要

j********g
发帖数: 80

前一阵一面直接就问了我五道题，然后就跪了～～

p*****2
发帖数: 21240

第一题bfs，第三题trie
第二题忘记spaning tree的算法了。
这是什么公司？

l**b
发帖数: 457

二爷不用DP，奇怪啊！第3题用Trie的话memory吃不消吧？用DP是不是好点？
贴个简单的代码，helper是recursive的，helper2是DP的，2爷帮忙看看，是不是又出
错了：
public boolean isSubSequence(String s, String t) {
assert(s != null && t != null);
if (t.length() == 0) return true;
//return helper(s, t, 0, 0);
return helper2(s, t);
}

private boolean helper(String s, String t, int si, int ti) {
if (ti == t.length()) return true;
if (si >= s.length()) return false;

if (s.charAt(si) == t.charAt(ti)) {
return helper(s, t, si + 1, ti + 1);
} else {
return helper(s, t, si + 1, ti);
}
}

private boolean helper2(String s, String t) {
boolean[][] dp = new boolean[t.length() + 1][s.length() + 1];
dp[0][0] = true;
for (int i = 0; i <= s.length(); i++) {
dp[0][i] = true;
}

for (int i = 1; i <= t.length(); i++) {
for (int j = 1; j <= s.length(); j++) {
if (t.charAt(i - 1) == s.charAt(j - 1)) {
dp[i][j] = dp[i - 1][j - 1];
} else {
dp[i][j] = dp[i][j - 1];
}
}
}

return dp[t.length()][s.length()];
}

【在 p*****2 的大作中提到】

: 第一题bfs，第三题trie
: 第二题忘记spaning tree的算法了。
: 这是什么公司？

l**b
发帖数: 457

然后想了一下，DP的空间好像可以优化到O(|Si|):
private boolean helper3(String s, String t) {
boolean[] dp = new boolean[s.length() + 1];
for (int i = 0; i <= s.length(); i++) {
dp[i] = true;
}
for (int i = 1; i <= t.length(); i++) {
boolean[] newDp = new boolean[s.length() + 1];
for (int j = 1; j <= s.length(); j++) {
if (t.charAt(i - 1) == s.charAt(j - 1)) {
newDp[j] = dp[j - 1];
} else {
newDp[j] = newDp[j - 1];
}
}
dp = newDp;
}

return dp[s.length()];
}

p*****2
发帖数: 21240

刚看了一下题，昨天看错题目了。我再看看。

【在 l**b 的大作中提到】

: 二爷不用DP，奇怪啊！第3题用Trie的话memory吃不消吧？用DP是不是好点？
: 贴个简单的代码，helper是recursive的，helper2是DP的，2爷帮忙看看，是不是又出
: 错了：
: public boolean isSubSequence(String s, String t) {
: assert(s != null && t != null);
: if (t.length() == 0) return true;
: //return helper(s, t, 0, 0);
: return helper2(s, t);
: }
:

j********g
发帖数: 80

第二题是不是把s按顺序扫一遍就行了，我没理解错吧
bool find_substr(string si, string t){
int s_len=si.size();
int t_len=t.size();
int j=0;
for(int i=0; i while(si[i]!=t[j] && j j++;
if(j==t_len)
return false;
j++;
}
return true;
}

l**b
发帖数: 457

好像是啊，我想太复杂了。

【在 j********g 的大作中提到】

: 第二题是不是把s按顺序扫一遍就行了，我没理解错吧
: bool find_substr(string si, string t){
: int s_len=si.size();
: int t_len=t.size();
: int j=0;
: for(int i=0; i: while(si[i]!=t[j] && j: j++;
: if(j==t_len)
: return false;

相关主题
● Maximum Sum of Increasing Sequence	● 发一个fb面经
● 面试题count # of increasing subsequences of String求解	● leetcode一题没看明白
● Interleave Strings那个题目有O(n)时间　O(1)空间算法么？	● 求解这个动态规划题
进入JobHunting版参与讨论

H****s
发帖数: 247

原题要看是否subsequence 不是 substr

【在 j********g 的大作中提到】

a**********e
发帖数: 22

这样也行吧？
static boolean isSubString(char[] s, char[] t){
int s_len = s.length;
int t_len = t.length;
int j = 0;
for(int i = 0; i < s_len; i++){
if(s[i]==t[j]){
if(j == t_len -1){
return true;
}
j++;
}

}
return false;
}

【在 j********g 的大作中提到】

H****s
发帖数: 247

不过除了函数名，code 是对的！

【在 H****s 的大作中提到】

: 原题要看是否subsequence 不是 substr

H****s
发帖数: 247

这个code有问题吧？ if(s[i]!=t[j]) j不变？ s 跟 t 搞反了？

【在 a**********e 的大作中提到】

: 这样也行吧？
: static boolean isSubString(char[] s, char[] t){
: int s_len = s.length;
: int t_len = t.length;
: int j = 0;
: for(int i = 0; i < s_len; i++){
: if(s[i]==t[j]){
: if(j == t_len -1){
: return true;
: }

t****a
发帖数: 1212

第三题很有趣，偶来给个clojure的解
基本思路是
1. 对T做预处理，构建graph，同时，记录每个字符a-z所出现的位置
2. 由于S长度<100，数量却很大(100M)，意味着有很多重复，使用memoize function会
大大减少计算量
下面的程序在1/1000大小的S数据集上运行需要1.7秒。观察了一下时间和数据集大小趋
势，在完整数据集上几分钟内也可以产生结果
(def alphabet (map char (range (int \a) (inc (int \z)))))
(def T (vec (take 10000000 (repeatedly #(rand-nth alphabet)))))
(def S (take 100000 (repeatedly (fn [] (take (inc (int (* (rand) 100))) (
repeatedly #(rand-nth alphabet)))))))
(def T-ix (group-by #(T %) (range (count T))))
(def T-map (let [pairs (distinct (partition 2 (interleave (pop T) (next T))))
map0 (group-by #(first %) pairs)]
(zipmap (keys map0) (map last (vals map0)))))
(def good-align?
(memoize
(fn [[x & xs] pos]
(let [xp-set (T-ix x)
xp (first (drop-while #(<= % pos) xp-set))]
(if (nil? xp)
false
(if (empty? xs)
true
(let [y (first xs)]
(if (and (contains? T-map x) (contains? (T-map x) y))
false
(good-align? xs xp)))))))))
(time (count (pmap #(good-align? % -1) S)))
; "Elapsed time: 1727.207492 msecs"
另外请问楼主，Imo是什么公司啊？

l**h
发帖数: 893

第三题怎么用trie呢?

【在 p*****2 的大作中提到】

: 第一题bfs，第三题trie
: 第二题忘记spaning tree的算法了。
: 这是什么公司？

p*****2
发帖数: 21240

不好意思，我当时以为是substring

【在 l**h 的大作中提到】

: 第三题怎么用trie呢?

l**h
发帖数: 893

假设是substring的话，建立一个suffix tree?
感觉题目特别强调原始String很大，估计建立suffix tree memory消耗很大吧

【在 p*****2 的大作中提到】

:
: 不好意思，我当时以为是substring

t****a
发帖数: 1212

不懂suffix tree的用法，不过根据
http://en.wikipedia.org/wiki/Suffix_tree
alignment中的gap是不能乱来的，影响计算速度

【在 l**h 的大作中提到】

: 假设是substring的话，建立一个suffix tree?
: 感觉题目特别强调原始String很大，估计建立suffix tree memory消耗很大吧

r****m
发帖数: 70

相关主题
● 大家帮忙解释一个 LeetCode DP (distinct subsequences)	● google题
● Question on leetcode Distinct Subsequences	● 最长递增子array的算法
● GOOG intern interview 题目	● career cup book v4 9.7 题
进入JobHunting版参与讨论

j*****y
发帖数: 1071

cactus graph那道题是要写 code 吗？

You
with

【在 r****m 的大作中提到】

r****m
发帖数: 70

所有题都要

j********g
发帖数: 80

前一阵一面直接就问了我五道题，然后就跪了～～

p*****2
发帖数: 21240

第一题bfs，第三题trie
第二题忘记spaning tree的算法了。
这是什么公司？

l**b
发帖数: 457

: 第一题bfs，第三题trie
: 第二题忘记spaning tree的算法了。
: 这是什么公司？

l**b
发帖数: 457

p*****2
发帖数: 21240

刚看了一下题，昨天看错题目了。我再看看。

【在 l**b 的大作中提到】

j********g
发帖数: 80

l**b
发帖数: 457

好像是啊，我想太复杂了。

【在 j********g 的大作中提到】

H****s
发帖数: 247

原题要看是否subsequence 不是 substr

【在 j********g 的大作中提到】

相关主题
● 问个算法题5	● java没有指针真麻烦
● 【一个BB公司问的字母排序的问题】	● 发个Twitter的面试题
● 这段LIS为啥崩溃？	● 刚刚FB电面试完
进入JobHunting版参与讨论

a**********e
发帖数: 22

H****s
发帖数: 247

不过除了函数名，code 是对的！

【在 H****s 的大作中提到】

: 原题要看是否subsequence 不是 substr

H****s
发帖数: 247

这个code有问题吧？ if(s[i]!=t[j]) j不变？ s 跟 t 搞反了？

【在 a**********e 的大作中提到】

t****a
发帖数: 1212

l**h
发帖数: 893

第三题怎么用trie呢?

【在 p*****2 的大作中提到】

: 第一题bfs，第三题trie
: 第二题忘记spaning tree的算法了。
: 这是什么公司？

p*****2
发帖数: 21240

不好意思，我当时以为是substring

【在 l**h 的大作中提到】

: 第三题怎么用trie呢?

l**h
发帖数: 893

假设是substring的话，建立一个suffix tree?
感觉题目特别强调原始String很大，估计建立suffix tree memory消耗很大吧

【在 p*****2 的大作中提到】

:
: 不好意思，我当时以为是substring

t****a
发帖数: 1212

不懂suffix tree的用法，不过根据
http://en.wikipedia.org/wiki/Suffix_tree
alignment中的gap是不能乱来的，影响计算速度

【在 l**h 的大作中提到】

: 假设是substring的话，建立一个suffix tree?
: 感觉题目特别强调原始String很大，估计建立suffix tree memory消耗很大吧

h******2
发帖数: 13

有哪位大神能详细讲一下二面中的两道题吗？
对于第二道题，我能想到的是:
可以用一个map > m_Indexes;
对于每个char，value中记录了index列表。然后对于T中的每个char，先找第一个char
的leftest位置(smallest index), then find the second char, and it’s index
must be greater than the first’s char ‘s smallest index(that’s upper_
bound), and we do it iteratively until we all found it or not.
第一道 cactus的不太清楚怎么做啊？
二面
1. given a cactus graph, determine the number of different spanning trees of
this graph.
2. Given a very large string T, |T| = 10 000 000 chars
a stream of small strings Si
check if Si is a subsequence of T ? return true/false
number of Si ~ 100 000 000 (strings)
|Si| ~ 100 chars
'a' to 'z'

a*******3
发帖数: 27

第三题给个其他解法，说了查询字符串都较小，而且又是求subsequence，有个算法可
以将LCS转化成LIS来计算，转化方法在本题里面，是将短字符串中的每个字符，在T中
出现的位置，由大到小排列，然后求这个序列的LIS。
如
T=abacbdefb
a的位置有(0,2)，b的有(1,4,8),c(3),d(5),e(6),f(7)
S1 = aab =>(2,0,2,0,8,4,1),LIS=>(0,2,8)=3，故yes

相关主题
● 刚刚FB电面试完	● 面试题count # of increasing subsequences of String求解
● 寻找下一个回文数	● Interleave Strings那个题目有O(n)时间　O(1)空间算法么？
● Maximum Sum of Increasing Sequence	● 发一个fb面经
进入JobHunting版参与讨论

b*****u
发帖数: 648

free text search 应该是上suffix tree
尽管建tree开销大但是考虑是stream of short strings,还是有必要的，否则每个小字
符串都要遍历一遍大串

m****i
发帖数: 650

lmo什么公司

a*******3
发帖数: 27

T = abcdefg
S1 = abc yes
S2 = ag yes
S3 = ga no
S4 = aa no
看楼主给的example，S2，不是match substring，而是subsequence，所以suffix tree
在这里没有用的

【在 b*****u 的大作中提到】

: free text search 应该是上suffix tree
: 尽管建tree开销大但是考虑是stream of short strings,还是有必要的，否则每个小字
: 符串都要遍历一遍大串

d*******3
发帖数: 58

嗯，这个简单好写！

【在 j********g 的大作中提到】

d*******3
发帖数: 58

大牛，你这个我看不懂啊...

【在 t****a 的大作中提到】

: 第三题很有趣，偶来给个clojure的解
: 基本思路是
: 1. 对T做预处理，构建graph，同时，记录每个字符a-z所出现的位置
: 2. 由于S长度<100，数量却很大(100M)，意味着有很多重复，使用memoize function会
: 大大减少计算量
: 下面的程序在1/1000大小的S数据集上运行需要1.7秒。观察了一下时间和数据集大小趋
: 势，在完整数据集上几分钟内也可以产生结果
: (def alphabet (map char (range (int \a) (inc (int \z)))))
: (def T (vec (take 10000000 (repeatedly #(rand-nth alphabet)))))
: (def S (take 100000 (repeatedly (fn [] (take (inc (int (* (rand) 100))) (

(共1页)

进入JobHunting版参与讨论

相关主题
● 最长递增子array的算法	● 寻找下一个回文数
● career cup book v4 9.7 题	● Maximum Sum of Increasing Sequence
● 问个算法题5	● 面试题count # of increasing subsequences of String求解
● 【一个BB公司问的字母排序的问题】	● Interleave Strings那个题目有O(n)时间　O(1)空间算法么？
● 这段LIS为啥崩溃？	● 发一个fb面经
● java没有指针真麻烦	● leetcode一题没看明白
● 发个Twitter的面试题	● 求解这个动态规划题
● 刚刚FB电面试完	● 大家帮忙解释一个 LeetCode DP (distinct subsequences)

相关话题的讨论汇总
话题: dp话题: string话题: si话题: int话题: boolean

#	版面	帖数(主题数)
-	全站	4871 (796)
1	Military	3777 (569)
2	Stock	341 (51)
3	Joke	117 (17)
4	History	116 (3)
5	Automobile	100 (9)
6	USANews	55 (9)
7	Midlife	45 (1)
8	Headline	41 (41)
9	Dreamer	33 (13)
10	FleaMarket	32 (20)
11	Living	30 (7)

boards

未名新帖统计// 7月16日

历史上的今天