h**o Posts: 548 | 1 How do you define the hash function? If it has to hash the content of a
document, would the hash function be something like:
hash = (A[length of content] + 131*(... + 131*(A[2] + 131*(A[1] + 131*A[0])))) mod 10G ??
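If the document content is very long, won't the hash get very complicated?
Setting aside splitting, what should the hash be taken mod? Why do people online say mod 10G?
And then what? Do you compare, character by character, the contents of the different
documents that land in the linked list of the same hash index? |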
|
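For what it's worth, here is a rough sketch of how the formula and the bucket comparison asked about above could fit together. It is only an illustration under assumptions: "10G" is read as 10^10, A[i] is taken to be the i-th byte of the document, and the names `poly_hash` and `DocumentIndex` are made up for this example. The nested formula is evaluated one byte at a time (Horner's rule) and reduced mod M at every step, so a long document only costs one multiply, one add and one mod per byte.

```python
from collections import defaultdict

BASE = 131
MOD = 10 * 10**9  # assumption: "10G" in the post means 10^10


def poly_hash(data: bytes, base: int = BASE, mod: int = MOD) -> int:
    """Evaluate A[n] + base*(... + base*(A[1] + base*A[0])) mod `mod`."""
    h = 0
    for byte in data:
        # Reduce at every step so h never grows beyond `mod`.
        h = (h * base + byte) % mod
    return h


class DocumentIndex:
    """Hypothetical exact-duplicate detector: hash first, compare content second."""

    def __init__(self, mod: int = MOD):
        self.mod = mod
        self.buckets = defaultdict(list)  # hash value -> documents with that hash

    def add(self, doc: bytes) -> bool:
        """Return True if `doc` is new, False if an identical document was seen."""
        bucket = self.buckets[poly_hash(doc, mod=self.mod)]
        # Only documents that share the same hash value are compared in full.
        for existing in bucket:
            if existing == doc:
                return False
        bucket.append(doc)
        return True


if __name__ == "__main__":
    index = DocumentIndex()
    print(index.add(b"hello world"))  # first time this content is seen
    print(index.add(b"hello world"))  # identical content, found in its bucket
```

The modulus only controls how many distinct hash values (buckets) there are. A larger value means fewer unrelated documents share a bucket, so the full content comparison runs less often, but it is still needed because different documents can hash to the same value.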