SimText
基于SimHash的适用于相似短文本监测
ps: 部分代码来自于:https://github.com/commoncrawl/commoncrawl.git
commoncrawl/src/main/java/org/commoncrawl/util/shared
基于SimHash的适用于相似短文本监测
ps: 部分代码来自于:https://github.com/commoncrawl/commoncrawl.git
commoncrawl/src/main/java/org/commoncrawl/util/shared