Pinned Repositories
awesome-data-deduplication
An awesome list of data deduplication use cases, papers, tools, and methods.
chenghaomou.github.io
Personal Blog
deduplicate-text-datasets
A modified version of Google's tool for plain text files
embeddings
Zero-vocab or low-vocab embeddings
karafuru
Traditional Chinese colors in your terminal
pytorch-pQRNN
Implementation of pQRNN in PyTorch
simhash
Simhash in C++
text-dedup
All-in-one text de-duplication
touchbar-lyric
Show synced lyrics in the Touch Bar with BetterTouchTool and NetEase APIs
transformer-pointer-generator
Transformer with pointer generator for machine translation
ChenghaoMou's Repositories
ChenghaoMou/transformer-pointer-generator
Transformer with pointer generator for machine translation
ChenghaoMou/NodeRank
A PageRank-like algorithm with more focus on temporal attributes.