facebookresearch/SemDeDup
Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).
PythonNOASSERTION
Stargazers
- antx-code
- aqweteddyNTU / Academia Sinica
- bigheiniu
- ctscsu
- data4sci
- densechen
- developer0hyeMarkAny
- dumpmemory
- evdcush
- FallingStar624THUNDER, GSDS, SNU
- flrngel@Ainbr
- fly51flyPRIS
- gachzhan
- GitHub30Osaka, Japan
- hongjianBaidu
- iViolinSoloUniversity of Warwick
- jvsstertz
- KayneWestlastfrontiertechnologies.com
- licongguanBeijing Jiaotong University
- lihuibngbytedance&baidu
- looputHuazhong University of Science and Technology
- Mrwangkedong
- ncoop57
- nehasriknUniversity of Maryland, College Park
- qmdnlsSeoul, Korea
- scottwey
- seshurajup@dolcera
- slymaneOregon State University
- Sunburst0614Beihang university
- theblackcat102iKala
- tt6746690MIT
- vladserkoffMoscow
- worldveil
- wyshiNew York
- ypwang61University of Washington
- zhchbinChina