A collection of image deduplication repositories
listdedup.py is to use with repo https://github.com/knjcode/imgdupes to move image files to folders
repository | language | dependencies | comments |
https://github.com/nmante/image_deduplication | python | hashlib,Simhash | Using hashlib/simhash to find exactly/near duplicates. |
https://github.com/knjcode/imgdupes | python | ImageHash | using Python ImageHash Lib to deduplicate images |
https://github.com/leonsim/simhash | python | Simhash | This is a Python implementation of Simhash.(http://www.wwwconference.org/www2007/papers/paper215.pdf) |
https://github.com/philipbl/duplicate-images | python | phash | Using phash and MongoDB to find duplicates on large sets(not easy to configure). |