/elasticsearch-deduplication-python

A Python approach for removing duplicate documents from Elasticsearch

Primary language: Python

elasticsearch-deduplication-python

A Python script that fetches all documents from Elasticsearch, detects duplicates among them, and deletes the duplicate documents.
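The fetch, detect, and delete steps described above could look roughly like the sketch below. It is not the repository's actual script. It assumes that two documents count as duplicates when a chosen set of fields has identical values, and that the first document seen is the one to keep. The function names (`fingerprint`, `find_duplicate_ids`, `dedupe_index`) and the `fields` parameter are hypothetical. The Elasticsearch part assumes the official `elasticsearch` Python client and its `helpers.scan` and `helpers.bulk` functions.

```python
import hashlib
import json
from collections import defaultdict


def fingerprint(source, fields):
    """Hash the chosen fields of a document body into a stable key."""
    subset = {name: source.get(name) for name in fields}
    encoded = json.dumps(subset, sort_keys=True, default=str).encode("utf-8")
    return hashlib.sha256(encoded).hexdigest()


def find_duplicate_ids(hits, fields):
    """Return the _id of every hit whose fingerprint was already seen."""
    ids_by_fingerprint = defaultdict(list)
    for hit in hits:
        key = fingerprint(hit["_source"], fields)
        ids_by_fingerprint[key].append(hit["_id"])
    duplicates = []
    for ids in ids_by_fingerprint.values():
        duplicates.extend(ids[1:])  # assumption: keep the first document seen
    return duplicates


def dedupe_index(es, index, fields):
    """Scan a whole index, then bulk-delete the duplicates that were found.

    `es` is assumed to be an elasticsearch.Elasticsearch client instance.
    """
    from elasticsearch import helpers  # imported lazily; needs the client package

    # Fetch: stream every document in the index.
    hits = helpers.scan(es, index=index, query={"query": {"match_all": {}}})
    # Detect: group by fingerprint and collect all but the first of each group.
    duplicate_ids = find_duplicate_ids(hits, fields)
    # Delete: send one delete action per duplicate through the bulk helper.
    actions = (
        {"_op_type": "delete", "_index": index, "_id": doc_id}
        for doc_id in duplicate_ids
    )
    helpers.bulk(es, actions)
    return duplicate_ids
```

The detection step can be tried without a cluster by passing hand-written hits, for example `find_duplicate_ids([{"_id": "1", "_source": {"email": "a@x.io"}}, {"_id": "2", "_source": {"email": "a@x.io"}}], ["email"])`. Keeping only IDs grouped by hash, rather than whole documents, is one way to limit memory use on large indices. Running against a copy of the index first is advisable, since the deletes are not reversible.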