de-duplication
There are 10 repositories under de-duplication topic.
ChenghaoMou/text-dedup
All-in-one text de-duplication
saltudelft/CD4Py
CD4Py: Code De-Duplication for Python
ktock/container-bootfs
Container image converter aiming to minimize image size and speed up boot time dramatically with block-level de-dupliction and lazy-pull technology.
akki744/autosuggest-preprocessor
Preprocesses the query logs which can be used by suggesters like Most Popular Suggester (MPS).
hamedrnik/dedup_signature
A collection of algorithms to generate a signature/fingerprint/hash in order to be used for detecting duplicate/near duplicate documents.
cheefoo/centos
Repository contains java application code to stream records to a Kinesis Stream, consume , de-duplicate and strictly order records and display records on a dashboard
hariohmprasath/events-streamlined-with-dedups
Efficient Event Streamlining and Dynamic De-Duplication Across Message Brokers - A Technology-Agnostic approach
Dreffed/misc
A misc project of python based function to track, survey and manage files from mutiple systems
ananthrn/Duplicrypt
A secure image storage application written in Python
thc2cat/go-dedup
portable file de-duplication tool