de-duplication

There are 10 repositories under the de-duplication topic.

  • ChenghaoMou/text-dedup

    All-in-one text de-duplication

    Language: Python 62347171
  • saltudelft/CD4Py

    CD4Py: Code De-Duplication for Python

    Language: Python 22903
  • ktock/container-bootfs

    Container image converter aiming to minimize image size and dramatically speed up boot time with block-level de-duplication and lazy-pull technology.

    Language: C 20632
  • akki744/autosuggest-preprocessor

    Preprocesses query logs for use by suggesters such as Most Popular Suggester (MPS).

    Language: Python 7101
  • hamedrnik/dedup_signature

    A collection of algorithms that generate a signature/fingerprint/hash for detecting duplicate and near-duplicate documents.

    Language: Rust 7101
  • cheefoo/centos

    Repository contains Java application code to stream records to a Kinesis Stream, consume, de-duplicate, and strictly order the records, and display them on a dashboard.

    Language: Java 1103
  • hariohmprasath/events-streamlined-with-dedups

    Efficient Event Streamlining and Dynamic De-Duplication Across Message Brokers - A Technology-Agnostic approach

    Language: TypeScript 130
  • Dreffed/misc

    A misc project of Python-based functions to track, survey, and manage files from multiple systems

    Language: Python 0200
  • ananthrn/Duplicrypt

    A secure image storage application written in Python

    Language: Python 401
  • thc2cat/go-dedup

    Portable file de-duplication tool

    Language: Go 10