/Lazy-Deduplication

Data deduplication, data reduction, capacity optimized storage - is a specialized data compression technique for eliminating redundant data in the storage file system. It is often designed to make network backups to disk faster and more economical. It removes redundant data segments to compress data into a highly compact form and makes it economical to store backups on disk drive instead of tape drive. The technique identifies unique chunks of data, or byte patterns and stores them during the time of analysis. The chunks of new data are hashed and compared with the existing hashes of the other chunks to determine if there is a match. This project is aimed at creating a data deduplication file system which performs post-processing (lazy) analysis on the data stored into the file system

Primary LanguageC

Stargazers