Lossless compression is a class of data compression that allows the original data to be perfectly reconstructed from the compressed data with no loss of information.
This repo contains some implementations of popular compression algorithms, focused on correctness and readability.
enwiki9 from the Hutter prize
- LZSS
- Dictionary
- Huffman
- Arithmetic
- Dynamic Markov
- python runtime is included
- compression ratio is source_size / (compressed + decode.py + model_weights)
- integrity is compared with diff
./eval_sh <compression_algo_directory>
- Detailed writeup on each algorithm
- Or provide some good sources