See this paper: https://aclanthology.org/2023.findings-acl.426.pdf
And this article: https://backdrifting.net/post/068_text_classification_gzip
abstractqqq/gzip_topic_classifier_polars
A Polars Implementation of the GZIP Topic Classifier
Jupyter Notebook
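
For readers who have not opened the paper linked above, here is a minimal sketch of the general idea behind a gzip-based classifier: compare texts by normalized compression distance (NCD) and take a k-nearest-neighbour vote. It uses only the Python standard library and is not the Polars code from this repository. The function names (`clen`, `ncd`, `classify`), the space used to join two texts, and the default `k=3` are illustrative assumptions.

```python
import gzip
from collections import Counter


def clen(text: str) -> int:
    """Length in bytes of the gzip-compressed UTF-8 encoding of `text`."""
    return len(gzip.compress(text.encode("utf-8")))


def ncd(a: str, b: str) -> float:
    """Normalized compression distance between two strings.

    Assumption: the two texts are joined with a single space before
    compressing the concatenation.
    """
    ca, cb = clen(a), clen(b)
    cab = clen(a + " " + b)
    return (cab - min(ca, cb)) / max(ca, cb)


def classify(query: str, train: list[tuple[str, str]], k: int = 3) -> str:
    """Label `query` by majority vote among its k nearest training texts.

    `train` is a list of (text, label) pairs (a hypothetical layout,
    chosen only for this sketch).
    """
    nearest = sorted(train, key=lambda pair: ncd(query, pair[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    train = [
        ("the quick brown fox jumps over the lazy dog near the river bank at dawn", "animals"),
        ("stock markets fell sharply as investors reacted to rising interest rates", "finance"),
    ]
    print(classify("a lazy dog sleeps by the river", train, k=1))
```

Each query is compressed together with every training text, so the cost grows with the product of the query count and the training set size. That pairwise cost is presumably part of the motivation for a DataFrame-based implementation.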