datatrek
some code for data analysis
datatrek.make
a simple make like system: see https://github.com/luoq/avito-duplicate-ads-detection/blob/master/data/corpus_based.py for a complex example
datatrek.sklearn_addon
some transformer and estimator in sklearn style