google/magika

Split reference test file for features extraction in multiple smaller files

reyammer opened this issue · 0 comments

https://github.com/google/magika/blob/main/tests_data/features_extraction/reference.json.gz is too big and is creating issues in some scenarios. We need to either split it, or extract a subset of unittests that folks can use when porting magika features extraction to other languages.