A wrapper to make preprocessing generic and easy, the code is developing
example: turn on a jupyter notebook, and type: %run main_flow.py
you can then see example usage of that and get the document-feature matrix X and on-hot vector y
details can be set in config.py
CAUTION: MAke sure you can enough free memory for the load function! TODO: add documentations