To install:
- Clone repo
- Run the following (preferably in a virualenv):
python -m pip install -e path/to/repo/
Install requirements.txt
Download desired sent2vec weights here
Collect raw data with python build_table.py </path/to/datasets/dir>
- Builds a csv of column metadata and column types
Embed raw data with python3 embed.py </path/to/sent2vec_weights.bin>
Run models with python3 evaluate_models.py [<number of cores to use>]
- Set
use_small_data
manually in the code to switch between small and large data - Predictions saved to
results[_small]/predictions_<model_name>.csv
Score predictions with python3 score_results.py results[_small]/<filename>.csv