/datacleanup

Python Notebook for a workshop at Convercon Ireland 2019. The title is How to Curate and NLP Dataset and is about a process to find errors in a dataset to improve training.

Primary LanguageJupyter Notebook

datacleanup

Python Notebook for a workshop at Convercon Ireland 2019. The title is How to Curate and NLP Dataset and is about a process to find errors in a dataset to improve training.

Please install jupyter Go to your command line and in the directory you download this code into type 'jupyter notebook' The main other libraries you will need are scikit-learn and pandas