/MachineLearning_NLP

Genre Identification on (a sub-set of) Gutenberg Corpus

Primary LanguageJupyter Notebook

MachineLearning_NLP

Genre Identification on (a sub-set of) Gutenberg Corpus

We have submitted the code as a Jupyter Notebook(.ipynb), as well as a python file (.py). Feature extraction took around 8-14 hours, so the text file 'featureset236' can be used directly for modeling. ATiML_Project_Report.pdf