This repository contains various helper files that can be utilized for data processing tasks.
CSV-to-Corpus.ipynb: Convert a csv dataset into a corpus file after tokenization. The corpus file can be used to train models. The separator used is whitespace but that can be modified as requird.
Author: Rojina Deuja
Last Modified: 1/7/2010