TheDataLeek/Python-LSA

Dynamic Document Loading

Closed this issue · 1 comments

Currently the code only really works with the jeopardy.csv dataset that I've tweaked to be in a certain format. Ideally we make it so that it can load arbitrary document sets without much specification/tweaking.

The current expected format for a document set is one document per line. This format is open for discussion.

Basically done.