Debate notes
Hellisotherpeople opened this issue · 5 comments
I have a dataset consisting of a large number of documents compiled by competitive debaters as evidence, alongside extractive and abstractive summaries. Can the documents portion be included in the Pile? I have ~180K documents.
Hi! Approximately how large is your dataset (in GB, not documents)? And what language(s) is it in?
Slightly less than 1GB, English
That sounds great! Thanks for bringing it to our attention. If you’d like to contribute it, feel free to submit a PR. Otherwise I’ll put it on the list of things we need to get around to doing.
Is this being included in a future release of the Pile? I haven't had much time to spend on this recently, but I can try to get it in very soon if there is some kind of time limit...
We are not currently working on a Pile V2 or similar. So no.