wlin12/wang2vec

Training on large file

Opened this issue · 0 comments

sa-j commented

Hi there,

Is it possible to use wang2vec to train on a very large file (>70GB)? In gensim, this is made possible by the LineSentence iterator, which, for larger corpora, provides an iterable that streams the sentences directly from disk/network instead of loading everything into RAM. Is there a similar option available for wang2vec?
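
To illustrate the kind of streaming I mean, here is a minimal Python sketch of the idea. It is not gensim's actual implementation and has nothing to do with wang2vec's code; the class name StreamingSentences and the demo file are hypothetical, and it assumes one whitespace-tokenized sentence per line.

```python
import os
import tempfile


class StreamingSentences:
    """Hypothetical streaming reader: yields one tokenized sentence per line.

    Only the current line is held in memory, so the corpus size is
    limited by disk space rather than by RAM.
    """

    def __init__(self, path):
        self.path = path

    def __iter__(self):
        # Re-open the file on every pass so the corpus can be iterated
        # several times (for example, once per training epoch).
        with open(self.path, encoding="utf-8") as handle:
            for line in handle:
                tokens = line.split()
                if tokens:  # skip blank lines
                    yield tokens


# Tiny demo corpus standing in for the real (>70GB) file.
with tempfile.NamedTemporaryFile(
    "w", suffix=".txt", delete=False, encoding="utf-8"
) as tmp:
    tmp.write("the quick brown fox\n\njumps over the lazy dog\n")
    demo_path = tmp.name

sentences = StreamingSentences(demo_path)
first_pass = list(sentences)   # first full pass over the file
second_pass = list(sentences)  # a second pass re-reads from disk
os.remove(demo_path)
```

Because the object is an iterable rather than a one-shot generator, a trainer can make multiple passes over it without the whole corpus ever being loaded at once.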

Thanks!

@wlin12 @ftyers @sauravm8