Online learning
sademakn opened this issue · 5 comments
Is there any way to update the model on the fly with new data?
Hi @sademakn !
The embeddings are stable, which means, you can re-run on (old + new) data, and the results will be aligned with the old embeddings, for any downstream purposes.
Currently on-the-fly updates are not implemented, but what would be required is as follows:
- incremental creation of the sparse transition matrix,
- periodic recalculation of the embeddings.
Thanks for your well-organized answer,
If someday the on-the-fly updates will be available, that would help a lot with large-scale and non-stop data streams. In my case, I use a cronjob to train my model on whole data(old + new) and each time it takes hours to build a new model and during the training also I'm receiving a bunch of new data!
Is there any hope to achieve on-the-fly updates anytime soon?
For now there is no plan but I will come back to you next week because I have to check the pipeline. We'll see what we can do.