About batch processing for main architecture RNN

Question

About batch processing for main architecture RNN

Opened this issue a year ago · 2 comments

Hello,
Thank you for open-sourcing this repository!
If the main architecture is RNN, how should I implement batch processing?

Answer 1 · 2023-07-07T15:22:20.000Z

Could you elaborate about what do you mean by batch processing? By default, in hyperlight, a batch of data uses the same hypernetwork input and thus the same weights. To use multiple sets of weights within the same batch, the recommended way to do it is using gradient accumulation.

Answer 2 · 2023-07-09T02:48:19.000Z

Thank you for your reply! Ultimately, I have resolved my question regarding batch processing.