HtonS opened this issue 5 years ago · 0 comments
Change host norm implementation to use thrust and split the vector into chunks small enough to fit on the gpu