This is needed for inference performance, since transferring the weights from memory is usually the limiting factor.
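For a rough sense of why weight transfers dominate, here is a back-of-the-envelope sketch. It assumes every weight is read from memory once per generated token and ignores compute, caches, and activations. The function name, parameter count, and bandwidth figure are all hypothetical and chosen only for illustration, not measurements from this project.

```python
def max_tokens_per_second(n_params, bytes_per_weight, bandwidth_bytes_per_s):
    """Upper bound on decode speed if each weight is read once per token.

    Assumption: generation is purely memory-bandwidth bound.
    """
    weight_bytes = n_params * bytes_per_weight
    return bandwidth_bytes_per_s / weight_bytes


# Hypothetical example values (not taken from this issue).
N_PARAMS = 7e9        # model with 7 billion weights
BANDWIDTH = 100e9     # 100 GB/s of memory bandwidth

for label, bytes_per_weight in [("32-bit", 4), ("16-bit", 2), ("8-bit", 1)]:
    bound = max_tokens_per_second(N_PARAMS, bytes_per_weight, BANDWIDTH)
    print(f"{label} weights: at most {bound:.1f} tokens/s")
```

Under these assumptions the bound scales inversely with the number of bytes moved per weight, so halving the weight size roughly doubles the achievable token rate.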
This is now fixed.