End-to-end FP8 training

Question

xrsrke opened this issue a year ago · 1 comments

Notes

Write an FP8Tensor that inherits from torch.Tensor (just support type hints).
Write an FP8Linear that binds to TransformerEngine's FP8 kernel in the forward pass

TODO

Answer 1 · 2023-11-30T10:08:58.000Z

@xrsrke On it