tangjicheng1/ByteTransformer
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
C++, Apache-2.0