xuzetao/ByteTransformer
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
C++Apache-2.0
Watchers
No one’s watching this repository yet.
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
C++Apache-2.0
No one’s watching this repository yet.