hanliu9574/TurboTransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
C++NOASSERTION
Stargazers
No one’s star this repository yet.
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
C++NOASSERTION
No one’s star this repository yet.