
Transformer-related optimization, including BERT and GPT

Primary language: C++. License: Apache-2.0.

FasterTransformer

This repository is based on FasterTransformer and adapts it to GLM-130B. For FasterTransformer itself, please read the original project.

Quick Start

Read inference-with-fastertransformer.