/parallel-decoding

Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding"

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers