/space-efficient-bert

Code repository for the research paper "Space Efficient Transformer Neural Network"

Primary language: C++
License: GNU Affero General Public License v3.0 (AGPL-3.0)

Space Efficient Transformer Neural Network Implementation

This is the code repository for the research paper "Space Efficient Transformer Neural Network". The code trains and runs inference with a memory-optimized implementation of the BERT transformer model.