Pinned Repositories
BERT-EMD
FastBERT
The score code of FastBERT (ACL2020)
Knowledge_distillation_via_TF2.0
The codes for recent knowledge distillation algorithms and benchmark results via TF2.0 low-level API
libtorch_tokenizer
BERT Tokenizer in C++
matxscript
A high-performance, extensible Python AOT compiler.
model_compression
Implementation of model compression with knowledge distilling method.
models
Models and examples built with TensorFlow
Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
R-Drop
rabbit
Deep learning models trained to correct input errors in short, message-like text
OftenDream's Repositories
OftenDream/BERT-EMD
OftenDream/FastBERT
The score code of FastBERT (ACL2020)
OftenDream/Knowledge_distillation_via_TF2.0
The codes for recent knowledge distillation algorithms and benchmark results via TF2.0 low-level API
OftenDream/libtorch_tokenizer
BERT Tokenizer in C++
OftenDream/matxscript
A high-performance, extensible Python AOT compiler.
OftenDream/model_compression
Implementation of model compression with knowledge distilling method.
OftenDream/models
Models and examples built with TensorFlow
OftenDream/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
OftenDream/R-Drop
OftenDream/rabbit
Deep learning models trained to correct input errors in short, message-like text
OftenDream/SimCSE
EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings
OftenDream/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.