Pinned Repositories
ESimCSE
gsInfoNCE
infocse
VLP
chaochen99.github.io
my blog
TextSmoothing
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
LongBench
LongBench v2 and LongBench (ACL 2024)