Pinned Repositories
nanotron
Minimalistic large language model 3D-parallelism training
picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
linux-shell-in-c
LLaMA
Megatron-LM
Ongoing research training transformer models at scale
nanotron
Minimalistic large language model 3D-parallelism training
starcraft-ai
ai for starcraft
zzhhjjj's Repositories
zzhhjjj/nanotron
Minimalistic large language model 3D-parallelism training
zzhhjjj/starcraft-ai
ai for starcraft
zzhhjjj/lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
zzhhjjj/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
zzhhjjj/linux-shell-in-c
zzhhjjj/LLaMA
zzhhjjj/Megatron-LM
Ongoing research training transformer models at scale
zzhhjjj/RL-project
This is our RL project
zzhhjjj/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
zzhhjjj/Travel-advisor
zzhhjjj/Yelp-camp
zzhhjjj/zzhhjjj