BaileyWei's Stars
yuanzhoulvpi2017/zero_nlp
Chinese NLP solutions (large models, data, models, training, inference)
xlang-ai/UnifiedSKG
[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
tloen/llama-int8
Quantized inference code for LLaMA models
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
pyg-team/pytorch_geometric
Graph Neural Network Library for PyTorch
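A minimal usage sketch for orientation, assuming torch and torch_geometric are installed; the two-node graph and feature sizes below are made up purely for illustration:

```python
# Single GCN layer forward pass on a tiny made-up graph (2 nodes, 1 undirected edge).
import torch
from torch_geometric.data import Data
from torch_geometric.nn import GCNConv

edge_index = torch.tensor([[0, 1], [1, 0]], dtype=torch.long)  # edge stored in both directions
x = torch.randn(2, 8)                                          # 2 nodes, 8 features each
data = Data(x=x, edge_index=edge_index)

conv = GCNConv(in_channels=8, out_channels=4)
out = conv(data.x, data.edge_index)                            # shape: [2, 4]
print(out.shape)
```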
shawwn/llama-dl
High-speed download of LLaMA, Facebook's 65B parameter GPT model
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
anthropics/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
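A minimal loading sketch, assuming the data is mirrored on the Hugging Face Hub under the "Anthropic/hh-rlhf" dataset id (an assumption) and that the datasets library is installed:

```python
# Load the helpful/harmless preference pairs; each record pairs a preferred and a
# rejected dialogue continuation ("chosen" / "rejected" fields are assumed here).
from datasets import load_dataset

ds = load_dataset("Anthropic/hh-rlhf", split="train")
print(ds[0]["chosen"][:200])
```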
meta-llama/llama
Inference code for Llama models
microsoft/torchscale
Foundation Architecture for (M)LLMs
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
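A minimal sketch of wrapping a plain PyTorch model with the DeepSpeed engine; the config values are illustrative assumptions, and such scripts are normally launched with the deepspeed launcher rather than plain python:

```python
# Wrap a toy model with DeepSpeed; config keys below are a minimal assumed example.
import torch
import deepspeed

model = torch.nn.Linear(128, 10)
ds_config = {
    "train_batch_size": 8,
    "optimizer": {"type": "Adam", "params": {"lr": 1e-3}},
}

# Returns (engine, optimizer, dataloader, lr_scheduler); the engine handles
# distributed training details such as gradient accumulation and ZeRO partitioning.
engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
```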
codemayq/chinese-chatbot-corpus
A collection of public Chinese chat corpora
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
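A minimal sketch using the high-level pipeline API; the default sentiment-analysis model is downloaded on first use, and the printed score is illustrative:

```python
# Run sentiment analysis with the pipeline helper from the transformers library.
from transformers import pipeline

classifier = pipeline("sentiment-analysis")
print(classifier("Transformers makes state-of-the-art models easy to use."))
# e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```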
OpenBMB/BMTrain
Efficient Training (including pre-training and fine-tuning) for Big Models
JonasGeiping/cramming
Cramming the training of a (BERT-type) language model into limited compute.
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
lucidrains/electra-pytorch
A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in PyTorch
andy-yangz/easy_bert_pretrain
A very easy BERT pretraining process using the tokenizers and transformers repos
cosmoquester/transformers-bart-pretrain
Script to pre-train Hugging Face Transformers BART with TensorFlow 2
Tencent/TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by the RWKV (100% RNN) language model, and it is open source.
Spico197/DocEE
🕹️ A toolkit for document-level event extraction, containing some SOTA model implementations.
f/awesome-chatgpt-prompts
A curated collection of ChatGPT prompts to help you use ChatGPT better.
dyweb/awesome-resume-for-chinese
:page_facing_up: A collection of résumé templates suited to Chinese (LaTeX, HTML/JS, and so on), maintained by @hoochanlon
arasgungore/arasgungore-CV
My curriculum vitae (CV) written using LaTeX.
zhanglj37/Tutorial-on-PhD-Application
Tutorial on PhD Application
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embeddings.