zzzzzzJg

zzzzzzJg's Stars

Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.4k stars · 156 forks
RUC-NLPIR/LLM4IR-Survey
This is the repo for the survey of LLM4IR.
393 stars · 33 forks
shibing624/pycorrector
pycorrector is a toolkit for text error correction. It applies Kenlm, T5, MacBERT, ChatGLM3, LLaMA, and other models to correction tasks and works out of the box.
Language: Python · 5.5k stars · 1.1k forks
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)
Language: HTML · 7.8k stars · 753 forks
THUDM/WebGLM
WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
Language: Python · 1.6k stars · 133 forks
shibing624/MedicalGPT
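MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
Language: Python · 3.2k stars · 483 forks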
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
Language: Python · 15.7k stars · 1.9k forks
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
Language: Python · 2.2k stars · 190 forks
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.2k stars · 201 forks
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
17.2k stars · 1.4k forks
IBM/Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
Language: Python · 1.1k stars · 86 forks
endgameinc/gym-malware
Language: Python · 607 stars · 164 forks
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language: Python · 4.4k stars · 469 forks
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
Language: Python · 11.9k stars · 1.1k forks
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language: Python · 34.6k stars · 4k forks
THUDM/GLM
GLM (General Language Model)
Language: Python · 3.2k stars · 321 forks
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Language: Python · 7.7k stars · 608 forks
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Language: Python · 40.4k stars · 5.2k forks
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Language: Python · 19.8k stars · 2.5k forks
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language: Python · 132k stars · 26.2k forks
anthropics/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
1.6k stars · 119 forks
reorx/awesome-chatgpt-api
Curated list of apps and tools that not only use the new ChatGPT API, but also allow users to configure their own API keys, enabling free and on-demand usage of their own quota.
Language: Python · 5.9k stars · 377 forks
CyC2018/CS-Notes
:books: Essential fundamentals for technical interviews: Leetcode, operating systems, computer networks, and system design
175k stars · 50.8k forks
Limmen/awesome-rl-for-cybersecurity
A curated list of resources dedicated to reinforcement learning applied to cyber security.
723 stars · 107 forks
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
4k stars · 719 forks
OI-wiki/OI-wiki
:star2: Wiki of OI / ICPC for everyone. (An online strategy guide for a certain large-scale game, featuring dazzling arithmetic magic)
Language: TypeScript · 20.4k stars · 3.8k forks
MingchaoZhu/DeepLearning
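Python for "Deep Learning": mathematical derivations, analysis of principles, and source-level code implementations for the book Deep Learning (known in Chinese as the "flower book")
Language: Python · 6.3k stars · 1.3k forks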
TimeBreaker/MARL-papers-with-code
Multi-Agent Reinforcement Learning (MARL) papers with code
291 stars · 37 forks
TimeBreaker/Multi-Agent-Reinforcement-Learning-papers
Multi-Agent Reinforcement Learning (MARL) papers
194 stars · 33 forks
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language: Python · 32.9k stars · 5.6k forks