zzzzzzJg's Stars
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
RUC-NLPIR/LLM4IR-Survey
This is the repo for the survey of LLM4IR.
shibing624/pycorrector
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
THUDM/WebGLM
WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
allenai/RL4LMs
A modular RL library to fine-tune language models to human preferences
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
IBM/Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
endgameinc/gym-malware
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
OpenMOSS/MOSS
An open-source tool-augmented conversational language model from Fudan University
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
THUDM/GLM
GLM (General Language Model)
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
anthropics/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
reorx/awesome-chatgpt-api
Curated list of apps and tools that not only use the new ChatGPT API, but also allow users to configure their own API keys, enabling free and on-demand usage of their own quota.
CyC2018/CS-Notes
:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计
Limmen/awesome-rl-for-cybersecurity
A curated list of resources dedicated to reinforcement learning applied to cyber security.
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
OI-wiki/OI-wiki
:star2: Wiki of OI / ICPC for everyone. (某大型游戏线上攻略,内含炫酷算术魔法)
MingchaoZhu/DeepLearning
Python for《Deep Learning》,该书为《深度学习》(花书) 数学推导、原理剖析与源码级别代码实现
TimeBreaker/MARL-papers-with-code
Multi-Agent Reinforcement Learning (MARL) papers with code
TimeBreaker/Multi-Agent-Reinforcement-Learning-papers
Multi-Agent Reinforcement Learning (MARL) papers
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.