cl011

cl011's Stars

MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Language:MATLAB3.7k497
zhangchuheng123/Reinforcement-Implementation
Implementation of benchmark RL algorithms
Language:Python45981
ZigeW/data_management_LLM
Collection of training data management explorations for large language models
27428
charent/ChatLM-mini-Chinese
中文对话0.2B小模型（ChatLM-Chinese-0.2B），开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调，给出三元组信息抽取微调示例。
Language:Python1.2k145
AI-Study-Han/Zero-Chatgpt
从0开始，将chatgpt的技术路线跑一遍。
Language:Python13521
DLLXW/baby-llama2-chinese
用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库；24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.
Language:Python2.5k306
Tongjilibo/build_MiniLLM_from_scratch
从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)
Language:Python31441
lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models，高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Language:Python4.8k472
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！
Language:Python2.7k168
cbamls/AI_Tutorial
精选机器学习，NLP，图像识别，深度学习等人工智能领域学习资料，搜索，推荐，广告系统架构及算法技术资料整理。算法大牛笔记汇总
3.1k479
tongyx361/Awesome-LLM-Research
Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers get the gist as quickly as possible.
34
yule-BUAA/MergeLM
Codebase for Merging Language Models (ICML 2024)
Language:Python75844
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Language:Jupyter Notebook7k445
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language:Python10.3k812
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python10.3k2.3k
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python168k44.3k
hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
Language:Python3.7k471
yuanzhoulvpi2017/zero_nlp
中文nlp解决方案(大模型、数据、模型、训练、推理)
Language:Jupyter Notebook2.9k359
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。
15.6k1.4k
meta-llama/llama
Inference code for Llama models
Language:Python56.1k9.5k
Tencent/NeuralNLP-NeuralClassifier
An Open-source Neural Hierarchical Multi-label Text Classification Toolkit
Language:Python1.8k404
fuzhengwei/small-spring
🌱《 Spring 手撸专栏》，本专栏以 Spring 源码学习为目的，通过手写简化版 Spring 框架，了解 Spring 核心原理。在手写的过程中会简化 Spring 源码，摘取整体框架中的核心逻辑，简化代码实现过程，保留核心功能，例如：IOC、AOP、Bean生命周期、上下文、作用域、资源处理等内容实现。
Language:Java4.7k1.4k
Skylark0924/Machine-Learning-is-ALL-You-Need
🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!
Language:Python38892
liu-nlper/dialogue-utterance-rewriter
ACL 2019论文复现：Improving Multi-turn Dialogue Modelling with Utterance ReWriter
Language:Python12923
probml/pyprobml
Python code for "Probabilistic Machine learning" book by Kevin Murphy
Language:Jupyter Notebook6.5k1.5k
PaddlePaddle/Research
novel deep learning research works with PaddlePaddle
Language:Python1.7k790
alibaba/AliceMind
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Language:Python2k291
AdeDZY/DeepCT
DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.
Language:Python31246
budzianowski/multiwoz
Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)
Language:Python856199
PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Language:Python12.1k2.9k