liya2001

liya2001's Stars

MoonshotAI/MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
Language: Python · 1.7k stars · 101 forks
deepseek-ai/DeepSeek-V3
Language: Python · 94.3k stars · 15.3k forks
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
Language: Python · 8.1k stars · 728 forks
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language: Python · 10.2k stars · 1.8k forks
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language: Python · 5.9k stars · 582 forks
DSXiangLi/DecryptPrompt
A summary of Prompt & LLM papers, open-source data & models, and AIGC applications
3k stars · 293 forks
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Language: Python · 142k stars · 28.4k forks
boyu-ai/Hands-on-RL
https://hrl.boyuai.com/
Language: Jupyter Notebook · 3.2k stars · 621 forks
meta-llama/llama-cookbook
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using the Llama model family and how to use the models on various provider services.
Language: Jupyter Notebook · 16.5k stars · 2.4k forks
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language: Python · 2.5k stars · 202 forks
OpenLMLab/MOSS-RLHF
Secrets of RLHF in Large Language Models Part I: PPO
Language: Python · 1.3k stars · 97 forks
P1ayer-1/chatlogs.net-scraper
Language: Python · 4 stars
openlm-research/open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
7.5k stars · 396 forks
nlpxucan/WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
Language: Python · 9.4k stars · 730 forks
SeedV/generative-ai-roadmap
The roadmap of generative AI: use cases and applications
610 stars · 76 forks
kyegomez/tree-of-thoughts
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
Language: Python · 4.5k stars · 365 forks
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Language: Jupyter Notebook · 2.7k stars · 136 forks
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Language: Python · 4.7k stars · 355 forks
gururise/AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
Language: Python · 1.5k stars · 151 forks
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python · 38.2k stars · 4.7k forks
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python · 29.9k stars · 4.1k forks
esbatmop/MNBVC
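MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wikis, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.
3.8k stars · 265 forks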
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Language: Python · 3k stars · 600 forks
davinci1010/pinduoduo_backdoor
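Analysis of the privilege-escalation code embedded in the Pinduoduo APK and of its dynamically delivered dex files
5.4k stars · 1.9k forks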
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Language: Python · 7.7k stars · 603 forks
HarderThenHarder/transformers_tasks
⭐️ NLP algorithms with the transformers library. Supports text classification, text generation, information extraction, text matching, RLHF, SFT, etc.
Language: Jupyter Notebook · 2.3k stars · 397 forks
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language: Python · 40.7k stars · 4.5k forks
shibing624/text2vec
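text2vec, text to vector. A text vector representation tool that converts text into vector matrices; it implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.
Language: Python · 4.7k stars · 407 forks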
Kobaayyy/Awesome-CVPR2025-CVPR2024-CVPR2021-CVPR2020-Low-Level-Vision
A Collection of Papers and Codes for CVPR2025/CVPR2024/CVPR2021/CVPR2020 Low Level Vision
1.2k stars · 137 forks
megvii-research/NAFNet
The state-of-the-art image restoration model without nonlinear activation functions.
Language: Python · 2.4k stars · 308 forks