WhisperT's Stars
deepseek-ai/DeepSeek-V3
deepseek-ai/DeepSeek-R1
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
huggingface/open-r1
Fully open reproduction of DeepSeek-R1
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
naklecha/llama3-from-scratch
llama3 implementation, one matrix multiplication at a time
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs
Jiayi-Pan/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
InternLM/InternLM
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
xiaoyaDev/xiaoya-alist
Companion resources and tooling for Xiaoya Alist
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
wainshine/Chinese-Names-Corpus
A corpus of Chinese personal names and a name generator: Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Useful for Chinese word segmentation and person-name entity recognition.
modelscope/data-juicer
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
hkust-nlp/simpleRL-reason
A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data.
CLUEbenchmark/SuperCLUE
SuperCLUE: A Comprehensive Benchmark for Chinese General-Purpose Foundation Models
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
PRIME-RL/PRIME
Scalable RL solution for advanced reasoning of language models
RLHFlow/RLHF-Reward-Modeling
Recipes for training reward models for RLHF.
SarvagyaVaish/FlappyBirdRL
Flappy Bird hack using Reinforcement Learning
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
IEIT-Yuan/Yuan-2.0
Yuan 2.0 Large Language Model
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
OpenBMB/Eurus
RLHF-V/RLAIF-V
[CVPR'25] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
apachecn/stanford-cs234-notes-zh
Chinese lecture notes for Stanford CS234: Reinforcement Learning
yuyq96/TextHawk
Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
RLHFlow/Directional-Preference-Alignment
Directional Preference Alignment
Tlntin/qwen-ascend-llm