csholder

csholder's Stars

lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python 37.1k 353 1.8k 4.6k
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Language: Jupyter Notebook 7.7k 107 290481
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Language: Python 6.1k 75 5391k
google-deepmind/alphageometry
Language: Python 4.2k 53 125470
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
4.1k 239 9729
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language: Python 2.8k 32 158263
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Language: Python 2.8k 24 311266
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Language: Jupyter Notebook 2.6k 39 34131
XiaoxinHe/Awesome-Graph-LLM
A collection of AWESOME things about Graph-Related LLMs.
1.8k 46 16131
openai/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
Language: Python 1.7k 117 17103
google-research/FLAN
Language: Python 1.5k 32 75156
dandelionsllm/pandallm
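Panda is an open-source overseas Chinese large language model project launched in May 2023. It is dedicated to exploring the entire technology stack in the era of large models and aims to promote innovation and collaboration in Chinese natural language processing.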
Language: Python 1.1k 38 3490
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
Language: Python 899 12 62143
GanjinZero/RRHF
[NIPS2023] RRHF & Wombat
Language: Python 799 10 4949
dvlab-research/LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
Language: Python 745 14 10944
tomhartke/knowledge-graph-from-GPT
Using GPT to organize and access information, and generate questions. Long term goal is to make an agent-like research assistant.
Language: Jupyter Notebook 657 15 055
infer-actively/pymdp
A Python implementation of active inference for Markov Decision Processes
Language: Python 478 32 4595
Yui010206/SeViLA
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
Language: Python 178 3 2722
xudejing/video-question-answering
Video Question Answering via Gradually Refined Attention over Appearance and Motion
Language: Python 154 4 027
qhduan/cn-chat-arxiv
Language: Python 148 4 315
YifeiZhou02/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
Language: Python 109 5 1713
Leezekun/Directional-Stimulus-Prompting
[NeurIPS 2023] Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting"
Language: Python 104 2 59
DeepGraphLearning/DiffPack
Implementation of DiffPack: A Torsional Diffusion Model for Autoregressive Protein Side-Chain Packing
Language: Python 75 8 116
fangleai/Implicit-LVM
This code repository presents the PyTorch implementation of the paper “Implicit Deep Latent Variable Models for Text Generation” (EMNLP 2019).
Language: OpenEdge ABL 55 1 49
PersistenceForever/Neural-Question-Generation-Survey-List
A comprehensive overview of neural question generation across diverse input formats.
49 2 01
orybkin/video-gcp
Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"
Language: Python 44 5 47
TianHongZXY/CoRe
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models
Language: Python 43 1 56
iwangjian/Color4Dial
Code and data for "Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue" (ACL Findings 2023).
Language: Python 21 2 42
yl3800/TranSTR
Language: Python 11 2 90
ryanshea10/personachat_offline_rl
Language: Python 4 1 31