csholder's Stars
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
microsoft/DeepSpeedExamples
Example models using DeepSpeed
google-deepmind/alphageometry
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
XiaoxinHe/Awesome-Graph-LLM
A collection of AWESOME things about Graph-Related LLMs.
openai/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
google-research/FLAN
dandelionsllm/pandallm
Panda项目是于2023年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
GanjinZero/RRHF
[NIPS2023] RRHF & Wombat
dvlab-research/LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
tomhartke/knowledge-graph-from-GPT
Using GPT to organize and access information, and generate questions. Long term goal is to make an agent-like research assistant.
infer-actively/pymdp
A Python implementation of active inference for Markov Decision Processes
Yui010206/SeViLA
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
xudejing/video-question-answering
Video Question Answering via Gradually Refined Attention over Appearance and Motion
qhduan/cn-chat-arxiv
YifeiZhou02/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
Leezekun/Directional-Stimulus-Prompting
[NeurIPS 2023] Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting"
DeepGraphLearning/DiffPack
Implementation of DiffPack: A Torsional Diffusion Model for Autoregressive Protein Side-Chain Packing
fangleai/Implicit-LVM
This code repository presents the pytorch implementation of the paper “Implicit Deep Latent Variable Models for Text Generation”(EMNLP 2019).
PersistenceForever/Neural-Question-Generation-Survey-List
A comprehensive overview of neural question generation across diverse input formats.
orybkin/video-gcp
Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"
TianHongZXY/CoRe
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models
iwangjian/Color4Dial
Code and data for "Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue" (ACL Findings 2023).
yl3800/TranSTR
ryanshea10/personachat_offline_rl