csholder's Stars
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
infer-actively/pymdp
A Python implementation of active inference for Markov Decision Processes
fangleai/Implicit-LVM
This code repository presents the pytorch implementation of the paper “Implicit Deep Latent Variable Models for Text Generation”(EMNLP 2019).
YifeiZhou02/ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
OpenLLMAI/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
PersistenceForever/Neural-Question-Generation-Survey-List
A comprehensive overview of neural question generation across diverse input formats.(Accepted to IJCAI 2024)
google-deepmind/alphageometry
dvlab-research/LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
yl3800/TranSTR
iwangjian/Color4Dial
Code and data for "Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue" (ACL Findings 2023).
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
XiaoxinHe/Awesome-Graph-LLM
A collection of AWESOME things about Graph-Related LLMs.
Leezekun/Directional-Stimulus-Prompting
[NeurIPS 2023] Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting"
ryanshea10/personachat_offline_rl
qhduan/cn-chat-arxiv
DeepGraphLearning/DiffPack
Implementation of DiffPack: A Torsional Diffusion Model for Autoregressive Protein Side-Chain Packing
xudejing/video-question-answering
Video Question Answering via Gradually Refined Attention over Appearance and Motion
TianHongZXY/CoRe
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
tomhartke/knowledge-graph-from-GPT
Using GPT to organize and access information, and generate questions. Long term goal is to make an agent-like research assistant.
openai/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
orybkin/video-gcp
Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"
google-research/FLAN
microsoft/DeepSpeedExamples
Example models using DeepSpeed
Yui010206/SeViLA
[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
dandelionsllm/pandallm
Panda项目是于2023年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。
GanjinZero/RRHF
[NIPS2023] RRHF & Wombat