yysjasmine's Stars
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
allenai/open-instruct
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
openai/summarize-from-feedback
Code for "Learning to summarize from human feedback"
openai/following-instructions-human-feedback
meta-llama/llama
Inference code for Llama models
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
salesforce/CodeGen
CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
facebookresearch/metaseq
Repo for external large-scale work
juncongmoo/chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
dalinvip/Awesome-ChatGPT
ChatGPT资料汇总学习,持续更新......
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
opendilab/DI-store
OpenDILab RL Object Store
opendilab/DI-treetensor
Let DI-treetensor help you simplify the structure processing!(树形运算一不小心就逻辑混乱?DI-treetensor快速帮你搞定)
opendilab/awesome-exploration-rl
A curated list of awesome exploration RL resources (continually updated)
opendilab/InterFuser
[CoRL 2022] InterFuser: Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
opendilab/LightTuner
opendilab/DI-hpc
OpenDILab RL HPC OP Lib, including CUDA and Triton kernel
opendilab/DI-orchestrator
OpenDILab RL Kubernetes Custom Resource and Operator Lib
opendilab/GoBigger-Challenge-2021
Interested in multi-agents? The 1st Go-Bigger Multi-Agent Decision Intelligence Challenge is coming and a big bonus is waiting for you!
opendilab/Gobigger-Explore
Still struggling with the high threshold or looking for the appropriate baseline? Come here and new starters can also play with your own multi-agents!
opendilab/GoBigger
[ICLR 2023] Come & try Decision-Intelligence version of "Agar"! Gobigger could also help you with multi-agent decision intelligence study.
opendilab/DI-bioseq
Decision Intelligence platform for Biological Sequence Searching
opendilab/DI-drive
Decision Intelligence Platform for Autonomous Driving simulation.
opendilab/awesome-multi-modal-reinforcement-learning
A curated list of Multi-Modal Reinforcement Learning resources (continually updated)