yysjasmine's Stars

haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python, 20.1k stars, 2.2k forks
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Language: Python, 2.6k stars, 203 forks
allenai/open-instruct
Language: Python, 1.3k stars, 171 forks
esbatmop/MNBVC
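MNBVC (Massive Never-ending BT Vast Chinese corpus) is an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train chatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and so on.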
3.5k stars, 246 forks
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.4k stars, 210 forks
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Language: Python, 1.2k stars, 161 forks
openai/summarize-from-feedback
Code for "Learning to summarize from human feedback"
Language:Python989143
openai/following-instructions-human-feedback
1.2k stars, 141 forks
meta-llama/llama
Inference code for Llama models
Language: Python, 56.3k stars, 9.6k forks
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Language: Python, 37k stars, 3.2k forks
salesforce/CodeGen
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
Language: Python, 4.9k stars, 380 forks
facebookresearch/metaseq
Repo for external large-scale work
Language: Python, 6.5k stars, 725 forks
juncongmoo/chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable on a single GPU. 15x faster training process than ChatGPT
Language: Python, 1.2k stars, 138 forks
dalinvip/Awesome-ChatGPT
A collection of ChatGPT learning resources, continually updated......
4.1k stars, 384 forks
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Language: Python, 15k stars, 2.6k forks
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python, 29.5k stars, 4.1k forks
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language: Jupyter Notebook, 18.6k stars, 2.2k forks
opendilab/DI-store
OpenDILab RL Object Store
Language:Go1787
opendilab/DI-treetensor
Let DI-treetensor help you simplify the structure processing! (Do tree-structured operations turn into tangled logic at the slightest slip? DI-treetensor quickly sorts it out for you.)
Language:Python2064
opendilab/awesome-exploration-rl
A curated list of awesome exploration RL resources (continually updated)
38811
opendilab/InterFuser
[CoRL 2022] InterFuser: Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Language:Python54646
opendilab/LightTuner
Language:Python1742
opendilab/DI-hpc
OpenDILab RL HPC OP Lib, including CUDA and Triton kernel
Language:Python2247
opendilab/DI-orchestrator
OpenDILab RL Kubernetes Custom Resource and Operator Lib
Language:Go2435
opendilab/GoBigger-Challenge-2021
Interested in multi-agents? The 1st Go-Bigger Multi-Agent Decision Intelligence Challenge is coming and a big bonus is waiting for you!
Language:Python19533
opendilab/Gobigger-Explore
Still struggling with the high threshold or looking for the appropriate baseline? Come here and new starters can also play with your own multi-agents!
Language:Python1859
opendilab/GoBigger
[ICLR 2023] Come & try Decision-Intelligence version of "Agar"! Gobigger could also help you with multi-agent decision intelligence study.
Language:Python46034
opendilab/DI-bioseq
Decision Intelligence platform for Biological Sequence Searching
Language:Python1141
opendilab/DI-drive
Decision Intelligence Platform for Autonomous Driving simulation.
Language:Python57457
opendilab/awesome-multi-modal-reinforcement-learning
A curated list of Multi-Modal Reinforcement Learning resources (continually updated)
39713