rujiawang329's Stars
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
mistralai/mistral-inference
Official inference library for Mistral models
QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
langchain-ai/open-canvas
📃 A better UX for chat, writing content, and coding with LLMs.
LLaVA-VL/LLaVA-NeXT
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
InternLM/Tutorial
LLM & VLM Tutorial
AgibotTech/agibot_x1_infer
The inference module for AgiBot X1.
merveenoyan/smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
OpenDriveLab/DriveLM
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
mbzuai-oryx/LLaVA-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
AgibotTech/agibot_x1_hardware
The hardware design for AgiBot X1.
zubair-irshad/Awesome-Robotics-3D
A curated list of 3D vision papers related to the robotics domain in the era of large models (LLMs/VLMs), inspired by awesome-computer-vision; includes papers, code, and related websites
gokayfem/awesome-vlm-architectures
Famous Vision Language Models and Their Architectures
allenai/OLMoE
OLMoE: Open Mixture-of-Experts Language Models
zhengli97/Awesome-Prompt-Adapter-Learning-for-VLMs
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
123penny123/Awesome-LLM-RL
A comprehensive list of papers, codebases, and datasets on decision making using foundation models, including LLMs and VLMs.
allenai/OLMo-Eval
Evaluation suite for LLMs
hustvl/Senna
Bridging Large Vision-Language Models and End-to-End Autonomous Driving
IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
ge25nab/Awesome-VLM-AD-ITS
This repository collects research papers on large vision-language models in autonomous driving and intelligent transportation systems. It is continuously updated to track the latest work.
MCZhi/DTPP
[ICRA 2024] Differentiable Joint Conditional Prediction and Cost Evaluation for Tree Policy Planning
AIR-THU/DAIR-V2X-Seq
UT-Austin-RPL/Coopernaut
Coopernaut: End-to-End Driving with Cooperative Perception for Networked Vehicles
DLUT-LYZ/CODA-LM
Official PyTorch implementation of CODA-LM (https://arxiv.org/abs/2404.10595)
mistralai/mistral-evals
AIR-THU/V2X-Graph