liangxiaowei00's Stars
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
RLHF-V/RLAIF-V
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
BAAI-DCAI/SpatialBot
The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models."
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
andimarafioti/florence2-finetuning
A quick exploration into fine-tuning Florence-2.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs
OpenGVLab/Ask-Anything
[CVPR 2024 Highlight] [VideoChatGPT] ChatGPT with video understanding, plus support for many more LMs such as MiniGPT-4, StableLM, and MOSS.
magic-research/PLLaVA
Official repository for the paper PLLaVA
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V-Level MLLM for Single-Image, Multi-Image, and Video Understanding on Your Phone
google-deepmind/open_x_embodiment
octo-models/octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o.
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
meta-llama/llama3
The official Meta Llama 3 GitHub site
lpiccinelli-eth/UniDepth
Universal Monocular Metric Depth Estimation
NHirose/SACSoN
Scalable Autonomous Control for Social Navigation
robodhruv/visualnav-transformer
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Paitesanshi/LLM-Agent-Survey
ros2/examples
Example packages for ROS 2
MarkFzp/act-plus-plus
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Genesis-Embodied-AI/Genesis
A generative world for general-purpose robotics & embodied AI learning.
meta-llama/llama
Inference code for Llama models
Genesis-Embodied-AI/RoboGen
A generative and self-guided robotic agent that endlessly proposes and masters new skills.