ZJU-PLP
Ph.D., college of Control Science and Engineering Computer Vision
Zhejiang UniversityHangzhou.China
ZJU-PLP's Stars
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
RiseInRose/MiniGPT-4-ZH
MiniGPT-4 中文部署翻译 完善部署细节
robotics-survey/Awesome-Robotics-Foundation-Models
thuiar/MMSA
MMSA is a unified framework for Multimodal Sentiment Analysis.
lapisrocks/LanguageAgentTreeSearch
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
NVlabs/RVT
Official Code for RVT-2 and RVT
yehengchen/DOPE-ROS-D435
Object 6DoF Pose Estimation for Assembly Robots Trained on Synthetic Data - ROS Kinetic/Melodic Using Intel® RealSense D435
Large-Trajectory-Model/ATM
Official codebase for "Any-point Trajectory Modeling for Policy Learning"
1989Ryan/llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
zhouxian/act3d-chained-diffuser
A unified architecture for multimodal multi-task robotic policy learning.
xukechun/Vision-Language-Grasping
[ICRA 2023] A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in Clutter
AnasIbrahim/image_agnostic_segmentation
aminebdj/OpenYOLO3D
Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up ∼16x speedup compared to the best existing method in literature.
IncideDigital/rvt2
An open source framework for computer forensics
sumedh7/RoboCLIP
Official Implementation of RoboCLIP (NeurIPS 2023)
etriantafyllidis/ROMAN
The RObotic MAnipulation Network
vlc-robot/polarnet
[CoRL2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation
JingyangXiang/OvSW
Pytorch implementation of our paper OvSW: Overcoming Silent Weights for Accurate Binary Neural Networks accepted by ECCV 2024.
Johann-Huber/qd_grasp
TongZhangTHU/sgr
Official Code for SGRv2 and SGR.
Aaron617/tree-planner
The source code for iclr 2024 tree-planner https://arxiv.org/abs/2310.08582
chenwei746/EEVG
zengy268/MIM
Open source code for paper: Multimodal Reaction: Information Modulation for Cross-modal Representation Learning
cv516Buaa/OVGNet
niiceMing/CMTA
(NIPS23)Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning
AZYoung233/CLGSI
Bugs-Bunny01/VTF-AVIT
Accelerated Transformer Model for Slip Detection in Robotic Grasping through Visual-Tactile Sensor Integration