ZJU-PLP

Ph.D., College of Control Science and Engineering; Computer Vision

Zhejiang University, Hangzhou, China

ZJU-PLP's Stars

Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language: Python · 25.3k 218 4592.9k
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language: Python · 6.4k 41 296661
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Language: Python · 4.6k 122 54434
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
Language: Python · 1.1k 17 99137
RiseInRose/MiniGPT-4-ZH
MiniGPT-4 Chinese deployment translation, with refined deployment details
Language: Python · 856 10 15104
robotics-survey/Awesome-Robotics-Foundation-Models
847 25 285
thuiar/MMSA
MMSA is a unified framework for Multimodal Sentiment Analysis.
Language: Python · 654 10 103105
lapisrocks/LanguageAgentTreeSearch
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
Language: Python · 642 9 2967
NVlabs/RVT
Official Code for RVT-2 and RVT
Language: Jupyter Notebook · 265 9 5632
yehengchen/DOPE-ROS-D435
Object 6DoF Pose Estimation for Assembly Robots Trained on Synthetic Data - ROS Kinetic/Melodic Using Intel® RealSense D435
Language: C++ · 186 6 1236
Large-Trajectory-Model/ATM
Official codebase for "Any-point Trajectory Modeling for Policy Learning"
Language: Python · 153 0 1416
1989Ryan/llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
Language: Python · 148 5 315
zhouxian/act3d-chained-diffuser
A unified architecture for multimodal multi-task robotic policy learning.
Language: Python · 110 4 219
xukechun/Vision-Language-Grasping
[ICRA 2023] A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in Clutter
Language: Python · 86 1 1611
AnasIbrahim/image_agnostic_segmentation
Language: Python · 75 6 48
aminebdj/OpenYOLO3D
Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on the ScanNet200 and Replica datasets, with up to ∼16x speedup compared to the best existing method in the literature.
Language: Python · 63 6 134
IncideDigital/rvt2
An open source framework for computer forensics
Language: Python · 52 5 07
sumedh7/RoboCLIP
Official Implementation of RoboCLIP (NeurIPS 2023)
Language: Python · 33 3 36
etriantafyllidis/ROMAN
The RObotic MAnipulation Network
Language: C# · 29 1 25
vlc-robot/polarnet
[CoRL 2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation
Language: Python · 29 2 30
JingyangXiang/OvSW
PyTorch implementation of our paper "OvSW: Overcoming Silent Weights for Accurate Binary Neural Networks", accepted by ECCV 2024.
Language: Python · 15 2 10
Johann-Huber/qd_grasp
Language: Python · 14 2 10
TongZhangTHU/sgr
Official Code for SGRv2 and SGR.
Language: Python · 14 3 20
Aaron617/tree-planner
The source code for ICLR 2024 tree-planner (https://arxiv.org/abs/2310.08582)
Language: Python · 10 1 20
chenwei746/EEVG
Language: Python · 9 1
zengy268/MIM
Open source code for paper: Multimodal Reaction: Information Modulation for Cross-modal Representation Learning
Language: Python · 9
cv516Buaa/OVGNet
Language: Python · 5 1 00
niiceMing/CMTA
(NIPS23) Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning
Language: Python · 5 2 50
AZYoung233/CLGSI
Language: Python · 3 0
Bugs-Bunny01/VTF-AVIT
Accelerated Transformer Model for Slip Detection in Robotic Grasping through Visual-Tactile Sensor Integration
Language: Python · 3