Millielele's Stars
EasonXiao-888/GrootVL
[NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model
xupei0610/guitar
[SIGGRAPH Asia 2024] Synchronize Dual Hands for Physics-Based Dexterous Guitar Playing
Reagan1311/LOCATE
LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)
MzeroMiko/VMamba
VMamba: Visual State Space Models, code is based on Mamba
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
chenziwenhaoshuai/Vision-KAN
KAN for Vision Transformer
xmed-lab/CLIP_Surgery
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
amazon-science/AdaSlot
Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]
NVlabs/affordance_diffusion
Codes for "Affordance Diffusion: Synthesizing Hand-Object Interactions"
chenguolin/InstructScene
[ICLR 2024 spotlight] Official implementation of "InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior".
yl3800/LASO
ymxlzgy/echoscene
[ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.
MarSaKi/VLN-BEVBert
[ICCV 2023] Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"
snuvclab/coma
Official Repository for ECCV 2024 paper Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion Models
XinyuanWangCS/PromptAgent
This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel automatic prompt optimization method that autonomously crafts prompts equivalent in quality to those handcrafted by experts, i.e., expert-level prompts.
craigleili/GenZI
GenZI: Zero-Shot 3D Human-Scene Interaction Generation (CVPR 2024)
Sirui-Xu/InterDiff
[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"
lijiaman/omomo_release
Official Implementation of SIGGRAPH Asia 2023 (TOG) Paper: Object Motion Guided Human Motion Synthesis
neu-vi/HOI-Diff
HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models, arXiv 2023
dqj5182/CONTHO_RELEASE
[CVPR 2024] This repo is the official PyTorch implementation of Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer.
mohamedhassanmus/POSA
Populating 3D Scenes by Learning Human-Scene Interaction https://posa.is.tue.mpg.de/
afford-motion/afford-motion
Official implementation of CVPR24 highlight paper "Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance"
xiexh20/HDM
Official implementation for Hierarchical Diffusion Model in CVPR24: Template-free reconstruction of human-object interaction
OpenRobotLab/UniHSI
[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts
chihina/PJAE-ICCV2023
Official Code for the paper accepted at ICCV 2023
sangmin-git/MMSI
Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral)
jacobkrantz/IVLN-CE
Official Implementation of IVLN-CE: Iterative Vision-and-Language Navigation in Continuous Environments
jacobkrantz/Sim2Sim-VLNCE
Official implementation of the ECCV 2022 Oral paper: Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments
yyvhang/IAGNet
The PyTorch implementation of Grounding 3D Object Affordance from 2D Interactions in Images.
DepthAnything/Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation