Millielele

Millielele's Stars

EasonXiao-888/GrootVL
[NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model
Language:Python702
xupei0610/guitar
[SIGGRAPH Asia 2024] Synchronize Dual Hands for Physics-Based Dexterous Guitar Playing
Language:Python183
Reagan1311/LOCATE
LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)
Language:Python336
MzeroMiko/VMamba
VMamba: Visual State Space Models，code is based on mamba
Language:Python2.1k123
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Language:Python2.9k185
chenziwenhaoshuai/Vision-KAN
KAN for Vision Transformer
Language:Python20913
xmed-lab/CLIP_Surgery
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
Language:Jupyter Notebook35023
amazon-science/AdaSlot
Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]
Language:Python201
NVlabs/affordance_diffusion
Codes for "Affordance Diffusion: Synthesizing Hand-Object Interactions"
Language:Python1015
chenguolin/InstructScene
[ICLR 2024 spotlight] Official implementation of "InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior".
Language:Python8111
yl3800/LASO
Language:Python151
ymxlzgy/echoscene
[ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.
Language:Python411
MarSaKi/VLN-BEVBert
[ICCV 2023} Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"
Language:Python1804
snuvclab/coma
Official Repository for ECCV 2024 paper Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion Models
Language:Python433
XinyuanWangCS/PromptAgent
This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel automatic prompt optimization method that autonomously crafts prompts equivalent in quality to those handcrafted by experts, i.e., expert-level prompts.
Language:Python18522
craigleili/GenZI
GenZI: Zero-Shot 3D Human-Scene Interaction Generation (CVPR 2024)
Language:Python353
Sirui-Xu/InterDiff
[ICCV 2023] Official PyTorch implementation of the paper "InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion"
Language:Python2259
lijiaman/omomo_release
Official Implementation of SIGGRAPH Asia 2023 (TOG) Paper: Object Motion Guided Human Motion Synthesis
Language:Python1226
neu-vi/HOI-Diff
HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models, arXiv 2023
Language:Python916
dqj5182/CONTHO_RELEASE
[CVPR 2024] This repo is official PyTorch implementation of Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer.
Language:Python662
mohamedhassanmus/POSA
Populating 3D Scenes by Learning Human-Scene Interaction https://posa.is.tue.mpg.de/
Language:Python9912
afford-motion/afford-motion
Official implementation of CVPR24 highlight paper "Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance"
Language:Python1075
xiexh20/HDM
Official implementation for Hierarachical Diffusion Model in CVPR24 Template free reconstruction of human object interaction
Language:Python601
OpenRobotLab/UniHSI
[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts
Language:Python1537
chihina/PJAE-ICCV2023
Official Code for the paper accepted at ICCV 2023
Language:Python34
sangmin-git/MMSI
Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral)
Language:Python91
jacobkrantz/IVLN-CE
Official Implementation of IVLN-CE: Iterative Vision-and-Language Navigation in Continuous Environments
Language:Python281
jacobkrantz/Sim2Sim-VLNCE
Official implementation of the ECCV 2022 Oral paper: Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments
Language:Python25
yyvhang/IAGNet
The Pytorch implementation of Grounding 3D Object Affordance from 2D Interactios in Images.
Language:Python1112
DepthAnything/Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Language:Python3.4k277