wyddmw

Bazinga

NTUSingapore

wyddmw's Stars

OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python13k 107 617913
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language:Python9.7k 656 1581.3k
timothybrooks/instruct-pix2pix
Language:Python6.5k 69 129545
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Language:Python3.3k 55 68309
ActiveVisionLab/Awesome-LLM-3D
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
1.4k 51 783
zchoi/Awesome-Embodied-Agent-with-LLMs
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
1.1k 45 362
DAMO-NLP-SG/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Language:Python1k 10 12266
mbzuai-oryx/LLaVA-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
Language:Python822 10 3462
ayaanzhaque/instruct-nerf2nerf
Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions (ICCV 2023)
Language:Python814 16 9675
henry123-boy/SpaTracker
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
Language:Python768 58 4225
shalfun/DrivingDiffusion
Layout-Guided multi-view driving scene video generation with latent diffusion model
Language:Python569 20 1217
nianticlabs/mickey
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
Language:Python519 14 2134
maitrix-org/Pandora
Pandora: Towards General World Model with Natural Language Actions and Video States
Language:Python492 17 835
SkyworkAI/Vitron
NeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Language:Python460 15 2227
swc-17/SparseDrive
SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
Language:Python445 17 8462
autonomousvision/navsim
[NeurIPS 2024] NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking
Language:Python337 14 4424
fudan-zvg/PVG
Periodic Vibration Gaussian: Dynamic Urban Scene Reconstruction and Real-time Rendering
Language:Python293 27 3811
alfredgu001324/MapUncertaintyPrediction
[CVPR 2024 Award Candidate] Producing and Leveraging Online Map Uncertainty in Trajectory Prediction
Language:Python187 8 3214
wayveai/wayve_scenes
Codebase for the WayveScenes101 Dataset
Language:Python169 6 66
LostXine/LLaRA
LLaRA: Large Language and Robotics Assistant
Language:Python162 5 63
zd11024/NaviLLM
[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'
Language:Python147 5 2311
microsoft/Everything-of-Thoughts-XoT
An implemtation of Everyting of Thoughts (XoT).
Language:Python138 10 315
autodriving-heart/ECCV-2024-Papers-Autonomous-Driving
ECCV 2024 Paper List about Autonomous Driving
117 3 11
javyduck/ChatScene
[CVPR2024] ChatScene: Knowledge-Enabled Safety-Critical Scenario Generation for Autonomous Vehicles https://arxiv.org/abs/2405.14062
Language:Python100 2 325
yangxiaofeng/rectified_flow_prior
Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors
Language:Python91 4 83
wzcai99/Pixel-Navigator
Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", ICRA 2024
Language:Python79 1 136
lbaa2022/LLMTaskPlanning
LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)
Language:Jupyter Notebook64 3 85
ZCMax/ScanReason
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
Language:Python63 3 92
aim-uofa/GeoBench
A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.
Language:Python56 11 23
itl-ed/llm-dp
LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task
Language:Jupyter Notebook38 5 03

wyddmw

wyddmw's Stars

OpenBMB/MiniCPM-V

fudan-generative-vision/hallo

timothybrooks/instruct-pix2pix

X-PLUG/MobileAgent

ActiveVisionLab/Awesome-LLM-3D

zchoi/Awesome-Embodied-Agent-with-LLMs

DAMO-NLP-SG/VideoLLaMA2

mbzuai-oryx/LLaVA-pp

ayaanzhaque/instruct-nerf2nerf

henry123-boy/SpaTracker

shalfun/DrivingDiffusion

nianticlabs/mickey

maitrix-org/Pandora

SkyworkAI/Vitron

swc-17/SparseDrive

autonomousvision/navsim

fudan-zvg/PVG

alfredgu001324/MapUncertaintyPrediction

wayveai/wayve_scenes

LostXine/LLaRA

zd11024/NaviLLM

microsoft/Everything-of-Thoughts-XoT

autodriving-heart/ECCV-2024-Papers-Autonomous-Driving

javyduck/ChatScene

yangxiaofeng/rectified_flow_prior

wzcai99/Pixel-Navigator

lbaa2022/LLMTaskPlanning

ZCMax/ScanReason

aim-uofa/GeoBench

itl-ed/llm-dp