PeihaoChen's Stars
allenzren/open-pi-zero
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
gnobitab/RectifiedFlow
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
lucidrains/pi-zero-pytorch
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
real-stanford/im2Flow2Act
[CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface
henry123-boy/SpaTracker
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
SnapDrop/snapdrop
A Progressive Web App for local file sharing
robodhruv/visualnav-transformer
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
voxel51/fiftyone
Refine high-quality datasets and visual AI models
XinyuSun/FGPrompt
official implementation of NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation"
HappyColor/Vesper
A Compact and Effective Pretrained Model for Speech Emotion Recognition
HappyColor/SpeechFormer
Official implement of SpeechFormer written in Python (PyTorch).
colmap/colmap
COLMAP - Structure-from-Motion and Multi-View Stereo
XYZ-qiyh/colmap_3d_recon
🏡 Structure-from-Motion (SfM) and Multi-View Stereo (MVS)
UMass-Foundation-Model/3D-LLM
Code for 3D-LLM: Injecting the 3D World into Large Language Models
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
XinyuSun/MME
official implementation of CVPR 23 paper "M3Video: Masked Motion Modeling for Self-Supervised Video Representation Learning"
ZSHsh98/EPS-AD
This is the source code for Detecting Adversarial Data by Probing Multiple Perturbations Using Expected Perturbation Score (ICML2023).
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
PeihaoChen/ActiveCamera
Official implementation of NeurIPS 2022 paper "Learning Active Camera for Multi-Object Navigation"
gunagg/zson
ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings. NeurIPS 2022
PeihaoChen/WS-MGMap
Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation”
cshizhe/VLN-DUET
Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).
vincentcartillier/Semantic-MapNet
threedworld-mit/tdw
ThreeDWorld simulation environment
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
XinyuSun/awesome-self-supervised-representation-learning
awesome video representation learning
whwu95/MVFNet
【AAAI'2021】MVFNet: Multi-View Fusion Network for Efficient Video Recognition
PeihaoChen/RSPNet
Official Pytorch implementation for AAAI2021 paper (RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning)
facebookresearch/OccupancyAnticipation
This repository contains code for our publication "Occupancy Anticipation for Efficient Exploration and Navigation" in ECCV 2020.