PeihaoChen

PeihaoChen's Stars

allenzren/open-pi-zero
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
Language:Python67347
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python23.4k2.3k
gnobitab/RectifiedFlow
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
Language:Python1.1k63
lucidrains/pi-zero-pytorch
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
Language:Python34414
real-stanford/im2Flow2Act
[CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface
Language:Python984
henry123-boy/SpaTracker
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
Language:Python79631
SnapDrop/snapdrop
A Progressive Web App for local file sharing
Language:JavaScript18.8k1.8k
robodhruv/visualnav-transformer
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
Language:Python72992
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language:Python9.2k605
XinyuSun/FGPrompt
official implementation of NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation"
Language:Python311
HappyColor/Vesper
A Compact and Effective Pretrained Model for Speech Emotion Recognition
Language:Python332
HappyColor/SpeechFormer
Official implement of SpeechFormer written in Python (PyTorch).
Language:Python778
colmap/colmap
COLMAP - Structure-from-Motion and Multi-View Stereo
Language:C++8.2k1.6k
XYZ-qiyh/colmap_3d_recon
🏡 Structure-from-Motion (SfM) and Multi-View Stereo (MVS)
Language:Python726
UMass-Foundation-Model/3D-LLM
Code for 3D-LLM: Injecting the 3D World into Large Language Models
Language:Python1k61
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
Language:Python1.7k87
XinyuSun/MME
official implementation of CVPR 23 paper "M3Video: Masked Motion Modeling for Self-Supervised Video Representation Learning"
Language:Python501
ZSHsh98/EPS-AD
This is the source code for Detecting Adversarial Data by Probing Multiple Perturbations Using Expected Perturbation Score (ICML2023).
Language:Python372
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python37.9k4.6k
PeihaoChen/ActiveCamera
Official implementation of NeurIPS 2022 paper "Learning Active Camera for Multi-Object Navigation"
Language:Python103
gunagg/zson
ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings. NeurIPS 2022
Language:Python678
PeihaoChen/WS-MGMap
Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation”
Language:Python284
cshizhe/VLN-DUET
Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).
Language:Python14711
vincentcartillier/Semantic-MapNet
Language:Python7911
threedworld-mit/tdw
ThreeDWorld simulation environment
Language:Python51875
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
18.8k2.6k
XinyuSun/awesome-self-supervised-representation-learning
awesome video representation learning
15
whwu95/MVFNet
【AAAI'2021】MVFNet: Multi-View Fusion Network for Efficient Video Recognition
Language:Python13312
PeihaoChen/RSPNet
Official Pytorch implementation for AAAI2021 paper (RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning)
Language:Python368
facebookresearch/OccupancyAnticipation
This repository contains code for our publication "Occupancy Anticipation for Efficient Exploration and Navigation" in ECCV 2020.
Language:Python7926