OolongQian's Stars
ycm-core/YouCompleteMe
A code-completion engine for Vim
wkentaro/labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
milesial/Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
threestudio-project/threestudio
A unified framework for 3D content generation.
z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
thu-ml/prolificdreamer
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)
google-deepmind/tapnet
Tracking Any Point (TAP)
Yujun-Shi/DragDiffusion
[CVPR2024, Highlight] Official code for DragDiffusion
jonkhler/s2cnn
Spherical CNNs
tonyzhaozh/act
lucidrains/PaLM-pytorch
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
vimalabs/VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
Genesis-Embodied-AI/RoboGen
A generative and self-guided robotic agent that endlessly propose and master new skills.
huangwl18/VoxPoser
VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
Pointcept/GPT4Point
[CVPR'24 Highlight] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.
JiauZhang/DragDiffusion
Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing
f3rm/f3rm
F3RM: Feature Fields for Robotic Manipulation. Official repo for the paper "Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation" (CoRL 2023).
yyvhang/IAGNet
The Pytorch implementation of Grounding 3D Object Affordance from 2D Interactios in Images.
PKU-EPIC/GAPartNet
[CVPR 2023 Highlight] GAPartNet: Cross-Category Domain-Generalizable Object Perception and Manipulation via Generalizable and Actionable Parts.
lucys0/awe
Waypoint-Based Imitation Learning for Robotic Manipulation
zju3dv/gcasp
[CoRL 2022] Generative Category-Level Shape and Pose Estimation with Semantic Primitives
rh20t/rh20t_api
UT-Austin-RPL/sirius
Official codebase for Sirius: Robot Learning on the Job
CEC-Agent/CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
jianlanluo/SAQ
isri-aist/MujocoTactileSensorPlugin
Plugin to simulate tactile sensors in MuJoCo
siyan-zhao/decision-stacks
Implementation of Decision Stacks: Flexible RL via Modular Generative Models