ponimatkin's Stars
Delgan/loguru
Python logging made (stupidly) simple
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general-purpose physics simulator.
sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
facebookresearch/co-tracker
CoTracker is a model for tracking any point (pixel) in a video.
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
apple/axlearn
An Extensible Deep Learning Library
real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
google-deepmind/mujoco_menagerie
A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
criteo/autofaiss
Automatically create Faiss KNN indices with optimal similarity search parameters.
hkchengrex/Cutie
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
jonbarron/camp_zipnerf
OpenGVLab/VideoMAEv2
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
RaymondWang987/NVDS
ICCV 2023 "Neural Video Depth Stabilizer" (NVDS) & TPAMI 2024 "NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation" (NVDS+)
geopavlakos/hamer
HaMeR: Reconstructing Hands in 3D with Transformers
TencentARC/UMT
UMT is a unified, flexible framework that handles different combinations of input modalities and outputs video moment retrieval and/or highlight detection results.
PKU-EPIC/GAPartNet
[CVPR 2023 Highlight] GAPartNet: Cross-Category Domain-Generalizable Object Perception and Manipulation via Generalizable and Actionable Parts.
shikharbahl/vrb
schmidtdominik/LAPO
Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)
yxKryptonite/RAM_code
Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation
JeanElsner/panda_mujoco
MuJoCo model of the Franka Emika Robot System
cvlab-columbia/dreamitate
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)
JeanElsner/dm_robotics_panda
Panda model for dm_robotics
andvg3/LGD
Dataset and Code for CVPR 2024 paper "Language-driven Grasp Detection."