dominickrei's Stars
SudeepDasari/data4robotics
facebookresearch/sapiens
High-resolution models for human tasks.
bdaiinstitute/theia
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
baaivision/DIVA
Diffusion Feedback Helps CLIP See Better
jongwoopark7978/LVNet
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
LostXine/LLaRA
LLaRA: Large Language and Robotics Assistant
Charlotte-CharMLab/Fibottention
Inceptive Visual Representation Learning with Diverse Attention Across Heads. Image Classification, Action Recognition, and Robot Learning.
yfeng95/PoseGPT
meta-llama/llama3
The official Meta Llama 3 GitHub site
firework8/Awesome-Skeleton-based-Action-Recognition
A curated paper list of awesome skeleton-based action recognition.
Buzz-Beater/LEMMA
Code for ECCV 2020 paper - LEMMA: A Multi-view Dataset for LEarning Multi-agent Multi-task Activities
xiaobai1217/Awesome-Video-Datasets
Video datasets
dominickrei/crossway_diffusion
HiroIshida/mohou
deep visuomotor behavior cloning framework
real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
cage-challenge/cage-challenge-4
The TTCP CAGE Challenges are a series of public challenges instigated to foster the development of autonomous cyber defensive agents. This CAGE Challenge 4 (CC4) returns to a defence industry enterprise environment, and introduces a Multi-Agent Reinforcement Learning (MARL) scenario.
LostXine/crossway_diffusion
The official code of our ICRA'24 paper Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning
dominickrei/PoseAwareVT
Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
dominickrei/pi-vit
[CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living
RyannDaGreat/Diffusion-Illusions
Diffusion Illusions: Hiding Images in Plain Sight
jongwoopark7978/Grafting-Vision-Transformer
dominickrei/Limited-data-vits
[WACV 2024] Code for "Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders"
yysijie/st-gcn
Spatial Temporal Graph Convolutional Networks (ST-GCN) for Skeleton-Based Action Recognition in PyTorch
ZhouYuxuanYX/Hyperformer
This is the official implementation of our paper "Hypergraph Transformer for Skeleton-based Action Recognition."
dmlc/decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
Hzzone/pytorch-openpose
pytorch implementation of openpose including Hand and Body Pose Estimation.
tugstugi/dl-colab-notebooks
Try out deep learning models online on Google Colab