yashkant
phd @uoft-cs-robotics, @facebookresearch // prev @snap-research
University of TorontoToronto, Ontario
yashkant's Stars
CMU-Perceptual-Computing-Lab/openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
mem0ai/mem0
The Memory layer for your AI apps
black-forest-labs/flux
Official inference repo for FLUX.1 models
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
facebookresearch/sapiens
High-resolution models for human tasks.
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
facebookresearch/MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
caizhongang/SMPLer-X
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
graphdeco-inria/hierarchical-3d-gaussians
Official implementation of the SIGGRAPH 2024 paper "A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets"
megvii-research/megactor
zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
NVlabs/edm2
EDM2 and Autoguidance -- Official PyTorch implementation
apple/ml-mdm
Train high-quality text-to-image diffusion models in a data & compute efficient manner
hehao13/CameraCtrl
feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
fferflo/einx
Universal Tensor Operations in Einstein-Inspired Notation for Python.
lucidrains/autoregressive-diffusion-pytorch
Implementation of Autoregressive Diffusion in Pytorch
ttxskk/AiOS
[CVPR 2024] Official Code for "AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
lucidrains/mmdit
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
NJU-PCALab/OpenVid-1M
PKU-YuanGroup/ChronoMagic-Bench
[NeurIPS 2024 D&B Spotlightš„] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
yu-rp/KANbeFair
A More Fair and Comprehensive Comparison between KAN and MLP
y-zheng18/point_odyssey
Official code for PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking (ICCV 2023)
imagegridworth/IG-VLM
facebookresearch/ava-256
Train universal codec avatars
cloneofsimo/scaling-guide
WIP