yashkant

phd @uoft-cs-robotics, @facebookresearch // prev @snap-research

University of TorontoToronto, Ontario

yashkant's Stars

CMU-Perceptual-Computing-Lab/openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
Language:C++31.5k 924 2k7.9k
mem0ai/mem0
The Memory layer for your AI apps
Language:Python23.5k 130 6962.2k
black-forest-labs/flux
Official inference repo for FLUX.1 models
Language:Python18.6k 159 01.3k
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Language:Python15.2k 122 1.1k2k
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python9.9k 127 478940
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Language:Python8.6k 69 184355
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
Language:Python7.2k 66 71555
facebookresearch/sapiens
High-resolution models for human tasks.
Language:Python4.7k 45 164268
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
Language:Python3.4k 39 268442
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language:Python1.3k 21 29125
facebookresearch/MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
Language:Python1.3k 12 3456
caizhongang/SMPLer-X
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
Language:Python1k 22 7874
graphdeco-inria/hierarchical-3d-gaussians
Official implementation of the SIGGRAPH 2024 paper "A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets"
Language:Python1k 18 9797
megvii-research/megactor
Language:Python860 44 29120
zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
Language:Python617 12 3852
NVlabs/edm2
EDM2 and Autoguidance -- Official PyTorch implementation
Language:Python586 12 525
apple/ml-mdm
Train high-quality text-to-image diffusion models in a data & compute efficient manner
Language:Python461 12 2535
hehao13/CameraCtrl
Language:Python458 12 1720
feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Language:Python389 5 1928
fferflo/einx
Universal Tensor Operations in Einstein-Inspired Notation for Python.
Language:Python333 4 109
lucidrains/autoregressive-diffusion-pytorch
Implementation of Autoregressive Diffusion in Pytorch
Language:Python326 12 59
ttxskk/AiOS
[CVPR 2024] Official Code for "AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
Language:Python280 22 246
lucidrains/mmdit
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
Language:Python271 3 37
NJU-PCALab/OpenVid-1M
Language:Python214 3 177
PKU-YuanGroup/ChronoMagic-Bench
[NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Language:Python189 3 614
yu-rp/KANbeFair
A More Fair and Comprehensive Comparison between KAN and MLP
Language:Jupyter Notebook152 1 1211
y-zheng18/point_odyssey
Official code for PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking (ICCV 2023)
Language:Python133 9 196
imagegridworth/IG-VLM
Language:Python129 4 85
facebookresearch/ava-256
Train universal codec avatars
Language:Jupyter Notebook105 11 165
cloneofsimo/scaling-guide
WIP
Language:Python90 9 01

yashkant

yashkant's Stars

CMU-Perceptual-Computing-Lab/openpose

mem0ai/mem0

black-forest-labs/flux

graphdeco-inria/gaussian-splatting

THUDM/CogVideo

arogozhnikov/einops

LargeWorldModel/LWM

facebookresearch/sapiens

google-research/scenic

PKU-YuanGroup/MagicTime

facebookresearch/MetaCLIP

caizhongang/SMPLer-X

graphdeco-inria/hierarchical-3d-gaussians

megvii-research/megactor

zhuzilin/ring-flash-attention

NVlabs/edm2

apple/ml-mdm

hehao13/CameraCtrl

feifeibear/long-context-attention

fferflo/einx

lucidrains/autoregressive-diffusion-pytorch

ttxskk/AiOS

lucidrains/mmdit

NJU-PCALab/OpenVid-1M

PKU-YuanGroup/ChronoMagic-Bench

yu-rp/KANbeFair

y-zheng18/point_odyssey

imagegridworth/IG-VLM

facebookresearch/ava-256

cloneofsimo/scaling-guide