JeremyCJM's Stars
junyanz/pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
KevinMusgrave/pytorch-metric-learning
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
naver/dust3r
DUSt3R: Geometric 3D Vision Made Easy
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
ajbrock/BigGAN-PyTorch
The author's officially unofficial PyTorch BigGAN implementation.
wolny/pytorch-3dunet
3D U-Net model for volumetric semantic segmentation written in pytorch
JosephPai/Awesome-Talking-Face
📖 A curated list of resources dedicated to talking face.
autonomousvision/mip-splatting
[CVPR'24 Best Student Paper] Mip-Splatting: Alias-free 3D Gaussian Splatting
lizhe00/AnimatableGaussians
Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"
YunjinPark/awesome_talking_face_generation
YuelangX/Gaussian-Head-Avatar
[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"
VAST-AI-Research/TriplaneGaussian
TriplaneGaussian: A new hybrid representation for single-view 3D reconstruction.
SkalskiP/awesome-foundation-and-multimodal-models
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
joe-siyuan-qiao/WeightStandardization
Standardizing weights to accelerate micro-batch training
justimyhxu/GRM
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
NVlabs/edm2
Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)
Tobias-Fischer/rt_gene
RT-GENE: Real-Time Eye Gaze and Blink Estimation in Natural Environments
Ahmednull/L2CS-Net
The official PyTorch implementation of L2CS-Net for gaze estimation and tracking
YingqingHe/Awesome-LLMs-meet-Multimodal-Generation
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
YuDeng/Portrait-4D
Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data (CVPR 24); Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer (ECCV 2024)
johndpope/VASA-1-hack
Using Claude Opus to reverse engineer code from VASA white paper - WIP - (this is for La Raza 🎷)
microsoft/Swin3D
A shift-window based transformer for 3D sparse tasks
xucong-zhang/ETH-XGaze
Official implementation of ETH-XGaze dataset baseline
Beckschen/3D-TransUNet
This is the official repository for the paper "3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers"
mit-han-lab/flatformer
[CVPR'23] FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer
johndpope/MegaPortrait-hack
Using Claude Opus to reverse engineer code from MegaPortraits: One-shot Megapixel Neural Head Avatars
r-zemblys/gazeNet
gazeNet: End-to-end eye-movement event detection with deep neural networks
Kevinfringe/MegaPortrait
Implementation of Megaportrait
zgchen33/MCGaze
[IEEE SPL] End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context