aengusng8
Love the combination of mathematics, coding, and intuition. Contributor @huggingface 🤗; AI Research Resident @VinAIResearch
@VinAIResearch
aengusng8's Stars
karpathy/llm.c
LLM training in simple, raw C/CUDA
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
luosiallen/latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
karpathy/build-nanogpt
Video+code lecture on building nanoGPT from scratch
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
deepseek-ai/DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
ChenHsing/Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
GaParmar/img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
christophschuhmann/improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
facebookresearch/ov-seg
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
a1600012888/PhysDreamer
Code for PhysDreamer
segmind/segmoe
VinAIResearch/WaveDiff
Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)
vvictoryuki/AnimateZero
Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"
AlonzoLeeeooo/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
TencentARC/SmartEdit
Official code of SmartEdit [CVPR-2024 Highlight]
YuDeng/Portrait-4D
Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data (CVPR 24); Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer (ECCV 2024)
soraw-ai/Awesome-Text-to-Video-Generation
A list for Text-to-Video, Image-to-Video works
crowsonkb/simulacra-aesthetic-models
csxmli2016/w-plus-adapter
[CVPR 2024] When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image Generation
lucidrains/mixture-of-attention
Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts
2y7c3/ASD
[CVPR2024] Official Codes for "Adversarial Score Distillation: When score distillation meets GAN"
HuyNguyen-hust/flash-attn-101
Luvata/aixiv2-new
Generated summarization and implementation of some papers, hosted at https://luvata.github.io/aixiv2-new/