WANGAndyYucheng's Stars
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
guoyww/AnimateDiff
Official implementation of AnimateDiff.
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
mosaicml/llm-foundry
LLM training code for Databricks foundation models
philz1337x/clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
XPixelGroup/DiffBIR
Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
ChenyangSi/FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
harlanhong/awesome-talking-head-generation
JosephPai/Awesome-Talking-Face
📖 A curated list of resources dedicated to talking face.
mini-sora/minisora
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
gnobitab/InstaFlow
:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
DirtyHarryLYL/LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
yiranran/Audio-driven-TalkingFace-HeadPose
Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)
princeton-nlp/LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
RenYurui/PIRender
The source code of the ICCV2021 paper "PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering"
facebookresearch/AGRoL
Code release for "Avatars Grow Legs Generating Smooth Human Motion from Sparse Tracking Inputs with Diffusion Model", CVPR 2023
shivangi-aneja/FaceTalk
[CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
tobias-kirschstein/nersemble
[Siggraph '23] NeRSemble: Neural Radiance Field Reconstruction of Human Heads
uuembodiedsocialai/FaceDiffuser
KumapowerLIU/PD-GAN
The official pytorch code of PD-GAN: Probabilistic Diverse GAN for Image Inpainting (CVPR 2021)
measure-infinity/mulan-code
JordanZh/C3Net
Official PyTorch implementation of C3Net: Compound Conditioned ControlNet for Multimodal Content Generation
zoryzhang/CurriculumMap
A visualization solution for curriculum and course relationships of Hongkong University of Science and Technology(HKUST)