Tenvence's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
lllyasviel/ControlNet
Let us control diffusion models!
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
JanDeDobbeleer/oh-my-posh
The most customisable and low-latency cross platform/shell prompt renderer
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
HugoBlox/hugo-blox-builder
🚨 GROW YOUR AUDIENCE WITH HUGOBLOX! 🚀 HugoBlox is an easy, fast no-code website builder for researchers, entrepreneurs, data scientists, and developers. Build stunning sites in minutes. 适合研究人员、企业家、数据科学家和开发者的简单快速无代码网站构建器。用拖放功能、可定制模板和内置SEO工具快速创建精美网站!
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
TencentARC/T2I-Adapter
T2I-Adapter
BadToBest/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
nerfies/nerfies.github.io
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
eliahuhorwitz/Academic-project-page-template
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
harlanhong/awesome-talking-head-generation
RayeRen/acad-homepage.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
muzishen/IMAGDressing
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing
ali-vilab/UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
mlch911/one-key-hidpi
Enable macOS HiDPI and have a native setting.
diffusion-classifier/diffusion-classifier
Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training
AILab-CVC/CV-VAE
[NeurIPS 24] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
liutaocode/talking-face-arxiv-daily
🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.
bloomberg/dataless-model-merging
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
zhiyuanhubj/LongRecipe
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
muzishen/PCDMs
[ICLR 2023] Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
llvy21/DUIC
NVlabs/CMD
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition (ICLR 2024)
npurson/fid-metrics
A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.