MatthieuFP's Stars
lllyasviel/ControlNet
Let us control diffusion models!
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Zulko/moviepy
Video editing with Python
Rudrabha/Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.
mlfoundations/open_clip
An open source implementation of CLIP.
THUDM/CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Linzaer/Ultra-Light-Fast-Generic-Face-Detector-1MB
💎 1MB lightweight face detection model
kyutai-labs/moshi
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
lllyasviel/ControlNet-v1-1-nightly
Nightly release of ControlNet 1.1
BadToBest/EchoMimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
genmoai/models
The best OSS video generation models
X-LANCE/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
MRzzm/DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
facebookresearch/spiritlm
Inference code for the paper "Spirit-LM: Interleaved Spoken and Written Language Model".
lucidrains/rotary-embedding-torch
Implementation of Rotary Embeddings, from the RoFormer paper, in PyTorch
liming-ai/ControlNet_Plus_Plus
Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
uclanlp/awesome-fairness-papers
Papers on fairness in NLP
Weifeng-Chen/control-a-video
Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"
facebookresearch/unibench
Python library to evaluate the robustness of VLMs across diverse benchmarks
facebookresearch/SemDeDup
Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).
TheDenk/cogvideox-controlnet
Simple ControlNet module for the CogVideoX model.
GiilDe/turbo-edit
Glovo/foodi-ml-dataset
bltlab/paranames
ParaNames: A multilingual resource for parallel names
meharbhatia/globalrg
Data and Code for Paper "From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models"
j0ma/paranames-named-entity-recognition
ParaNames - NER experiments