MatthieuFP

MatthieuFP's Stars

lllyasviel/ControlNet
Let us control diffusion models!
Language:Python31.2k 219 5592.8k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language:Python27.1k 211 4.4k5.6k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python21.1k 158 1.6k2.3k
Zulko/moviepy
Video editing with Python
Language:Python12.9k 250 1.5k1.6k
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Language:Python11.1k 173 6742.4k
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python10.8k 81 5061k
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language:Python10.3k 131 511956
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
7.6k 335 266927
Linzaer/Ultra-Light-Fast-Generic-Face-Detector-1MB
💎1MB lightweight face detection model (1MB轻量级人脸检测模型)
Language:Python7.2k 191 2661.5k
kyutai-labs/moshi
Language:Python7.1k 80 96557
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
6.2k 94 11340
lllyasviel/ControlNet-v1-1-nightly
Nightly release of ControlNet 1.1
Language:Python4.9k 62 129384
BadToBest/EchoMimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language:Python2.9k 43 180339
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks
2.7k 125 10229
genmoai/models
The best OSS video generation models
Language:Python2.1k 34 67209
X-LANCE/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Language:Jupyter Notebook1.5k 66 40140
MRzzm/DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Language:Python1k 19 122180
facebookresearch/spiritlm
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
Language:Python859 18 1956
lucidrains/rotary-embedding-torch
Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch
Language:Python607 11 3247
liming-ai/ControlNet_Plus_Plus
Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
Language:Python459 11 1919
uclanlp/awesome-fairness-papers
Papers on fairness in NLP
433 31 453
Weifeng-Chen/control-a-video
Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"
Language:Python378 22 2928
facebookresearch/unibench
Python Library to evaluate VLM models' robustness across diverse benchmarks
Language:Jupyter Notebook180 8 813
facebookresearch/SemDeDup
Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).
Language:Python123 2 913
TheDenk/cogvideox-controlnet
Simple Controlnet module for CogvideoX model.
Language:Jupyter Notebook97 4 97
GiilDe/turbo-edit
Language:Python91 1 46
Glovo/foodi-ml-dataset
Language:Jupyter Notebook57 13 16
bltlab/paranames
ParaNames: A multilingual resource for parallel names
Language:Python30 2 43
meharbhatia/globalrg
Data and Code for Paper "From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models"
2 1 10
j0ma/paranames-named-entity-recognition
ParaNames - NER experiments
Language:Python1 2 20