Pinned Repositories
AR-experiments
Templates for AR.js Experiments on web and mobile
dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Hotshot-XL
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
map-generator
A p5 illustrative map generator that provides soundcloud player for each principal asset
MoBlitz-V2
Traditional Animation Pad using p5js
pyChatGPT
An unofficial Python wrapper for OpenAI's ChatGPT API
soundkit-mograph
A soundkit keyboard that fires sounds and motion graphics, using P5 and bodymovin.
Storypoint
a simple storypoint generator helping you to tell fairy tales
TangoNode
A handy interface to build your own force-layout graph
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
fffiloni's Repositories
fffiloni/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
fffiloni/pyChatGPT
An unofficial Python wrapper for OpenAI's ChatGPT API
fffiloni/Hotshot-XL
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
fffiloni/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
fffiloni/daclip-uir
PyTorch implementation of the paper "Controlling Vision-Language Models for Universal Image Restoration"
fffiloni/MiniGPT4-video
fffiloni/OOTDiffusion
Official implementation of OOTDiffusion
fffiloni/style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
fffiloni/AnyV2V
A Plug-and-Play Framework For Any Video-to-Video Editing Tasks. Now with gradio demo
fffiloni/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
fffiloni/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
fffiloni/DemoFusion
Let us democratise high-resolution generation! (arXiv 2023)
fffiloni/PASD
fffiloni/Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
fffiloni/AniPortrait
AniPortrait with Gradio: Audio-Driven Synthesis of Photorealistic Portrait Animation
fffiloni/BasicPBC
Official Implementation of "Learning Inclusion Matching for Animation Paint Bucket Colorization"
fffiloni/CameraCtrl
fffiloni/cog-autocaption
Add caption to any video
fffiloni/DiffBIR
fffiloni/diffusion-motion-transfer
Official Pytorch Implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer""
fffiloni/DragNUWA
fffiloni/metavoice-src
AI for human-level speech intelligence
fffiloni/Open-Sora-Plan-v1-0-0
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
fffiloni/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
fffiloni/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
fffiloni/radio-olympiades
fffiloni/StoryDiffusion
Create Magic Story!
fffiloni/TAO-Amodal
Official Code for Tracking Any Object Amodally
fffiloni/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
fffiloni/zest_code
This is the official implementation of ZeST