firefishu's Stars
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
xtekky/gpt4free
The official gpt4free repository | various collection of powerful language models
deepfakes/faceswap
Deepfakes Software For All
s0md3v/roop
one-click face swap
BabylonJS/Babylon.js
Babylon.js is a powerful, beautiful, simple, and open game and rendering engine packed into a friendly JavaScript framework.
facefusion/facefusion
Industry leading face manipulation platform
MetaCubeX/ClashMetaForAndroid
A rule-based tunnel for Android.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
LargeWorldModel/LWM
1adrianb/face-alignment
:fire: 2D and 3D Face alignment library build using pytorch
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
rany2/edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Hillobar/Rope
GUI-focused roop
YaoFANGUK/video-subtitle-remover
基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
FujiwaraChoki/MoneyPrinterV2
Automate the process of making money online.
Morakito/Real-Time-Rendering-4th-CN
《Real-Time Rendering 4th》 (RTR4) 中文翻译
VainF/Awesome-Anything
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
anothermartz/Easy-Wav2Lip
Colab for making Wav2Lip high quality and easy to use
yeungchenwa/OCR-SAM
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting
bryful/F-s-PluginsProjects
After Effects Plugins