EmreRed's Stars
erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
hzwer/ECCV2022-RIFE
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
facebookresearch/sapiens
High-resolution models for human tasks.
fofr/cog-consistent-character
Create images of a given character in different poses
ckkelvinchan/RealBasicVSR
Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
harry0703/MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
OpenTalker/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
levihsu/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
CandyPack/CandyPack
Frikallo/MISST
A local GUI music source separation tool built on Tkinter and demucs serving as a free and open source Stem Player
ollama/ollama
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
ivanfioravanti/chatbot-ollama
Chatbot Ollama is an open source chat UI for Ollama.
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
williamyang1991/Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
atom/atom
:atom: The hackable text editor
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
poly-glot/tensorflowjs-remove-background
Remove Background from the picture using WebAssembly & TensorFlow.js
FFmpeg/FFmpeg
Mirror of https://git.ffmpeg.org/ffmpeg.git
xanderfrangos/twinkle-tray
Easily manage the brightness of your monitors in Windows from the system tray
ahmetaa/zemberek-nlp
NLP tools for Turkish.
timsutton/brigadier
Fetch and install Boot Camp ESDs with ease.
usmannasir/cyberpanel
Cyber Panel - The hosting control panel for OpenLiteSpeed
netinternet/parasut-v4
Parasut Php Api V4