EmreRed's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
atom/atom
:atom: The hackable text editor
FFmpeg/FFmpeg
Mirror of https://git.ffmpeg.org/ffmpeg.git
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
harry0703/MoneyPrinterTurbo
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
OpenTalker/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Rudrabha/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
levihsu/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
xanderfrangos/twinkle-tray
Easily manage the brightness of your monitors in Windows from the system tray
facebookresearch/sapiens
High-resolution models for human tasks.
hzwer/ECCV2022-RIFE
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
williamyang1991/Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
timsutton/brigadier
Fetch and install Boot Camp ESDs with ease.
usmannasir/cyberpanel
Cyber Panel - The hosting control panel for OpenLiteSpeed
ivanfioravanti/chatbot-ollama
Chatbot Ollama is an open source chat UI for Ollama.
erew123/alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
ahmetaa/zemberek-nlp
NLP tools for Turkish.
ckkelvinchan/RealBasicVSR
Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"
fofr/cog-consistent-character
Create images of a given character in different poses
Frikallo/MISST
A local GUI music source separation tool built on Tkinter and demucs serving as a free and open source Stem Player
poly-glot/tensorflowjs-remove-background
Remove Background from the picture using WebAssembly & TensorFlow.js
netinternet/parasut-v4
Parasut Php Api V4
CandyPack/CandyPack