darkpeaceduck's Stars
cline/cline
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
DanielSWolf/rhubarb-lip-sync
Rhubarb Lip Sync is a command-line tool that automatically creates 2D mouth animation from voice recordings. You can use it for characters in computer games, in animated cartoons, or in any other project that requires animating mouths based on existing recordings.
lllyasviel/stable-diffusion-webui-forge
comfyanonymous/ComfyUI_bitsandbytes_NF4
black-forest-labs/flux
Official inference repo for FLUX.1 models
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
janhq/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
IlyaGusev/saiga
LLaVA-VL/LLaVA-NeXT
Acly/krita-ai-diffusion
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
IlyaGusev/rulm
Language modeling and instruction tuning for Russian
google-deepmind/alphageometry
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
janvarev/Irene-Voice-Assistant
Irene is a Russian-language voice assistant that works offline. Supports skills via plugins.
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
davabase/whisper_real_time
Real time transcription with OpenAI Whisper.
alphacep/awesome-russian-speech
Russian speech technology links
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
daniilrobnikov/vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
shigabeev/Q-VITS2-Voice-Cloning
WIP: VITS 2 with quantized output of text-encoder and voice cloning
resemble-ai/resemble-enhance
AI powered speech denoising and enhancement
dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
jeeliz/jeelizFaceFilter
Lightweight JavaScript/WebGL face tracking library designed for augmented reality webcam filters. Features: multiple face detection, rotation, mouth opening. Various integration examples are provided (Three.js, Babylon.js, FaceSwap, Canvas2D, CSS3D...).
ArthurFDLR/whisper-youtube
🔉 YouTube video transcription with OpenAI's Whisper
pydn/ComfyUI-to-Python-Extension
A powerful tool that translates ComfyUI workflows into executable Python code.
wangkai930418/awesome-diffusion-categorized
Collection of diffusion model papers categorized by their subareas
CiaraStrawberry/TemporalKit
An all-in-one solution for adding temporal stability to a Stable Diffusion render via an AUTOMATIC1111 extension
Vchitect/SEINE
[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction