Pinned Repositories
adetailer
Auto detecting, masking and inpainting with detection model.
talk-to-chatgpt
Talk to ChatGPT AI using your voice and listen to its answers through a voice
Whisper
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, DeepSpeed, narrator, model finetuning, custom models, wav file maintenance. It can also be used with 3rd Party software via JSON calls.
stable-diffusion-webui-state
Stable Diffusion extension that preserves ui state
noScribe
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
stable-diffusion-webui-forge
text-generation-webui
A Gradio web UI for Large Language Models.
Prometheatrics
Burning Man Camp founded in 2001, usually on the Esplanade, most known for the Tesseract.
sd-webui-infinite-image-browsing
A fast and powerful image/video browser for Stable Diffusion webui / ComfyUI / Fooocus / NovelAI / StableSwarmUI, featuring infinite scrolling and advanced search capabilities using image parameters. It also supports standalone operation.
Quidam2k's Repositories
Quidam2k/Prometheatrics
Burning Man Camp founded in 2001, usually on the Esplanade, most known for the Tesseract.