Pinned Repositories
AI-Song-Cover-RVC
All-in-One Version: YouTube WAV Download, Vocal Separation, Audio Splitting, Training, and Inference Using Google Colab
badgids
Hello World!
ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
ComfyUI-InstructorOllama
Custom ComfyUI (https://github.com/comfyanonymous/ComfyUI) nodes for interacting with Ollama (https://ollama.com/) using the Instructor (https://github.com/jxnl/instructor) library to provide structured output from your LLM
ComfyUI-Manager
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
CondaLauncher
f5-tts
F5-TTS is a web application that allows users to clone voices and generate text-to-speech audio using advanced AI models.
OpenKlyde
An AI Discord bot that connects to a koboldcpp instance by API calls. Have a more intelligent Clyde Bot of your own making!
so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
transcription-app
A transcription application that listens to audio input from the microphone, transcribes it into text using OpenAI's Whisper, and simulates typing the transcription in real time wherever your cursor is on the screen. It can also do real-time translation.
badgids's Repositories
badgids/badgids
Hello World!
badgids/CondaLauncher
badgids/f5-tts
F5-TTS is a web application that allows users to clone voices and generate text-to-speech audio using advanced AI models.
badgids/audacity
Audio Editor
badgids/AI-Youtube-Shorts-Generator
A Python tool that uses GPT-4, FFmpeg, and OpenCV to automatically analyze videos, extract the most interesting sections, and crop them for an improved viewing experience.
badgids/blender
Official mirror of Blender
badgids/ComfyUI-GGUF
GGUF Quantization support for native ComfyUI models
badgids/exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
badgids/faust
Functional programming language for signal processing and sound synthesis
badgids/fausteditor
A simple Faust editor for the web
badgids/faustlibraries
The Faust libraries
badgids/FFmpeg
Mirror of https://git.ffmpeg.org/ffmpeg.git
badgids/freemocap
Free Motion Capture for Everyone 💀✨
badgids/kdenlive
Free and open source video editor, based on MLT Framework and KDE Frameworks
badgids/laravel
Laravel is a web application framework with expressive, elegant syntax. We’ve already laid the foundation for your next big idea — freeing you to create without sweating the small things.
badgids/LibreChat
Enhanced ChatGPT Clone: Features Agents, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.
badgids/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
badgids/marker
Convert PDF to markdown + JSON quickly with high accuracy
badgids/mlt
MLT Multimedia Framework
badgids/openvino-plugins-ai-audacity
A set of AI-enabled effects, generators, and analyzers for Audacity®.
badgids/powertabeditor
View and edit guitar tablature.
badgids/shotcut
Cross-platform (Qt), open-source (GPLv3) video editor
badgids/SillyTavern
LLM Frontend for Power Users.
badgids/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
badgids/tts-generation-webui
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
badgids/vlc-android
VLC for Android, Android TV and ChromeOS
badgids/VMB
Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging different representations and enhancing generation with RAG.
badgids/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
badgids/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
badgids/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)