atong007's Stars
Vaibhavs10/insanely-fast-whisper
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
fpgaminer/joycaption
JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
Pythagora-io/gpt-pilot
The first real AI developer
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
public-apis/public-apis
A collective list of free APIs
n8n-io/n8n
Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.
ToTheBeginning/PuLID
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
guidance-ai/guidance
A guidance language for controlling large language models.
jbhuang0604/awesome-computer-vision
A curated list of awesome computer vision resources
instantX-research/InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
ggerganov/llama.cpp
LLM inference in C/C++
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
ltdrdata/ComfyUI-Manager
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Hack-with-Github/Awesome-Hacking
A collection of various awesome lists for hackers, pentesters and security researchers
skylot/jadx
Dex to Java decompiler
danielmiessler/fabric
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
cubiq/ComfyUI_IPAdapter_plus
wenquanlu/HandRefiner
Fannovel16/comfyui_controlnet_aux
ComfyUI's ControlNet Auxiliary Preprocessors
pyenv/pyenv
Simple Python version management
jianfch/stable-ts
Transcription, forced alignment, and audio indexing with OpenAI's Whisper