ofirbb's Stars
coqui-ai/TTS
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
lllyasviel/ControlNet
Let us control diffusion models!
microsoft/autogen
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
facefusion/facefusion
Next generation face swapper and enhancer
TransformerOptimus/SuperAGI
<β‘οΈ> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
huggingface/peft
π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
unifyai/ivy
The Unified AI Framework
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize π₯ We release the trained model on HuggingFace.
gfx-rs/wgpu
A cross-platform, safe, pure-Rust graphics API.
InstantID/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds π₯
Oneflow-Inc/oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
HumanAIGC/OutfitAnyone
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
microsoft/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
Bing-su/adetailer
Auto detecting, masking and inpainting with detection model.
SysCV/sam-hq
Segment Anything in High Quality [NeurIPS 2023]
riffusion/riffusion
Stable diffusion for real-time music generation
google/prompt-to-prompt
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
minar09/awesome-virtual-try-on
A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.
microsoft/NeuralSpeech
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
11cafe/comfyui-workspace-manager
A ComfyUI workflows and models management extension to organize and manage all your workflows, models in one place. Seamlessly switch between workflows, as well as import, export workflows, reuse subworkflows, install models, browse your models in a single workspace
whylabs/langkit
π LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). π Extracts signals from prompts & responses, ensuring safety & security. π‘οΈ Features include text quality, relevance metrics, & sentiment analysis. π A comprehensive tool for LLM observability. π
plemeri/transparent-background
This is a background removing tool powered by InSPyReNet (ACCV 2022)
SHI-Labs/Matting-Anything
Matting Anything Model (MAM), an efficient and versatile framework for estimating the alpha matte of any instance in an image with flexible and interactive visual or linguistic user prompt guidance.
Lightricks/LongAnimateDiff
visual-layer/visuallayer
Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, mislabels and others.