dixitcy's Stars
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
myshell-ai/OpenVoice
Instant voice cloning by MIT and MyShell.
sidekiq/sidekiq
Simple, efficient background processing for Ruby
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
TencentARC/PhotoMaker
PhotoMaker [CVPR 2024]
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
jiaaro/pydub
Manipulate audio with a simple and easy high level interface
jxnl/instructor
Structured outputs for LLMs
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
ali-vilab/AnyDoor
Official implementation for the paper: AnyDoor: Zero-shot Object-level Image Customization
yisol/IDM-VTON
[ECCV2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild
metavoiceio/metavoice-src
Foundational model for human-like, expressive TTS
facebookresearch/co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
run-llama/llama-hub
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
Filimoa/open-parse
Improved file parsing for LLMs
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
Vaibhavs10/open-tts-tracker
ubisoft/ubisoft-laforge-animation-dataset
Ubisoft La Forge - Animation Dataset
ZHO-ZHO-ZHO/ComfyUI-PhotoMaker-ZHO
Unofficial implementation of PhotoMaker for ComfyUI
KAIST-VICLab/FMA-Net
[CVPR 2024 Oral] Official repository of FMA-Net
AlexanderDzhoganov/ComfyTextures
Unreal Engine ⚔️ ComfyUI - Automatic texturing using generative diffusion models
fal-ai/aura-sr
AuraSR: GAN-based Super-Resolution for real-world
HilaManor/AudioEditingCode
video-db/PromptClip
Instantly create video clips from LLM prompts
dabit3/fal-with-react-native
AI inference using fal.ai on a React Native app