kowalgregy's Stars
hamadichihaoui/BIRD
This is the official implementation of "Blind Image Restoration via Fast Diffusion Inversion"
LituRout/RB-Modulation
Reference-Based Modulation (RB-Modulation)
ToonCrafter/ToonCrafter
A research paper for generative cartoon interpolation
HVision-NKU/StoryDiffusion
Create Magic Story!
MC-E/ReVideo
cloneofsimo/minRF
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
Inferencer/LipSick
🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮
Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
layerdiffusion/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
SHI-Labs/OneFormer
OneFormer: One Transformer to Rule Universal Image Segmentation, arxiv 2022 / CVPR 2023
ali-vilab/AnyDoor
Official implementation for the paper "AnyDoor: Zero-shot Object-level Image Customization"
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
hotshotco/Hotshot-XL
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
guoyww/AnimateDiff
Official implementation of AnimateDiff.
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
tobias17/sd-exodia
openai/shap-e
Generate 3D objects conditioned on text or images
deep-floyd/IF
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
zetavg/LLaMA-LoRA-Tuner
UI tool for fine-tuning and testing your own LoRA models based on LLaMA, GPT-J and more. One-click run on Google Colab. Plus a Gradio ChatGPT-like chat UI to demonstrate your language models.
mayooear/gpt4-pdf-chatbot-langchain
GPT4 & LangChain Chatbot for large PDF docs
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
h2oai/h2ogpt
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/
Stability-AI/StableLM
StableLM: Stability AI Language Models
IDEA-Research/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Vision-CAIR/MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
bigscience-workshop/petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.