lifrary's Stars
microsoft/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
google-research/google-research
Google Research
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
black-forest-labs/flux
Official inference repo for FLUX.1 models
KwaiVGI/LivePortrait
Bring portraits to life!
MrNeRF/awesome-3D-gaussian-splatting
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
lllyasviel/IC-Light
More relighting!
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
lllyasviel/Paints-UNDO
Understand Human Behavior to Align True Needs
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
IDEA-Research/T-Rex
[ECCV 2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
PixArt-alpha/PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
luca-medeiros/lang-segment-anything
SAM with text prompt
apple/ml-4m
4M: Massively Multimodal Masked Modeling
SHI-Labs/OneFormer
OneFormer: One Transformer to Rule Universal Image Segmentation, arXiv 2022 / CVPR 2023
XLabs-AI/x-flux
HarborYuan/ovsam
[ECCV 2024] Official code for the paper "Open-Vocabulary SAM".
dangeng/visual_anagrams
Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"
IDEA-Research/Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
IDEA-Research/OpenSeeD
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
showlab/DragAnything
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
Qinying-Liu/Awesome-Open-Vocabulary-Semantic-Segmentation
A curated list of publications and resources on open-vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation).
UX-Decoder/DINOv
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
segments-ai/panoptic-segment-anything
Combining Segment Anything (SAM) with Grounding DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
google/RB-Modulation
Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"
instantX-research/CSGO
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
linzhiqiu/t2v_metrics
Evaluating text-to-image/video/3D models with VQAScore
liuff19/Physics3D
Official implementation of Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion
learn2phoenix/CSD