AnranXu's Stars
lllyasviel/ControlNet-v1-1-nightly
Nightly release of ControlNet 1.1
timothybrooks/instruct-pix2pix
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
allenai/visprog
Official code for VisProg (CVPR 2023 Best Paper!)
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
mlfoundations/open_clip
An open source implementation of CLIP.
openai/guided-diffusion
NVlabs/stylegan
StyleGAN - Official TensorFlow Implementation
Zhendong-Wang/Diffusion-GAN
Official PyTorch implementation for paper: Diffusion-GAN: Training GANs with Diffusion
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
lllyasviel/ControlNet
Let us control diffusion models!
tribhuvanesh/vpa
Towards a Visual Privacy Advisor: Understanding and Predicting Risks in Images, ICCV '17
advimman/lama
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
fahadshamshad/Clip2Protect
[CVPR 2023] Official repository of paper titled "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent Search".
UniversalDataTool/react-image-annotate
Create image annotations. Classify, tag images with polygons, bounding boxes or points.
AIGText/GlyphControl-release
[NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"
google-gemini/generative-ai-python
The official Python library for the Google Gemini API
deep-floyd/IF
nvm-sh/nvm
Node Version Manager - POSIX-compliant bash script to manage multiple active node.js versions
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
openai/glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
openai/consistency_models
Official repo for consistency models.
chenfei-wu/TaskMatrix