YuchenLiu-a's Stars
3DTopia/MaterialAnything
Material Anything: Generating Materials for Any 3D Object via Diffusion
SonyResearch/COALA
COALA: A Practical and Vision-Centric Federated Learning Platform, accepted to ICML'24
SonyResearch/micro_diffusion
Code for our research paper on micro-budget training of large-scale diffusion models.
buaacyw/MeshAnything
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
HyoKong/DreamDrone
Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey.
MorphingDB/MorphingDB
PostgreSQL extension for supporting deep learning model inference within the database and vector storage
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
facebookresearch/sscd-copy-detection
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
rushilsrivastava/image_search
Python Library to download images and metadata from popular search engines.
typst/typst
A new markup-based typesetting system that is powerful and easy to learn.
rom1504/img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
ryanwebster90/snip-dedup
zyxElsa/InST
Official implementation of the paper “Inversion-Based Style Transfer with Diffusion Models” (CVPR 2023)
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
HighCWu/ControlLoRA
ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information
lllyasviel/ControlNet
Let us control diffusion models!
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
pengr/IKD-MMT
Our code for EMNLP'22 Oral paper "Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation".
pengr/Contrastive_AutoEval
Our code for ICCV'23 paper "CAME: Contrastive Automated Model Evaluation".
rinongal/textual_inversion
google/prompt-to-prompt
kaixindelele/ChatPaper
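Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow: ChatGPT-based full-paper summarization, professional translation, polishing, reviewing, and review responses.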
facebookresearch/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
multimodallearning/pytorch-mask-rcnn
yukimasano/PASS
The PASS dataset: pretrained models and how to get the data
czczup/ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
zbwxp/SegVit
Official PyTorch Implementation of SegViT: Semantic Segmentation with Plain Vision Transformers
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.