YuchenLiu-a

YuchenLiu-a's Stars

3DTopia/MaterialAnything
Material Anything: Generating Materials for Any 3D Object via Diffusion
Language:Python21111
SonyResearch/COALA
COALA: A Practical and Vision-Centric Federated Learning Platform, accepted to ICML'24
Language:Python1062
SonyResearch/micro_diffusion
Repo is required for the code of our research paper on micro-budget training of large scale diffusion model.
1551
buaacyw/MeshAnything
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
Language:Python2.1k91
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python144k27.1k
HyoKong/DreamDrone
Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey.
Language:Python433
MorphingDB/MorphingDB
PostgreSQL extension for supporting deep learning model inference within the database and vector storage
Language:C++56
ashawkey/stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Language:Python8.3k736
facebookresearch/sscd-copy-detection
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
Language:Python26720
rushilsrivastava/image_search
Python Library to download images and metadata from popular search engines.
Language:Python12634
typst/typst
A new markup-based typesetting system that is powerful and easy to learn.
Language:Rust35.9k964
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Language:Python3.8k341
ryanwebster90/snip-dedup
Language:Python1006
zyxElsa/InST
Official implementation of the paper “Inversion-Based Style Transfer with Diffusion Models” (CVPR 2023)
Language:Jupyter Notebook54247
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Language:Python26.6k5.5k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python20.7k2.3k
HighCWu/ControlLoRA
ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information
Language:Python57127
lllyasviel/ControlNet
Let us control diffusion models!
Language:Python30.9k2.8k
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
Language:Jupyter Notebook7.1k483
pengr/IKD-MMT
Our code for EMNLP'22 Oral paper "Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation".
Language:Python301
pengr/Contrastive_AutoEval
Our code for ICCV'23 paper "CAME: Contrastive Automated Model Evaluation".
Language:Python263
rinongal/textual_inversion
Language:Jupyter Notebook2.9k282
google/prompt-to-prompt
Language:Jupyter Notebook3.2k300
kaixindelele/ChatPaper
Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Language:Python18.6k1.9k
facebookresearch/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Language:Python5.5k938
multimodallearning/pytorch-mask-rcnn
Language:Python2k556
yukimasano/PASS
The PASS dataset: pretrained models and how to get the data
Language:Python26217
czczup/ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
Language:Python1.3k140
zbwxp/SegVit
Official Pytorch Implementation of SegViT: Semantic Segmentation with Plain Vision Transformers
Language:Python22421
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Language:Python30.8k7.5k