Zj-BinXia's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Stability-AI/generative-models
Generative Models by Stability AI
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
Stability-AI/StableCascade
Official Code for Stable Cascade
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Yutong-Zhou-cv/Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
openai/consistencydecoder
Consistency Distilled Diff VAE
DmitryRyumin/ICCV-2023-Papers
ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!
caiyuanhao1998/Retinexformer
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023) & (NTIRE 2024 Challenge)
dvlab-research/LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
caiyuanhao1998/MST
A toolbox for spectral compressive imaging reconstruction including MST (CVPR 2022), CST (ECCV 2022), DAUHST (NeurIPS 2022), BiSCI (NeurIPS 2023), HDNet (CVPR 2022), MST++ (CVPRW 2022), etc.
Yangyi-Chen/Multimodal-AND-Large-Language-Models
Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.
SiatMMLab/Awesome-Diffusion-Model-Based-Image-Editing-Methods
Diffusion Model-Based Image Editing: A Survey (arXiv)
caiyuanhao1998/RSN
"Learning Delicate Local Representations for Multi-Person Pose Estimation" (ECCV 2020 Spotlight) & (COCO 2019 Human Keypoint Detection Challenge Winner) & (COCO 2019 Best Paper Award)
dvlab-research/LLMGA
This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral
AlonzoLeeeooo/awesome-text-to-image-studies
A collection of awesome text-to-image generation studies.
caiyuanhao1998/MST-plus-plus
"MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction" (CVPRW 2022) & (Winner of NTIRE 2022 Spectral Recovery Challenge) and a toolbox for spectral reconstruction
sled-group/InfEdit
[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"
AlphacatPlus/VmambaIR
This is official implementtaion of "VmambaIR: Visual State Space Model for Image Restoration"
linjing7/VR-Baseline
Video Restoration Toolbox including FGST (ICML 2022), S2SVR (ICML 2022), etc.
dvlab-research/Prompt-Highlighter
[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs
caiyuanhao1998/PNGAN
"Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training" (NeurIPS 2021)
Zj-BinXia/DiffRIR
Tao0-0/LGLA
This project is the official implementation of "Local and Global Logit Adjustments for Long-Tailed Learning", ICCV 2023
Tao0-0/FCGFace
This project is the official implementation of "Frontal-Centers Guided Face: Boosting Face Recognition by Learning Pose-Invariant Features", T-IFS 2022