Wei-i's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
CompVis/stable-diffusion
A latent text-to-image diffusion model
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
chenfei-wu/TaskMatrix
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
mli/paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
amusi/Deep-Learning-Interview-Book
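Deep learning interview handbook (covering mathematics, machine learning, deep learning, computer vision, natural language processing, SLAM, and related areas)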
voxel51/fiftyone
The open-source tool for building high-quality datasets and computer vision models
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
LokerL/tts-vue
🎤 Microsoft speech synthesis tool, built with Electron + Vue + ElementPlus + Vite.
ShoufaChen/DiffusionDet
[ICCV2023 Oral] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
frgfm/torch-cam
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)
IDEA-Research/detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
yzy1996/English-Writing
Enhance Your English Writing for Science Research: English material for writing papers
pix2pixzero/pix2pix-zero
Zero-shot Image-to-Image Translation [SIGGRAPH 2023]
google-research/pix2seq
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
csuhan/OneLLM
[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language
shenyunhang/APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
gaopengcuhk/Stable-Pix2Seq
A full-fledged version of Pix2Seq
hujiecpp/YOSO
Code release for paper "You Only Segment Once: Towards Real-Time Panoptic Segmentation" [CVPR 2023]
ZhenglinZhou/STAR
[CVPR 2023] STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection
lxn96/awesome-few-shot-object-detection
A collection of papers and datasets about few-shot object detection for computer vision.
CamuseCao/XMU-thesis
A LaTeX template
csuhan/VFA
Official code of the paper "Few-Shot Object Detection via Variational Feature Aggregation" (AAAI 2023)
OatmealLiu/class-iNCD
PyTorch implementation for the paper Class-incremental Novel Class Discovery (ECCV 2022)
liuxingbin/dbot
[ICLR2024] Exploring Target Representations for Masked Autoencoders
Mi-Peng/Sparse-Sharpness-Aware-Minimization
[NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation
hujiecpp/Mini-Segment-Anything
Distilling the powerful segment anything models into lightweight ones for efficient segmentation.