lzy-tony's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
rust-lang/rustlings
🦀 Small exercises to get you used to reading and writing Rust code!
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
google/comprehensive-rust
This is the Rust course used by the Android team at Google. It provides you with the material to quickly teach Rust.
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
brendangregg/FlameGraph
Stack trace visualizer
mlfoundations/open_clip
An open source implementation of CLIP.
deep-floyd/IF
timothybrooks/instruct-pix2pix
showlab/Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
open-mmlab/mmtracking
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
baaivision/Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
xiaobai1217/Awesome-Video-Datasets
Video datasets
Computer-Vision-in-the-Wild/CVinW_Readings
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
snap-research/EfficientFormer
EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPS 2022]
jia-zhuang/pytorch-multi-gpu-training
A summary of the methods and principles of single-machine multi-GPU training in PyTorch
TencentARC/MasaCtrl
[ICCV 2023] Consistent Image Synthesis and Editing
naver-ai/DenseDiffusion
Official PyTorch Implementation of DenseDiffusion (ICCV 2023)
vislearn/ControlNet-XS
LeapLabTHU/FLatten-Transformer
Official repository of FLatten Transformer (ICCV 2023)
SHI-Labs/NATTEN
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
GitGyun/visual_token_matching
[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching
LeapLabTHU/ARC
[ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection
weixr18/MLAN
A Note for Machine Learning Algorithms
MengLcool/AdaViT
[CVPR-22] This is the official implementation of the paper "AdaViT: Adaptive Vision Transformers for Efficient Image Recognition".
LeapLabTHU/LAUDNet
[IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition
LeapLabTHU/Dynamic_Perceiver
Official implementation of Dynamic Perceiver
YixuanEvenXu/perturbed-maximization