lzy-tony

PhD student @LeapLabTHU

Tsinghua UniversityBeijing

lzy-tony's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python170k 1.5k 3.1k44.8k
rust-lang/rustlings
:crab: Small exercises to get you used to reading and writing Rust code!
Language:Rust55.7k 341 69010.3k
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook48.5k 313 6815.7k
google/comprehensive-rust
This is the Rust course used by the Android team at Google. It provides you the material to quickly teach Rust.
Language:Rust28.3k 144 3071.7k
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language:Python27.1k 211 4.4k5.6k
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python21k 158 1.6k2.3k
brendangregg/FlameGraph
Stack trace visualizer
Language:Perl17.6k 483 1502k
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python10.8k 81 5061k
deep-floyd/IF
Language:Python7.7k 84 101505
timothybrooks/instruct-pix2pix
Language:Python6.5k 69 129546
showlab/Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Language:Python4.3k 51 97388
open-mmlab/mmtracking
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
Language:Python3.6k 49 465598
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Language:Python3.6k 100 163242
baaivision/Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
Language:Python2.5k 37 71176
xiaobai1217/Awesome-Video-Datasets
Video datasets
1.3k 29 1296
Computer-Vision-in-the-Wild/CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
1.2k 38 658
snap-research/EfficientFormer
EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]
Language:Python1k 37 5892
jia-zhuang/pytorch-multi-gpu-training
整理 pytorch 单机多 GPU 训练方法与原理
Language:Python788 5 883
TencentARC/MasaCtrl
[ICCV 2023] Consistent Image Synthesis and Editing
Language:Python757 20 5030
naver-ai/DenseDiffusion
Official Pytorch Implementation of DenseDiffusion (ICCV 2023)
Language:Jupyter Notebook486 11 2032
vislearn/ControlNet-XS
Language:Python457 16 3312
LeapLabTHU/FLatten-Transformer
Official repository of FLatten Transformer (ICCV2023)
Language:Python409 4 3324
SHI-Labs/NATTEN
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
Language:Cuda392 11 12231
GitGyun/visual_token_matching
[ICLR'23 Oral] Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching
Language:Python253 7 1713
LeapLabTHU/ARC
[ICCV 2023] Adaptive Rotated Convolution for Rotated Object Detection
Language:Python117 3 276
weixr18/MLAN
A Note for Machine Learning Algorithms
85 2 09
MengLcool/AdaViT
[CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".
Language:Python50 2 118
LeapLabTHU/LAUDNet
[IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition
Language:Jupyter Notebook45 3 12
LeapLabTHU/Dynamic_Perceiver
Official implementation of Dynamic Perceiver
Language:Python42 2 12
YixuanEvenXu/perturbed-maximization
Language:Python5 1 00

lzy-tony

lzy-tony's Stars

Significant-Gravitas/AutoGPT

rust-lang/rustlings

facebookresearch/segment-anything

google/comprehensive-rust

huggingface/diffusers

haotian-liu/LLaVA

brendangregg/FlameGraph

mlfoundations/open_clip

deep-floyd/IF

timothybrooks/instruct-pix2pix

showlab/Tune-A-Video

open-mmlab/mmtracking

Luodian/Otter

baaivision/Painter

xiaobai1217/Awesome-Video-Datasets

Computer-Vision-in-the-Wild/CVinW_Readings

snap-research/EfficientFormer

jia-zhuang/pytorch-multi-gpu-training

TencentARC/MasaCtrl

naver-ai/DenseDiffusion

vislearn/ControlNet-XS

LeapLabTHU/FLatten-Transformer

SHI-Labs/NATTEN

GitGyun/visual_token_matching

LeapLabTHU/ARC

weixr18/MLAN

MengLcool/AdaViT

LeapLabTHU/LAUDNet

LeapLabTHU/Dynamic_Perceiver

YixuanEvenXu/perturbed-maximization