memoiry's Stars
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
geekan/HowToLiveLonger
程序员延寿指南 | A programmer's guide to live longer
bloomberg/memray
Memray is a memory profiler for Python
TheR1D/shell_gpt
A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
sczhou/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
ZHKKKe/MODNet
A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]
IceClear/StableSR
[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
hkchengrex/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Meituan-AutoML/MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
jianzongwu/Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
chuanyangjin/fast-DiT
Fast Diffusion Models with Transformers
dvlab-research/LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
Algolzw/daclip-uir
[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.
hustvl/SparseInst
[CVPR 2022] SparseInst: Sparse Instance Activation for Real-Time Instance Segmentation
zhenyuw16/UniDetector
Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".
MCG-NJU/MixFormer
[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention
amazon-science/bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
Junyi42/sd-dino
Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"
showlab/videollm-online
VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)
Arthur151/Relative_Human
Relative Human dataset, CVPR 2022
KuanchihHuang/MonoDTR
MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer (CVPR 2022)
pq-yang/PGDiff
[NeurIPS 2023] PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance
Megvii-BaseDetection/DenseTeacher
DenseTeacher: Dense Pseudo-Label for Semi-supervised Object Detection
AKASH2907/pi-consistency-activity-detection
End-to-End Semi-Supervised Learning for Video Action Detection [CVPR 2022]
Restricted-Memory/RMem
official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation
Caoyichao/UniHOI
Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models"