Pinned Repositories
Adaptively-Connected-Neural-Networks
A re-implementation of our CVPR 2019 paper "Adaptively Connected Neural Networks"
Adaptively-Connected-Neural-Networks-Pytorch
This is the PyTorch implementation of "Adaptively Connected Neural Networks" for the currently popular EfficientNet and the efficient DNA network families.
Kalman-Normalization
Code for "Batch Kalman Normalization: Towards Training Deep Neural Networks with Micro-Batches"
Learning-Feature-Pyramids
Code for "Training ImageNet and PASCAL VOC2012 via Learning Feature Pyramids"
Learning-Feature-Pyramids-For-COCO
Training COCO 2017 Object Detection and Segmentation via Learning Feature Pyramids
Semantic-Aware-AE
SYSU-30k
SYSU-30k Dataset of "Weakly Supervised Person Re-ID: Differentiable Graphical Learning and A New Benchmark" https://arxiv.org/abs/1904.03845
TreeConv
This is a re-implementation of our KDD 2020 paper "Grammatically Recognizing Images with Tree Convolution."
triplet
Code for the paper "Solving Inefficiency of Self-supervised Representation Learning"
wanggrun.github.io
wanggrun's Repositories
wanggrun/wanggrun.github.io
wanggrun/Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
wanggrun/4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
wanggrun/Depth-Anything
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
wanggrun/materials_discovery
wanggrun/pytorch-image-models-v2
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
wanggrun/AnyDoor
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
wanggrun/autogen
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
wanggrun/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
wanggrun/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
wanggrun/DIS
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
wanggrun/FLatten-Transformer
Official repository of FLatten Transformer (ICCV2023)
wanggrun/GPT-4V-Act
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
wanggrun/humannerf
HumanNeRF turns a monocular video of moving people into a 360 free-viewpoint video.
wanggrun/IDM-VTON
IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
wanggrun/inpaint-anything
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
wanggrun/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
wanggrun/latent-diffusion-inpainting
wanggrun/llama
Inference code for LLaMA models
wanggrun/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
wanggrun/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
wanggrun/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
wanggrun/PeRF
[Technical Report 2023] PERF: Panoramic Neural Radiance Field from a Single Panorama
wanggrun/pyllama
LLaMA: Open and Efficient Foundation Language Models
wanggrun/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
wanggrun/rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
wanggrun/StableVITON
wanggrun/torch-ngp
A PyTorch CUDA extension implementation of instant-ngp (SDF and NeRF), with a GUI.
wanggrun/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
wanggrun/ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions