wanggrun

Please visit my homepage: https://wanggrun.github.io/

Pinned Repositories

Adaptively-Connected-Neural-Networks
A re-implementation of our CVPR 2019 paper "Adaptively Connected Neural Networks"
Language:Python145 6 729
Adaptively-Connected-Neural-Networks-Pytorch
This is the pytorch implementation of "Adaptively Connected Neural Networks" for the currently popular EfficientNet and the efficient DNA network families.
10 5 10
Kalman-Normalization
Code of "Batch Kalman Normalization: Towards Training Deep Neural Networks with Micro-Batches"
Language:Python22 5 03
Learning-Feature-Pyramids
Code of "Training ImageNet and PASCAL VOC2012 via Learning Feature Pyramids "
Language:Python21 4 03
Learning-Feature-Pyramids-For-COCO
Training COCO 2017 Object Detection and Segmentation via Learning Feature Pyramids
Language:Python5 3 02
Semantic-Aware-AE
Language:Python7 2 21
SYSU-30k
SYSU-30k Dataset of "Weakly Supervised Person Re-ID: Differentiable Graphical Learning and A New Benchmark" https://arxiv.org/abs/1904.03845
Language:Python170 12 1724
TreeConv
This is a re-implementation of our KDD 2020 paper "Grammatically Recognizing Images with Tree Convolution."
Language:Python13 2 00
triplet
Code of the paper "Solving Inefficiency of Self-supervised Representation Learning"
Language:Python38 3 76
wanggrun.github.io
Language:HTML3 2 01

wanggrun's Repositories

wanggrun/wanggrun.github.io
Language:HTML3 2 01
wanggrun/Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook2 0 00
wanggrun/4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
Language:Python1 0 0
wanggrun/Depth-Anything
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language:Python1 0 0
wanggrun/materials_discovery
Language:Python1 0 0
wanggrun/pytorch-image-models-v2
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language:Python1 0 0
wanggrun/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
wanggrun/autogen
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
Language:Jupyter Notebook0 0
wanggrun/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook0 0
wanggrun/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
Language:Jupyter Notebook0 0
wanggrun/DIS
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
Language:Jupyter Notebook0 0
wanggrun/FLatten-Transformer
Official repository of FLatten Transformer (ICCV2023)
Language:Python0 0
wanggrun/GPT-4V-Act
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
Language:JavaScript0 0
wanggrun/humannerf
HumanNeRF turns a monocular video of moving people into a 360 free-viewpoint video.
Language:Python0 0
wanggrun/IDM-VTON
IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Language:Python0 0
wanggrun/inpaint-anything
Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
Language:Python0 0
wanggrun/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language:Jupyter Notebook0 0
wanggrun/latent-diffusion-inpainting
Language:Jupyter Notebook0 0
wanggrun/llama
Inference code for LLaMA models
Language:Python0 0
wanggrun/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
Language:Python0 0
wanggrun/mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
wanggrun/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Language:Python0 0
wanggrun/PeRF
[Technical Report 2023] PERF: Panoramic Neural Radiance Field from a Single Panorama
Language:Python0 0
wanggrun/pyllama
LLaMA: Open and Efficient Foundation Language Models
Language:Python0 0
wanggrun/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
wanggrun/rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
Language:Python0 0
wanggrun/StableVITON
Language:Python0 0
wanggrun/torch-ngp
A pytorch CUDA extension implementation of instant-ngp (sdf and nerf), with a GUI.
Language:Python0 0
wanggrun/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
Language:Python0 0
wanggrun/ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
Language:Python0 0