Pinned Repositories
GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
ODISE
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
mmdetection
OpenMMLab Detection Toolbox and Benchmark
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Deformable-Convolution-V2-PyTorch
Deformable ConvNets V2 in PyTorch
GCNet
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
GroupViT
GroupViT: Semantic Segmentation Emerges from Text Supervision
IMProv
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
mmdetection
Open MMLab Detection Toolbox with PyTorch 1.0
VFS
Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective, in ICCV 2021 (Oral)
xvjiarui's Repositories
xvjiarui/VFS
Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective, in ICCV 2021 (Oral)
xvjiarui/IMProv
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
xvjiarui/GroupViT
GroupViT: Semantic Segmentation Emerges from Text Supervision
xvjiarui/mmdetection
Open MMLab Detection Toolbox with PyTorch 1.0
xvjiarui/ODISE
ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models
xvjiarui/OFA-fairseq
fairseq from OFA
xvjiarui/prismatic-vlms
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
xvjiarui/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
xvjiarui/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
xvjiarui/CenterNet2
Two-stage CenterNet
xvjiarui/davis2017-evaluation
Evaluation Framework for DAVIS 2017 Semi-supervised and Unsupervised used in the DAVIS Challenges
xvjiarui/DeepSegmentor
A Pytorch implementation of DeepCrack and RoadNet projects.
xvjiarui/detectron2
Detectron2 is FAIR's next-generation platform for object detection and segmentation.
xvjiarui/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
xvjiarui/fvcore
Collection of common code that's shared among different research projects in FAIR computer vision team.
xvjiarui/litgpt
Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
xvjiarui/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
xvjiarui/lm-evaluation-harness
A framework for few-shot evaluation of language models.
xvjiarui/LWM
xvjiarui/Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
xvjiarui/mmaction2
OpenMMLab's Next Generation Action Understanding Toolbox and Benchmark
xvjiarui/mmcv
Open MMLab Computer Vision Foundation
xvjiarui/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
xvjiarui/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
xvjiarui/panopticapi
COCO 2018 Panoptic Segmentation Task API (Beta version)
xvjiarui/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
xvjiarui/stable-diffusion
xvjiarui/torchtune
A Native-PyTorch Library for LLM Fine-tuning
xvjiarui/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
xvjiarui/visual_prompting
Official implementation and data release of the paper "Visual Prompting via Image Inpainting".