Pinned Repositories
CVinW_Readings
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
awesome-computer-vision-conference-deadline
A curated list of Computer Vision related conferences with dates and paper registration deadlines.
cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
ImageBind
ImageBind: One Embedding Space to Bind Them All
LLaVA-NeXT
open_clip
An open source implementation of CLIP.
open_flamingo
An open-source framework for training large multimodal models.
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
jxcv-sony's Repositories
jxcv-sony/LLaVA-NeXT
jxcv-sony/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
jxcv-sony/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
jxcv-sony/open_clip
An open source implementation of CLIP.
jxcv-sony/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
jxcv-sony/open_flamingo
An open-source framework for training large multimodal models.
jxcv-sony/Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
jxcv-sony/ImageBind
ImageBind: One Embedding Space to Bind Them All
jxcv-sony/CVinW_Readings
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
jxcv-sony/awesome-computer-vision-conference-deadline
A curated list of Computer Vision related conferences with dates and paper registration deadlines.