jxcv-sony

Pinned Repositories

CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
10
awesome-computer-vision-conference-deadline
A curated list of Computer Vision related conferences with dates and paper registration deadlines.
00
cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language:Python00
Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Language:Jupyter Notebook00
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Language:Python00
ImageBind
ImageBind One Embedding Space to Bind Them All
Language:Python00
LLaVA-NeXT
Language:Python00
open_clip
An open source implementation of CLIP.
Language:Jupyter Notebook00
open_flamingo
An open-source framework for training large multimodal models.
Language:Python00
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Language:Python00

jxcv-sony's Repositories

jxcv-sony/LLaVA-NeXT
jxcv-sony/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
jxcv-sony/GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
jxcv-sony/open_clip
An open source implementation of CLIP.
jxcv-sony/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
jxcv-sony/open_flamingo
An open-source framework for training large multimodal models.
jxcv-sony/Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
jxcv-sony/ImageBind
ImageBind One Embedding Space to Bind Them All
jxcv-sony/CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
1
jxcv-sony/awesome-computer-vision-conference-deadline
A curated list of Computer Vision related conferences with dates and paper registration deadlines.