Pinned Repositories
detectron2-plugin-wandb
Detectron2 "plugin" to support epoch-based training and logging with Wandb
dinov2-patch
Patch for DinoV2 training code to support PyTorch 2.4
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
mask2former-wandb
Mask2Former with Wandb support from OneFormer
MLLM-Resources
A selfishly curated list of multi-modal LLM resources
oneformer-wandb
Wandb updates to OneFormer
open_clip
An open-source implementation of CLIP.
RADIO
Extension of https://github.com/NVlabs/RADIO for further evaluation
VILA
VILA - a multi-image visual language model with training, inference, and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops)
wandb-scripts
Wandb scripts
collinmccarthy's Repositories
collinmccarthy/wandb-scripts
Wandb scripts
collinmccarthy/detectron2-plugin-wandb
Detectron2 "plugin" to support epoch-based training and logging with Wandb
collinmccarthy/dinov2-patch
Patch for DinoV2 training code to support PyTorch 2.4
collinmccarthy/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
collinmccarthy/mask2former-wandb
Mask2Former with Wandb support from OneFormer
collinmccarthy/MLLM-Resources
A selfishly curated list of multi-modal LLM resources
collinmccarthy/oneformer-wandb
Wandb updates to OneFormer
collinmccarthy/open_clip
An open-source implementation of CLIP.
collinmccarthy/RADIO
Extension of https://github.com/NVlabs/RADIO for further evaluation
collinmccarthy/VILA
VILA - a multi-image visual language model with training, inference, and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops)
collinmccarthy/ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
collinmccarthy/ViT-CoMer
Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.