collinmccarthy

PhD Candidate, UC Davis.

@owensgroup Truckee, CA

Pinned Repositories

detectron2-plugin-wandb
Detectron2 "plugin" to support epoch-based training and logging with Wandb
Language:Python0 1 00
dinov2-patch
Patch for DinoV2 training code to support PyTorch 2.4
Language:Jupyter Notebook0 0 00
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python0 0 00
mask2former-wandb
Mask2Former with Wandb support from OneFormer
Language:Python0 0 00
MLLM-Resources
A selfishly-curated list of multi-modal LLM resources
00
oneformer-wandb
Wandb updates to OneFormer
Language:Jupyter Notebook0 0 00
open_clip
An open source implementation of CLIP.
Language:Python0 0 00
RADIO
Extension of https://github.com/NVlabs/RADIO for further evaluation
Language:Python0 0 00
VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language:Python0 0 00
wandb-scripts
Wandb scripts
Language:Python2 1 02

collinmccarthy/wandb-scripts
Wandb scripts
Language:Python2 1 02
collinmccarthy/detectron2-plugin-wandb
Detectron2 "plugin" to support epoch-based training and logging with Wandb
Language:Python0 1 00
collinmccarthy/dinov2-patch
Patch for DinoV2 training code to support PyTorch 2.4
Language:Jupyter Notebook0 0 00
collinmccarthy/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python0 0 00
collinmccarthy/mask2former-wandb
Mask2Former with Wandb support from OneFormer
Language:Python0 0 00
collinmccarthy/MLLM-Resources
A selfishly-curated list of multi-modal LLM resources
00
collinmccarthy/oneformer-wandb
Wandb updates to OneFormer
Language:Jupyter Notebook0 0 00
collinmccarthy/open_clip
An open source implementation of CLIP.
Language:Python0 0 00
collinmccarthy/RADIO
Extension of https://github.com/NVlabs/RADIO for further evaluation
Language:Python0 0 00
collinmccarthy/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language:Python0 0 00
collinmccarthy/ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
Language:Python0 0 00
collinmccarthy/ViT-CoMer
Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.
Language:Python0 0 00