dohyun1411's Stars
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
gyoogle/tech-interview-for-developer
👶🏻 Encyclopedia of CS fundamentals & technical interview knowledge for entry-level developers 📖
jacobgil/pytorch-grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
zedr/clean-code-python
:bathtub: Clean Code concepts adapted for Python
ShoufaChen/DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
microsoft/Semi-supervised-learning
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
NVlabs/RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
tim-learn/awesome-test-time-adaptation
Collection of awesome test-time (domain/batch/instance) adaptation methods
DavidZhangdw/Visual-Tracking-Development
Visual Object Tracking
PengtaoJiang/Awesome-Weakly-Supervised-Semantic-Segmentation-Papers
Recent weakly supervised semantic segmentation papers
berkeley-hipie/HIPIE
[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"
wanghao9610/OV-DINO
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
yuanze-lin/Learnable_Regions
[CVPR 2024] Official code for "Text-Driven Image Editing via Learnable Regions"
gaomingqi/Awesome-Video-Object-Segmentation
:bookmark: Curated list of video object segmentation (VOS) papers, datasets, and projects.
kongds/E5-V
E5-V: Universal Embeddings with Multimodal Large Language Models
alimohammadiamirhossein/smite
PyTorch implementation of "SMITE: Segment Me In TimE"
BBBBchan/Awesome-Semi-Supervised-Semantic-Segmentation
A summary of recent semi-supervised semantic segmentation methods
cilinyan/VISA
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
aliasgharkhani/SLiMe
1-shot image segmentation using Stable Diffusion
tommasocarraro/LTNtorch
PyTorch implementation of Logic Tensor Networks, a Neural-Symbolic framework.
toriving/KoEDA
Korean Easy Data Augmentation
JerryX1110/awesome-rvos
Referring Video Object Segmentation / Multi-Object Tracking Repo
IIT-PAVIS/SpatialCommonsenseGraph
Code and dataset for the paper "Spatial Commonsense Graph for Localisation in Partial Scenes"
Shengcao-Cao/groundLMM
Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision
hee-suk-yoon/C-TPT
[ICLR'24] Official code for "C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion"
MaxwellYaoNi/PACE
[NeurIPS 2024 Spotlight] Official implementation for "PACE: marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization"
rirolab/LINGO-Space
[AAAI 2024] An official implementation of the paper "LINGO-Space: Language-Conditioned Incremental Grounding for Space"
kahnchana/locvlm
Unofficial Implementation of "Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs"