Pinned Repositories
AdaShare
AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning
cs655-geni-mini-project
DIME-FM
Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"
dqn_gomoku_pytorch
DualCoOp
Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))
HCP-MLR-PL
mcnet_pytorch
TAI_video_frame_inpainting
Inpainting video frames via a Temporally-Aware Interpolation network.
TwoStreamVAN
VideoIQ
sunxm2357's Repositories
sunxm2357/AdaShare
AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning
sunxm2357/DualCoOp
Implementation for "DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations" (NeurIPS 2022))
sunxm2357/TAI_video_frame_inpainting
Inpainting video frames via a Temporally-Aware Interpolation network.
sunxm2357/VideoIQ
sunxm2357/DIME-FM
Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"
sunxm2357/TwoStreamVAN
sunxm2357/cs655-geni-mini-project
sunxm2357/HCP-MLR-PL
sunxm2357/autocast
Forecasting Future World Events with Neural Networks (NeurIPS 2022)
sunxm2357/autoencoder
Text autoencoder with LSTMs
sunxm2357/bookcorpus
Crawl BookCorpus
sunxm2357/Boundary-Detection-Evaluation-Tools
A user-friendly evaluation tool that encompasses all necessary components for boundary detection on PASCAL-Context and NYUD-v2 datasets.
sunxm2357/DoReFa
sunxm2357/Elevater_Toolkit_IC
Toolkit for Elevater Benchmark
sunxm2357/GLIP
Grounded Language-Image Pre-training
sunxm2357/google-images-download
Google/Bing Images Web Downloader
sunxm2357/imagenet_resnet
sunxm2357/instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
sunxm2357/LLaVA
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
sunxm2357/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
sunxm2357/Multi-Task-Transformer
Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding" and ECCV2022 paper "Inverted Pyramid Multi-task Transformer for Dense Scene Understanding"
sunxm2357/open_clip
An open source implementation of CLIP.
sunxm2357/open_flamingo
An open-source framework for training large multimodal models.
sunxm2357/pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
sunxm2357/reproducingASL
sunxm2357/SceneGraphParser
A python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).
sunxm2357/sunxm2357.github.io
sunxm2357/TaI-DPT
sunxm2357/taskonomy
Taskonomy: Disentangling Task Transfer Learning
sunxm2357/UniCL
The official implementation for "Unified Contrastive Learning in Image-Text-Label Space. CVPR 2022"