HongkuanZhang's Stars
ramavedantam/cider
Python code for CIDEr (Consensus-based Image Description Evaluation), an image caption evaluation metric
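CIDEr scores a candidate caption by TF-IDF-weighted n-gram similarity against a consensus of reference captions. A minimal, unigram-only sketch of that idea (the real metric uses stemmed 1-4-grams and a length penalty; all function names here are illustrative, not from the repo):

```python
from collections import Counter
import math

def tfidf_vec(tokens, df, num_docs):
    # TF-IDF weighted unigram vector (simplified: real CIDEr uses 1-4-grams)
    tf = Counter(tokens)
    return {w: c * math.log(num_docs / (1 + df.get(w, 0))) for w, c in tf.items()}

def cosine(a, b):
    dot = sum(a[w] * b.get(w, 0.0) for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def cider_like(candidate, references, corpus_refs):
    # Consensus idea: average similarity of the candidate to each reference,
    # with rare (informative) words up-weighted by IDF over the corpus.
    num_docs = len(corpus_refs)
    df = Counter(w for doc in corpus_refs for w in set(doc.split()))
    cand_vec = tfidf_vec(candidate.split(), df, num_docs)
    sims = [cosine(cand_vec, tfidf_vec(r.split(), df, num_docs))
            for r in references]
    return sum(sims) / len(sims)
```

A caption identical to its reference scores 1.0; unrelated captions score lower because they share only low-IDF common words.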
showlab/EgoVLP
[NeurIPS 2022] Egocentric Video-Language Pretraining
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Vision-CAIR/VisualGPT
VisualGPT (CVPR 2022): using GPT as a decoder for vision-language models
HuiGuanLab/nrccr
Source code of our MM'22 paper Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning
fancy88/iBook
A collection of e-books
neubig/util-scripts
Various utility scripts useful for natural language processing, machine translation, etc.
woojeongjin/FewVLM
A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models (ACL 2022)
v-iashin/video_features
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
NVlabs/SegFormer
Official PyTorch implementation of SegFormer
google-research/vision_transformer
Alibaba-MIIL/STAM
Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021)
google-research/scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
ttengwang/Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based papers in computer vision and vision-language learning.
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
JingfengYang/Multi-modal-Deep-Learning
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
lixin4ever/Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
MILVLG/bottom-up-attention.pytorch
A PyTorch reimplementation of bottom-up-attention models
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
josephch405/curriculum-nmt
NLP2CT/norm-nmt
Norm-Based Curriculum Learning for Neural Machine Translation (ACL 2020)
YehLi/xmodaler
X-modaler is a versatile, high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
hwanheelee1993/UMIC
An unreferenced image captioning metric (ACL-21)
ChenRocks/UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
NLP2CT/ua-cl-nmt
Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)
openai/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
Eurus-Holmes/Awesome-Multimodal-Research
A curated list of Multimodal Related Research.