ttengwang

Ph.D. student in computer science. My research interests lie in deep learning and computer vision, focusing on vision-language multimodal learning.

The University of Hong KongHong Kong

Pinned Repositories

FLM
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
Language:Python31 6 02
action-detection
temporal action detection with SSN
Language:Python0 2 00
Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
89 7 02
Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based paper in computer vision and vision-language learning.
867 37 568
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
Language:Python1.6k 15 2197
dense-video-captioning-pytorch
Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)
Language:Python72 4 923
ECHR
Code for paper "Event-centric hierarchical representation for dense video captioning" (TCSVT2020)
Language:Python8 1 13
ESGN
Event Sequence Generation Network
Language:Python13 2 41
PDVC
End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
Language:Python191 7 5922
VLMixer
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix (ICML 2022)
17 6 31

ttengwang's Repositories

ttengwang/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
Language:Python1.6k 15 2197
ttengwang/Awesome_Prompting_Papers_in_Computer_Vision
A curated list of prompt-based paper in computer vision and vision-language learning.
867 37 568
ttengwang/PDVC
End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
Language:Python191 7 5922
ttengwang/Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
89 7 02
ttengwang/dense-video-captioning-pytorch
Second-place solution to dense video captioning task in ActivityNet Challenge (CVPR 2020 workshop)
Language:Python72 4 923
ttengwang/VLMixer
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix (ICML 2022)
17 6 31
ttengwang/ESGN
Event Sequence Generation Network
Language:Python13 2 41
ttengwang/ECHR
Code for paper "Event-centric hierarchical representation for dense video captioning" (TCSVT2020)
Language:Python8 1 13
ttengwang/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
0 1 00
ttengwang/awesome-Vision-and-Language-Pre-training
Recent Advances in Vision and Language Pre-training (VLP)
0 1 00
ttengwang/awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
1 0
ttengwang/cider
python codes for CIDEr - Consensus-based Image Caption Evaluation
Language:OpenEdge ABL1 0
ttengwang/coco-caption
Language:Jupyter Notebook1 0
ttengwang/densecap
Dense video captioning in PyTorch
Language:Jupyter Notebook1 0
ttengwang/densevid_eval
Evaluation code for Dense-Captioning Events in Videos
Language:Python2 0
ttengwang/ENAS-pytorch
PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"
Language:Python1 0
ttengwang/EVA
EVA Series: Visual Representation Fantasies from BAAI
Language:Python1 0
ttengwang/faster-rcnn.pytorch
A faster pytorch implementation of faster r-cnn
Language:Python2 0
ttengwang/grounding_changing_distribution
1 0
ttengwang/hidden-networks
Language:Python1 0
ttengwang/ImageCaptioning.pytorch
image captioning codebase in pytorch(finetunable cnn in branch "with_finetune";diverse beam search can be found in 'dbs' branch; self-critical training is under my self-critical.pytorch repository.)
Language:Python2 0
ttengwang/lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
Language:Python0 0
ttengwang/merlot
MERLOT: Multimodal Neural Script Knowledge Models
Language:Python1 0
ttengwang/PrefixTuning
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Language:Python1 0
ttengwang/PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
1 0
ttengwang/rosita
ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
Language:Python1 0
ttengwang/self-critical.pytorch
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
Language:Python1 0
ttengwang/slowfast_feature_extractor
Feature Extractor module for videos using the PySlowFast framework
Language:Python1 0
ttengwang/STR
TMM: show, tell and rephrase
Language:Python1 0
ttengwang/ttengwang
2 01