jamespark3922

Jae Sung (James) Park, PhD Student at University of Washington

Pinned Repositories

adv-inf
Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)
Language:Python34 4 814
Ask-Anything
[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Language:Python0 0 00
cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language:Python0 0 00
cider
Python code for CIDEr - Consensus-based Image Caption Evaluation
Language:OpenEdge ABL0 0 00
coco-caption
Language:Jupyter Notebook0 1 00
densevid_eval
Evaluation code for Dense-Captioning Events in Videos
Language:Python1 1 00
localized-skd
Localized Symbolic Knowledge Distillation for Visual Commonsense Models (NeurIPS 2023)
Language:Jupyter Notebook4 1 20
lsmdc-baseline
Language:Python15 1 27
lsmdc-fillin
Identity-Aware Multi-Sentence Video Description
Language:Python15 2 37
visual-comet
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
Language:Python85 4 1312

jamespark3922's Repositories

jamespark3922/visual-comet
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
Language:Python85 4 1312
jamespark3922/adv-inf
Adversarial Inference for Multi-Sentence Video Descriptions (CVPR 2019)
Language:Python34 4 814
jamespark3922/lsmdc-baseline
Language:Python15 1 27
jamespark3922/lsmdc-fillin
Identity-Aware Multi-Sentence Video Description
Language:Python15 2 37
jamespark3922/localized-skd
Localized Symbolic Knowledge Distillation for Visual Commonsense Models (NeurIPS 2023)
Language:Jupyter Notebook4 1 20
jamespark3922/densevid_eval
Evaluation code for Dense-Captioning Events in Videos
Language:Python1 1 00
jamespark3922/Ask-Anything
[VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Language:Python0 0 00
jamespark3922/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Language:Python0 0 00
jamespark3922/cider
Python code for CIDEr - Consensus-based Image Caption Evaluation
Language:OpenEdge ABL0 0 00
jamespark3922/coco-caption
Language:Jupyter Notebook0 1 00
jamespark3922/Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Language:Jupyter Notebook0 0 00
jamespark3922/movie_eval
Language:Python0 0 00
jamespark3922/nlg-eval
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
Language:Python0 0 00
jamespark3922/RETRO-pytorch
Implementation of RETRO, DeepMind's retrieval-based attention net, in PyTorch
Language:Python0 0 00
jamespark3922/self-critical.pytorch
Unofficial PyTorch implementation for Self-critical Sequence Training for Image Captioning
Language:Python0 1 00
jamespark3922/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language:Jupyter Notebook0 0
jamespark3922/Video-ChatGPT
"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
Language:Python0 0
jamespark3922/video-lang-contrast-set
Language:Shell1 0
jamespark3922/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Language:Python0 0
