jacobswan1

Applied Research Scientist for Vision and Language.

Amazon Alexa AI.San Jose

Pinned Repositories

att_entity_grounding
Language:Jupyter Notebook1 2 10
FACE_SRGAN
Face generation from a given extremely low resolution images using DC_GAN.
Language:Jupyter Notebook12 2 02
keras-for-second-order-based-SGD
Second order information based SGD
Language:Python1 4 01
keras_BEGAN
Implementation BEGAN([Boundary Equilibrium Generative Adversarial Networks](https://arxiv.org/pdf/1703.10717.pdf)) by Keras.
Language:Python1 3 00
MTG-pytorch
Gender/Age attribute grounding using weak supervised manner.
Language:Jupyter Notebook12 3 00
SEED
Language:Python36 2 1113
SpineSegment
Using deep neural network to detect and make segmentation of human beings' lesion regions in spine.
Language:Jupyter Notebook6 2 02
Video2Commonsense
Video captioning baseline models on Video2Commonsense Dataset.
Language:Python57 4 1112
ViTCAP
Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".
Language:Python41 1 111
Weakly-Supervised-Action-Localization-by-Sparse-Temporal-Pooling-Network
Language:Jupyter Notebook8 3 24

jacobswan1's Repositories

jacobswan1/Video2Commonsense
Video captioning baseline models on Video2Commonsense Dataset.
Language:Python57 4 1112
jacobswan1/ViTCAP
Implementation for CVPR 2022 paper " Injecting Semantic Concepts into End-to-End Image Captionin".
Language:Python41 1 111
jacobswan1/SEED
Language:Python36 2 1113
jacobswan1/maskrcnn-benchmark
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
Language:Python1 2 0
jacobswan1/SparseR-CNN
End-to-End Object Detection with Learnable Proposal, CVPR2021
Language:Python1 1 0
jacobswan1/all-in-one
[Arxiv2022] All in One: Exploring Unified Video-Language Pre-training
Language:Python1 0
jacobswan1/ASU-Thesis-Format
ASU Thesis Format
Language:TeX1 0
jacobswan1/botocore
The low-level, core functionality of boto 3.
Language:Python1 0
jacobswan1/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Language:Python1 0
jacobswan1/CMC
Contrastive Multiview Coding
Language:Python2 0
jacobswan1/coco-caption
Language:Jupyter Notebook1 0
jacobswan1/color-aware-style-transfer
Reference code for the paper CAMS: Color-Aware Multi-Style Transfer.
Language:Jupyter Notebook1 0
jacobswan1/Dense_Flow_Extraction
Language:Jupyter Notebook2 0
jacobswan1/denseflow
Extracting optical flow and frames
Language:C++1 0
jacobswan1/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Language:Jupyter Notebook1 0
jacobswan1/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Language:Python0 0
jacobswan1/IMRAM
code for our CVPR2020 paper "IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval"
Language:Python1 0
jacobswan1/info-ground
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
Language:Python1 0
jacobswan1/LocalizingMoments
Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"
Language:OpenEdge ABL1 0
jacobswan1/markdown-content
Markdown content for the www.aerobatic.io website
1 0
jacobswan1/Oscar
Oscar and VinVL
Language:Python1 0
jacobswan1/PaddleSeg
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.
Language:Python0 0
jacobswan1/pytorchvideo
A deep learning library for video understanding research.
Language:Python1 0
jacobswan1/RESUME
2 0
jacobswan1/stable-diffusion
Language:Jupyter Notebook1 0
jacobswan1/stable-diffusion-1
Language:Jupyter Notebook1 0
jacobswan1/SwinBERT
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
Language:Python1 0
jacobswan1/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language:Python1 0
jacobswan1/video-swin-transformer-pytorch
Video Swin Transformer - PyTorch
Language:Python1 0
jacobswan1/WebQA
Language:Shell1 0