Pinned Repositories
att_entity_grounding
FACE_SRGAN
Face generation from extremely low-resolution images using DC_GAN.
keras-for-second-order-based-SGD
SGD based on second-order information.
keras_BEGAN
Implementation of BEGAN ([Boundary Equilibrium Generative Adversarial Networks](https://arxiv.org/pdf/1703.10717.pdf)) in Keras.
MTG-pytorch
Gender/age attribute grounding in a weakly supervised manner.
SEED
SpineSegment
Using a deep neural network to detect and segment lesion regions in the human spine.
Video2Commonsense
Video captioning baseline models on Video2Commonsense Dataset.
ViTCAP
Implementation for the CVPR 2022 paper "Injecting Semantic Concepts into End-to-End Image Captioning".
Weakly-Supervised-Action-Localization-by-Sparse-Temporal-Pooling-Network
jacobswan1's Repositories
jacobswan1/Video2Commonsense
Video captioning baseline models on Video2Commonsense Dataset.
jacobswan1/ViTCAP
Implementation for the CVPR 2022 paper "Injecting Semantic Concepts into End-to-End Image Captioning".
jacobswan1/SEED
jacobswan1/maskrcnn-benchmark
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
jacobswan1/SparseR-CNN
End-to-End Object Detection with Learnable Proposal, CVPR2021
jacobswan1/all-in-one
[Arxiv2022] All in One: Exploring Unified Video-Language Pre-training
jacobswan1/ASU-Thesis-Format
ASU Thesis Format
jacobswan1/botocore
The low-level, core functionality of boto 3.
jacobswan1/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
jacobswan1/CMC
Contrastive Multiview Coding
jacobswan1/coco-caption
jacobswan1/color-aware-style-transfer
Reference code for the paper CAMS: Color-Aware Multi-Style Transfer.
jacobswan1/Dense_Flow_Extraction
jacobswan1/denseflow
Extracting optical flow and frames
jacobswan1/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
jacobswan1/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
jacobswan1/IMRAM
Code for our CVPR 2020 paper "IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval"
jacobswan1/info-ground
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
jacobswan1/LocalizingMoments
Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"
jacobswan1/markdown-content
Markdown content for the www.aerobatic.io website
jacobswan1/Oscar
Oscar and VinVL
jacobswan1/PaddleSeg
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.
jacobswan1/pytorchvideo
A deep learning library for video understanding research.
jacobswan1/RESUME
jacobswan1/stable-diffusion
jacobswan1/stable-diffusion-1
jacobswan1/SwinBERT
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
jacobswan1/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
jacobswan1/video-swin-transformer-pytorch
Video Swin Transformer - PyTorch
jacobswan1/WebQA