ybCliff's Stars
katsura-jp/extruct-video-feature
Extract video feature from C3D pretrained on Sports-1M and Kinetics
fawazsammani/show-edit-tell
Show, Edit and Tell: A Framework for Editing Image Captions, CVPR 2020
PKU-ICST-MIPL/FGCrossNet_ACMMM2019
Source code of our ACM MM 2019 paper "A New Benchmark and Approach for Fine-grained Cross-media Retrieval".
zhegan27/Semantic_Compositional_Nets
The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"
lancopku/bag-of-words
Code for "Bag-of-Words as Target for Neural Machine Translation"
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
danieljf24/dual_encoding
[CVPR2019] Dual Encoding for Zero-Example Video Retrieval
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
li-xirong/avs
Ad-hoc Video Search
harritaylor/torchvggish
Pytorch port of Google Research's VGGish model used for extracting audio features.
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
facebookresearch/Mask-Predict
A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a partially masked target translation.
pbloem/former
Simple transformer implementation from scratch in pytorch.
shadowsocks/shadowsocks-libev
Bug-fix-only libev port of shadowsocks. Future development moved to shadowsocks-rust
jihunchoi/recurrent-batch-normalization-pytorch
PyTorch implementation of recurrent batch normalization
ruotianluo/self-critical.pytorch
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
mynlp/cst_captioning
PyTorch Implementation of Consensus-based Sequence Training for Video Captioning
adwardlee/multitask-end-to-end-video-captioning
with reinforcement learning
forwchen/mfcc_boaw
Extract MFCCs from videos and make bag-of-audio-words (BOAW) representations.
jameslyons/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
adambielski/siamese-triplet
Siamese and triplet networks with online pair/triplet mining in PyTorch
DavideA/c3d-pytorch
Pytorch porting of C3D network, with Sports1M weights
yaoli/arctic-capgen-vid
automatic video description generation with GPU training
kenshohara/3D-ResNets-PyTorch
3D ResNets for Action Recognition (CVPR 2018)
rock3125/sentence2vec
Sentence2vec by Rock
clcarwin/focal_loss_pytorch
A PyTorch Implementation of Focal Loss.
utkuozbulak/pytorch-cnn-visualizations
Pytorch implementation of convolutional neural network visualization techniques
JakubSochor/BoxCars
Source code related to BoxCars publication
dhaalves/CEAL_keras
Implementation of "Cost-Effective Active Learning for Deep Image Classification" paper