Pinned Repositories
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
audio-data-augmentation
Audio data augmentation examples
audio-pretrained-model
A collection of Audio and Speech pre-trained models.
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
awesome-causal-vision
A curated list of research papers in exploring causality in vision. Link to the code if available is also present.
awesome-domain-adaptation
A collection of AWESOME things about domian adaptation
Awesome-Learning-with-Label-Noise
A curated list of resources for Learning with Noisy Labels
awesome-semi-supervised-learning
:scroll: An up-to-date & curated list of awesome semi-supervised learning papers, methods & resources.
mlci
mlci model for textvqa
voca
This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.
zhangshengHust's Repositories
zhangshengHust/mlci
mlci model for textvqa
zhangshengHust/voca
This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.
zhangshengHust/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
zhangshengHust/audio-pretrained-model
A collection of Audio and Speech pre-trained models.
zhangshengHust/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
zhangshengHust/awesome-causal-vision
A curated list of research papers in exploring causality in vision. Link to the code if available is also present.
zhangshengHust/awesome-domain-adaptation
A collection of AWESOME things about domian adaptation
zhangshengHust/Awesome-Learning-with-Label-Noise
A curated list of resources for Learning with Noisy Labels
zhangshengHust/awesome-semi-supervised-learning
:scroll: An up-to-date & curated list of awesome semi-supervised learning papers, methods & resources.
zhangshengHust/DeepSpeaker-pytorch
Speaker embedding(verification and recognition) using Pytorch
zhangshengHust/Emotion-FAN
ICIP 2019: Frame Attention Networks for Facial Expression Recognition in Videos
zhangshengHust/espnet
End-to-End Speech Processing Toolkit
zhangshengHust/FixMatch-pytorch
Unofficial PyTorch implementation of "FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence"
zhangshengHust/google-research
Google Research
zhangshengHust/KDD_WinnieTheBest
KDD Cup 2020 Challenges for Modern E-Commerce Platform: Multimodalities Recall first place
zhangshengHust/mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
zhangshengHust/MMSA
CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotations of Modality (ACL2020)
zhangshengHust/multimodal-speech-emotion
TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18
zhangshengHust/Multimodal-Transformer
[ACL'19] [PyTorch] Multimodal Transformer
zhangshengHust/ResidualMaskingNetwork
Facial Expression Recognition using Residual Masking Network
zhangshengHust/Self-Supervised-Speech-Pretraining-and-Representation-Learning
The S3PRL speech toolkit: self-supervised pre-training and representation learning of Mockingjay, TERA, A-ALBERT, APC, and more to come. With easy-to-use standard downstream evaluation scripts including phone classification, speaker recognition, and ASR. (All in Pytorch!)
zhangshengHust/simmc
With the aim of building next generation virtual assistants that can handle multimodal inputs and perform multimodal actions, we introduce two new datasets (both in the virtual shopping domain), the annotation schema, the core technical tasks, and the baseline models. The code for the baselines and the datasets will be opensourced.
zhangshengHust/SkipVQVC
An implementation of SkipVQVC with various settings.
zhangshengHust/thexp-implement
Paper implement with thexp
zhangshengHust/Transformer-TTS
TTS model based on Transformer.
zhangshengHust/Transformer-TTS-1
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
zhangshengHust/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
zhangshengHust/VGGVox-PyTorch-1
Implementing VGGVox for VoxCeleb1 dataset in PyTorch.
zhangshengHust/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
zhangshengHust/xai-iml-sota
Interesting resources related to Explainable Artificial Intelligence, Interpretable Machine Learning, Interactive Machine Learning, Human in Loop and Visual Analytics.