Pinned Repositories
OpenLabeling
Label images and video for Computer Vision applications
ass1_PR
asteroid-filterbanks
:rocket: Asteroid's filterbanks
first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
GazeML
Gaze Estimation using Deep Learning, a Tensorflow-based framework.
hackathon-agel
heatmap-based-landmarker
An Implementation of Heatmap Regression via Randomized Rounding
speech_separation_PIT
The Simple project to separate mixed voice. Using "Permutation Invariant Training Loss" and "PairWise Neg SisDr Loss"
vuthede.github.io
zalo-hit-song-prediction
Hit Song prediction given singer name,composer name, release day, name of the song, the mp3 of the song. The detail of competition here https://challenge.zalo.ai/portal/hit-song. Here is the code and idea for 2nd place position on public and private board
vuthede's Repositories
vuthede/zalo-hit-song-prediction
Hit Song prediction given singer name,composer name, release day, name of the song, the mp3 of the song. The detail of competition here https://challenge.zalo.ai/portal/hit-song. Here is the code and idea for 2nd place position on public and private board
vuthede/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
vuthede/GazeML
Gaze Estimation using Deep Learning, a Tensorflow-based framework.
vuthede/Audiovisual-Synthesis
Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders
vuthede/avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
vuthede/barcode_detection
vuthede/BeautyGAN_pytorch
Official PyTorch implementation of BeautyGAN (ACM MM 2018)
vuthede/Deep-Clustering-for-Speech-Separation
Pytorch implements Deep Clustering: Discriminative Embeddings For Segmentation And Separation
vuthede/Deep_White_Balance
Reference code for the paper: Deep White-Balance Editing, CVPR 2020 (Oral). Our method is a deep learning multi-task framework for white-balance editing.
vuthede/dlib
A toolkit for making real world machine learning and data analysis applications in C++
vuthede/face-parsing.PyTorch
Using modified BiSeNet for face parsing in PyTorch
vuthede/Face-Super-Resolution
Face super resolution based on ESRGAN
vuthede/gcbapp-dockerfile-example
Example used in the Cloud Build GitHub app tutorial
vuthede/generative_inpainting
DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral
vuthede/HRNet-Facial-Landmark-Detection
High-resolution representation learning (HRNets) for facial landmark detection
vuthede/labelImg
:metal: LabelImg is a graphical image annotation tool and label object bounding boxes in images
vuthede/LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
vuthede/MMNet
Code for Towards Real-Time Automatic Portrait Matting on Mobile Devices
vuthede/multisensory
Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
vuthede/OpenLabeling
Label images and video for computer vision applications
vuthede/PFLD_UltraLight
vuthede/portrait-shadow-manipulation
vuthede/PRNet
Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network (ECCV 2018)
vuthede/PSGAN
PyTorch code for "PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer" (CVPR 2020 Oral)
vuthede/PyTorch-YOLOv3
Minimal PyTorch implementation of YOLOv3
vuthede/Sound-of-Pixels
Codebase for ECCV18 "The Sound of Pixels"
vuthede/speech_separation
Include some core functions and model to handle speech separation
vuthede/ssd_keras
A Keras port of Single Shot MultiBox Detector
vuthede/supervision-by-registration
Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors
vuthede/syncnet_trainer
Baseline for the VoxSRC 2020 self-supervised speaker verification