vuthede

VinAI Research

Pinned Repositories

OpenLabeling
Label images and video for Computer Vision applications
Language: Python 934 32 43265
ass1_PR
Language: C++ 1 1 00
asteroid-filterbanks
🚀 Asteroid's filterbanks
Language: Python 0 0 00
first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
Language: Jupyter Notebook 1 0 00
GazeML
Gaze estimation using deep learning, a TensorFlow-based framework.
Language: Python 0 0 00
hackathon-agel
Language: Python 1 2 00
heatmap-based-landmarker
An Implementation of Heatmap Regression via Randomized Rounding
Language: Python 21 1 28
speech_separation_PIT
A simple project for separating mixed voices, using the "Permutation Invariant Training Loss" and the "PairWise Neg SisDr Loss".
Language: Python 33 1 16
vuthede.github.io
Language: HTML 3 1 10
zalo-hit-song-prediction
Hit song prediction given the singer name, composer name, release date, song title, and the song's MP3 file. Competition details: https://challenge.zalo.ai/portal/hit-song. This repository contains the code and ideas behind the 2nd-place solution on both the public and private leaderboards.
Language: Jupyter Notebook 69 6 115

vuthede's Repositories

vuthede/zalo-hit-song-prediction
Hit song prediction given the singer name, composer name, release date, song title, and the song's MP3 file. Competition details: https://challenge.zalo.ai/portal/hit-song. This repository contains the code and ideas behind the 2nd-place solution on both the public and private leaderboards.
Language: Jupyter Notebook 69 6 115
vuthede/first-order-model
This repository contains the source code for the paper First Order Motion Model for Image Animation
Language: Jupyter Notebook 1 0 00
vuthede/GazeML
Gaze estimation using deep learning, a TensorFlow-based framework.
Language: Python 0 0 00
vuthede/Audiovisual-Synthesis
Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders
Language: Jupyter Notebook 0 0
vuthede/avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
Language: Python 0 0
vuthede/barcode_detection
0 0
vuthede/BeautyGAN_pytorch
Official PyTorch implementation of BeautyGAN (ACM MM 2018)
vuthede/Deep-Clustering-for-Speech-Separation
PyTorch implementation of Deep Clustering: Discriminative Embeddings for Segmentation and Separation
vuthede/Deep_White_Balance
Reference code for the paper: Deep White-Balance Editing, CVPR 2020 (Oral). Our method is a deep learning multi-task framework for white-balance editing.
vuthede/dlib
A toolkit for making real world machine learning and data analysis applications in C++
Language: C++
vuthede/face-parsing.PyTorch
Using modified BiSeNet for face parsing in PyTorch
vuthede/Face-Super-Resolution
Face super resolution based on ESRGAN
Language: Python 0 0
vuthede/gcbapp-dockerfile-example
Example used in the Cloud Build GitHub app tutorial
Language: Dockerfile
vuthede/generative_inpainting
DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral
Language: Python 0 0
vuthede/HRNet-Facial-Landmark-Detection
High-resolution representation learning (HRNets) for facial landmark detection
Language: Python 0 0
vuthede/labelImg
🤘 LabelImg is a graphical image annotation tool for labeling object bounding boxes in images
Language: Python 0 0
vuthede/LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Language: C++ 0 0
vuthede/MMNet
Code for Towards Real-Time Automatic Portrait Matting on Mobile Devices
Language: Python 0 0
vuthede/multisensory
Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Language: Python 0 0
vuthede/OpenLabeling
Label images and video for computer vision applications
Language: Python
vuthede/PFLD_UltraLight
Language: Python 0 0
vuthede/portrait-shadow-manipulation
Language: Jupyter Notebook 0 0
vuthede/PRNet
Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network (ECCV 2018)
Language: Python
vuthede/PSGAN
PyTorch code for "PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer" (CVPR 2020 Oral)
vuthede/PyTorch-YOLOv3
Minimal PyTorch implementation of YOLOv3
Language: Python 0 0
vuthede/Sound-of-Pixels
Codebase for ECCV18 "The Sound of Pixels"
Language: Python 0 0
vuthede/speech_separation
Includes some core functions and models for speech separation
Language: Python 0 0
vuthede/ssd_keras
A Keras port of Single Shot MultiBox Detector
Language: Python 1 0
vuthede/supervision-by-registration
Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors
vuthede/syncnet_trainer
Baseline for the VoxSRC 2020 self-supervised speaker verification
Language: Python 0 0