unilight

Assistant professor at Nagoya University, Japan.

Nagoya University, Nagoya, Japan

unilight's Stars

joonson/syncnet_python
Out of time: automated lip sync in the wild
Language:Python652147
serengil/deepface
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Language: Python 12.1k 2k
facebookresearch/pytorchvideo
A deep learning library for video understanding research.
Language: Python 3.3k 410
yistLin/universal-vocoder
A PyTorch implementation of the universal neural vocoder
Language:Python679
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Language: Python 1.9k 506
microsoft/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
Language: Python 1.1k 410
felixkreuk/UnsupSeg
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
Language:Python13629
xinjli/ucla-phonetic-corpus
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
Language:Python343
lilianemomeni/KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
Language:Python6212
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language: Python 8.7k 1.4k
maxrmorrison/torchcrepe
Pytorch implementation of the CREPE pitch tracker
Language:Python39962
kylebgorman/textgrid
A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat
Language:Python28162
zhouhaoyi/Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
Language: Python 5.3k 1.1k
dmort27/epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
Language:Python638121
tuanvu92/VCC2020
Language:Jupyter Notebook22
BYVoid/OpenCC
Conversion between Traditional and Simplified Chinese
Language: C++ 8.4k 975
nii-yamagishilab/VCC2020-listeningtest
1
n1243645679976/espnet
End-to-End Speech Processing Toolkit
1
Sinica-SLAM/Bottleneck_feature_extractor
Language:Python2
xinjli/allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Language:Python55486
nii-yamagishilab/VCC2020-database
539
openai/vdvae
Repository for the paper "Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images"
Language:Python43685
HLTSingapore/Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
31622
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Language:Python826157
taesungp/contrastive-unpaired-translation
Contrastive unpaired image-to-image translation, faster and lighter training than cyclegan (ECCV 2020, in PyTorch)
Language: Python 2.2k 417
microsoft/P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
Language:HTML20758
twtrubiks/docker-tutorial
Docker basics tutorial, from scratch (Docker-Beginners-Guide): learn to set up Django + PostgreSQL with Docker 📝
Language: Python 1.6k 296
kpu/kenlm
KenLM: Faster and Smaller Language Model Queries
Language: C++ 2.5k 513
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language: Python 2.2k 484
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python 30.3k 6.4k
