Xu-Shihao

NTUSingapore

Pinned Repositories

Algorithm_Interview_Notes-Chinese
2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记
Language:Python00
at16k
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Language:Python00
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Language:Python00
AVEC2019
Baseline scripts for the Audio/Visual Emotion Challenge 2019
Language:Python00
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
00
awesome-public-datasets
A topic-centric list of HQ open datasets. PR ☛☛☛
00
Awesome_ML_for_mental_health
A curated list of awesome work on machine learning for mental health applications. Includes topics broadly captured by affective computing. Facial expressions, speech analysis, emotion prediction, depression, interactions, psychiatry etc. etc.
00
Ensemble-Learning
Ensemble learning based on sk-learn, where we merge the output of 5 different classifiers, and utilize cross validation grid search to optimize the hyperparameters.
Language:Python30
Feature-Selection
Features selector based on the self selected-algorithm, loss function and validation method
Language:Python10
Platoon-NS3
The aim of this project is to simulate the communication system switching between CCH and SCH of a autonomous vehicle platoon merging and splitting in the broadcasting environment
Language:C++6 2 01

Xu-Shihao's Repositories

Xu-Shihao/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Language:Python00
Xu-Shihao/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
00
Xu-Shihao/BangalASR
Transformer based Bangla Speech Recognition
Xu-Shihao/bert-as-service
Mapping a variable-length sentence to a fixed-length vector using BERT model
Xu-Shihao/BertGCN
Xu-Shihao/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Xu-Shihao/deep-speaker
Deep Speaker: an End-to-End Neural Speaker Embedding System.
Xu-Shihao/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Xu-Shihao/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Xu-Shihao/human
Human: AI-powered 3D Face Detection & Rotation Tracking, Face Description & Recognition, Body Pose Tracking, 3D Hand & Finger Tracking, Iris Analysis, Age & Gender & Emotion Prediction, Gaze Tracking, Gesture Recognition
Language:HTML0 0
Xu-Shihao/kaggle-birdclef-2021
Language:Jupyter Notebook
Xu-Shihao/kaldi
This is the official location of the Kaldi project.
Xu-Shihao/math-dataset
Xu-Shihao/Med-BERT
Med-BERT, contextualized embedding model for structured EHR data
Xu-Shihao/mlrun
Machine Learning automation and tracking
Xu-Shihao/nlpaug
Data augmentation for NLP
Xu-Shihao/noisereduce
Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)
Xu-Shihao/opencv_contrib
Repository for OpenCV's extra modules
Xu-Shihao/OpenFace
OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
Xu-Shihao/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
Xu-Shihao/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Xu-Shihao/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Xu-Shihao/shap
A game theoretic approach to explain the output of any machine learning model.
Xu-Shihao/Speaker_Verification
Tensorflow implementation of generalized end-to-end loss for speaker verification
Xu-Shihao/speechbrain
A PyTorch-based Speech Toolkit
Xu-Shihao/spleeter
Deezer source separation library including pretrained models.
Xu-Shihao/text_gcn
Graph Convolutional Networks for Text Classification. AAAI 2019
Xu-Shihao/UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Xu-Shihao/voicefixer
General Speech Restoration
Xu-Shihao/wespeaker
Research and Production Oriented Speaker Recognition Toolkit