Pinned Repositories
A-Survey-on-Generative-Diffusion-Model
ActflowToolbox
The Brain Activity Flow ("Actflow") Toolbox. Tools to quantify the relationship between connectivity and task activity through network simulations and machine learning prediction. Helps determine how connections contribute to specific brain functions.
Albert
PyTorch version corresponding to albert_zh
Algorithm_Interview_Notes-Chinese
2018/2019/campus recruiting/spring recruiting/autumn recruiting/algorithms/Machine Learning/Deep Learning/Natural Language Processing (NLP)/C/C++/Python/interview notes
AlgorithmsByPython
Algorithms/data structures/Python/Coding Interviews (剑指offer)/machine learning/leetcode
Alibaba-MIT-Speech
Alibaba speech technology
amazon-polly-developer-guide
The open source version of the Amazon Polly docs. You can submit feedback & requests for changes by submitting issues in this repo or by making proposed changes & submitting a pull request.
ASC_baseline
asr-study
Implementation of all-neural speech recognition systems using Keras and Tensorflow
asr_preprocessing
Python implementation of pre-processing for End-to-End speech recognition
xiaoyeye1117's Repositories
xiaoyeye1117/A-Survey-on-Generative-Diffusion-Model
xiaoyeye1117/ActflowToolbox
The Brain Activity Flow ("Actflow") Toolbox. Tools to quantify the relationship between connectivity and task activity through network simulations and machine learning prediction. Helps determine how connections contribute to specific brain functions.
xiaoyeye1117/ASC_baseline
xiaoyeye1117/audacity
Audio Editor
xiaoyeye1117/audioset_tagging_cnn
xiaoyeye1117/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
xiaoyeye1117/awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
xiaoyeye1117/Awesome-Vision-Transformer-Collection
Variants of Vision Transformer and its downstream tasks
xiaoyeye1117/braindecode
Deep learning software to decode EEG, ECG or MEG signals
xiaoyeye1117/cmu-thesis
Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling
xiaoyeye1117/ddx7
Differentiable FM Synthesis of Musical Instrument Sounds
xiaoyeye1117/DeepExplain
A unified framework of perturbation and gradient-based attribution methods for Deep Neural Networks interpretability. DeepExplain also includes support for Shapley Values sampling. (ICLR 2018)
xiaoyeye1117/EIN-SELD
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
xiaoyeye1117/ESC-50
ESC-50: Dataset for Environmental Sound Classification
xiaoyeye1117/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
xiaoyeye1117/FunpySpiderSearchEngine
Word2vec per-user personalized search + Scrapy2.3.0 (data crawling) + ElasticSearch7.9.1 (data storage and external RESTful API) + Django3.1.1 search
xiaoyeye1117/libri-light
Dataset for lightly supervised training using the LibriVox audiobook recordings. https://librivox.org/
xiaoyeye1117/Metrics
Machine learning evaluation metrics, implemented in Python, R, Haskell, and MATLAB / Octave
xiaoyeye1117/ml-ane-transformers
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
xiaoyeye1117/music2video
Making an AI-generated music video from any song with Wav2CLIP and VQGAN-CLIP
xiaoyeye1117/NeuroKit
NeuroKit2: The Python Toolbox for Neurophysiological Signal Processing
xiaoyeye1117/porcupine
On-device wake word detection powered by deep learning.
xiaoyeye1117/pyRiemann
Python machine learning package based on sklearn API for multivariate signal processing and statistical analysis of symmetric positive definite matrices via Riemannian geometry
xiaoyeye1117/Query-by-Example
xiaoyeye1117/snowboy
DNN based hotword and wake word detection toolkit (model generation included)
xiaoyeye1117/THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEM
xiaoyeye1117/torchvggish
Pytorch port of Google Research's VGGish model used for extracting audio features.
xiaoyeye1117/vadnet
Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks
xiaoyeye1117/voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
xiaoyeye1117/wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit