Pinned Repositories
APOProject
An experiment in developing an APO (Audio Processing Object) on Windows 10.
ASR_Theory
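Speech recognition theory, including some of the material studied during the first and second years of graduate school, along with papers and PPT slides.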
athena-signal
beamforming
Matlab files for various types of beamforming
Beamforming-for-speech-enhancement
Simple delay-and-sum, MVDR, and CGMM-MVDR beamforming
btk20_documentation
btk 2.0 documentation
CGMM-MVDR
Implementation of the CGMM-MVDR beamforming
ComputeLibrary
The ARM Computer Vision and Machine Learning library is a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies.
cosmoflow-sims
Running the simulations for the CosmoFlow project
dagger
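Dagger is a log query and management system based on Loki. It is a project derived from the `Dayu Infrastructure Platform` (大禹基础设施平台) built by the cloud team at CloudMinds (达闼科技). Dagger runs in front of Loki and provides log querying, searching, saving, and downloading, which makes it suitable for container log management in cloud-native environments.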
gavin-pu's Repositories
gavin-pu/APOProject
An experiment in developing an APO (Audio Processing Object) on Windows 10.
gavin-pu/ASR_Theory
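Speech recognition theory, including some of the material studied during the first and second years of graduate school, along with papers and PPT slides.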
gavin-pu/athena-signal
gavin-pu/btk20_documentation
btk 2.0 documentation
gavin-pu/ComputeLibrary
The ARM Computer Vision and Machine Learning library is a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies.
gavin-pu/cosmoflow-sims
Running the simulations for the CosmoFlow project
gavin-pu/dagger
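Dagger is a log query and management system based on Loki. It is a project derived from the `Dayu Infrastructure Platform` (大禹基础设施平台) built by the cloud team at CloudMinds (达闼科技). Dagger runs in front of Loki and provides log querying, searching, saving, and downloading, which makes it suitable for container log management in cloud-native environments.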
gavin-pu/dancenet
DanceNet - 💃💃 Dance generator using Autoencoder, LSTM and Mixture Density Network. (Keras)
gavin-pu/DeepLearning
Introductory deep learning tutorials and notable articles (Deep Learning Tutorial)
gavin-pu/distant_speech_recognition
Spatial signal processing toolkit, a.k.a. beamforming toolkit 2.0 (BTK2.0)
gavin-pu/EA-SVC
An implementation of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"
gavin-pu/HyperFT
Open-source fast video face tracking for mobile devices, running at 150+ FPS on mobile
gavin-pu/llm-action
This project aims to share the technical principles behind large models along with hands-on practical experience.
gavin-pu/LPCNet
Efficient neural speech synthesis
gavin-pu/MASP
Microphone Array Speech Processing
gavin-pu/Microphone-Array-postfilter
gavin-pu/nara_wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
gavin-pu/odas
ODAS: Open embeddeD Audition System
gavin-pu/odas_web
A desktop visualization GUI for the ODAS library
gavin-pu/online-offline-CGMM-for-MVDR
Offline CGMM and CGMM with spatial prior distribution in an online manner
gavin-pu/pifuhd
High-Resolution 3D Human Digitization from A Single Image.
gavin-pu/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
gavin-pu/Sound_Localization_Algorithms
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.
gavin-pu/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
gavin-pu/Spherical-Harmonic-Transform
A collection of MATLAB routines for the Spherical Harmonic Transform and related manipulations in the spherical harmonic spectrum.
gavin-pu/Tacotron2-Wavenet-Korean-TTS
Korean TTS, Tacotron2, Wavenet
gavin-pu/TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported languages include English, Korean, and Chinese)
gavin-pu/ue4-mediapipe-plugin
UE4 MediaPipe plugin
gavin-pu/voice-web
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
gavin-pu/voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system