gavin-pu

Pinned Repositories

APOProject
A trial of developing a APO (Audio Processing Object), working on Windows 10.
Language:C++00
ASR_Theory
语音识别理论，包括研一与研二期间部分所学，论文和PPT
0 1 01
athena-signal
Language:C0 1 00
beamforming
Matlab files for various types of beamforming
Language:MATLAB00
Beamforming-for-speech-enhancement
simple delaysum, MVDR and CGMM-MVDR
Language:Python00
btk20_documentation
btk 2.0 documentation
Language:Python0 1 00
CGMM-MVDR
Implementation of the CGMM-MVDR beamforming
Language:Python00
ComputeLibrary
The ARM Computer Vision and Machine Learning library is a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies.
Language:C++00
cosmoflow-sims
Running the simulations for the CosmoFlow project
Language:C++00
dagger
Dagger 是一个基于 Loki 的日志查询和管理系统，它是由达闼科技（ CloudMinds ）云团队的`大禹基础设施平台`派生出来的一个项目。Dagger 运行在 Loki 前端，具备日志查询、搜索，保存和下载等特性，适用于云原生场景下的容器日志管理场景。
Language:Vue0 1 00

gavin-pu's Repositories

gavin-pu/APOProject
A trial of developing a APO (Audio Processing Object), working on Windows 10.
Language:C++00
gavin-pu/ASR_Theory
语音识别理论，包括研一与研二期间部分所学，论文和PPT
0 1 01
gavin-pu/athena-signal
Language:C0 1 00
gavin-pu/btk20_documentation
btk 2.0 documentation
Language:Python0 1 00
gavin-pu/ComputeLibrary
The ARM Computer Vision and Machine Learning library is a set of functions optimised for both ARM CPUs and GPUs using SIMD technologies.
Language:C++00
gavin-pu/cosmoflow-sims
Running the simulations for the CosmoFlow project
Language:C++00
gavin-pu/dagger
Dagger 是一个基于 Loki 的日志查询和管理系统，它是由达闼科技（ CloudMinds ）云团队的`大禹基础设施平台`派生出来的一个项目。Dagger 运行在 Loki 前端，具备日志查询、搜索，保存和下载等特性，适用于云原生场景下的容器日志管理场景。
Language:Vue0 1 00
gavin-pu/dancenet
DanceNet -💃💃Dance generator using Autoencoder, LSTM and Mixture Density Network. (Keras)
gavin-pu/DeepLearning
深度学习入门教程, 优秀文章, Deep Learning Tutorial
Language:Jupyter Notebook1 0
gavin-pu/distant_speech_recognition
spatial signal processing toolkit a.k.a beamforming toolkit 2.0 (BTK2.0)
Language:C++
gavin-pu/EA-SVC
An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"
gavin-pu/HyperFT
开源移动端快速视频人脸跟踪-移动端150FPS+
Language:C++1 0
gavin-pu/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
gavin-pu/LPCNet
Efficient neural speech synthesis
Language:C1 0
gavin-pu/MASP
Microphone Array Speech Processing
Language:MATLAB1 0
gavin-pu/Microphone-Array-postfilter
1
gavin-pu/nara_wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
Language:Python1 0
gavin-pu/odas
ODAS: Open embeddeD Audition System
gavin-pu/odas_web
A desktop visualization GUI for the ODAS library
Language:JavaScript1 0
gavin-pu/online-offline-CGMM-for-MVDR
Offline CGMM and CGMM with spatial prior distribution in an online manner
gavin-pu/pifuhd
High-Resolution 3D Human Digitization from A Single Image.
gavin-pu/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Language:Python1 0
gavin-pu/Sound_Localization_Algorithms
Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.
Language:MATLAB1
gavin-pu/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Language:CSS1 0
gavin-pu/Spherical-Harmonic-Transform
A collection of MATLAB routines for the Spherical Harmonic Transform and related manipulations in the spherical harmonic spectrum.
gavin-pu/Tacotron2-Wavenet-Korean-TTS
Korean TTS, Tacotron2, Wavenet
Language:Python1 0
gavin-pu/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese)
gavin-pu/ue4-mediapipe-plugin
UE4 MediaPipe plugin
gavin-pu/voice-web
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
Language:TypeScript
gavin-pu/voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system