Pinned Repositories
23_3090_speakerfilter_new_deepfilter_final_1024_new
4x_superresolution_cnn
Investigation in 4x Super-resolution by Deep Convolutional Neural Networks
A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
A minimal unofficial implementation of "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch
aac-datasets
Audio Captioning datasets for PyTorch.
abseil-cpp
Abseil Common Libraries (C++)
AcoustDSP
Acoustic DSP is a Python library which aims to implement several digital signal processing algorithms regarding acoustic analysis in one place.
Acoustic-Beamforming-Advanced
Scan-frequency Version for Acoustic Imaging, including the following methods: DAS, MUSIC, DAMAS, DAMAS2, DAMAS-FISTA, CLEAN-PSF, CLEAN-SC, FFT-NNLS, and FFT-DFISTA...
acoustic-interference-cancellation
Acoustic interference (echo) cancellation project from a summer internship
ERNIE
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
webrtc-3
WebRTC NS, AECM, AGC, and VAD running on Linux
runngezhang-jx's Repositories
runngezhang-jx/AP-BWE
Towards Efficient and High-Quality Bandwidth Extension with Parallel Amplitude-Phase Prediction
runngezhang-jx/awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
runngezhang-jx/basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
runngezhang-jx/dlrover
DLRover: An Automatic Distributed Deep Learning System
runngezhang-jx/DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
runngezhang-jx/DTLN_pytorch
Dual-signal Transformation LSTM Network, PyTorch, NCNN
runngezhang-jx/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
runngezhang-jx/fastRAG
Efficient Retrieval Augmentation and Generation Framework
runngezhang-jx/FSPEN2
runngezhang-jx/gpuRIR
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
runngezhang-jx/IF
runngezhang-jx/KWS_NLTM
runngezhang-jx/llm-action
This project aims to share the technical principles behind large models, along with hands-on experience.
runngezhang-jx/mlx-examples
Examples in the MLX framework
runngezhang-jx/MoeVoiceStudio
An audio processing application written in C++
runngezhang-jx/mvdrpf
runngezhang-jx/odas
ODAS: Open embeddeD Audition System
runngezhang-jx/pybind11
Seamless operability between C++11 and Python
runngezhang-jx/pytest
The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
runngezhang-jx/python
Boost.org python module
runngezhang-jx/room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
runngezhang-jx/silero-vad2
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
runngezhang-jx/SRP-DNN
A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
runngezhang-jx/tf1-phase-aware-speech-enhancement
TF1 implementation of 'Phase-Aware Speech Enhancement with Deep Complex U-Net' paper
runngezhang-jx/TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning.
runngezhang-jx/txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
runngezhang-jx/VideoGPT
runngezhang-jx/WavLM-DIHARD3
runngezhang-jx/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
runngezhang-jx/zhconv
Simple conversion and localization between simplified and traditional Chinese using tables from MediaWiki.