Pinned Repositories
23_3090_speakerfilter_new_deepfilter_final_1024_new
4x_superresolution_cnn
Investigation in 4x Super-resolution by Deep Convolutional Neural Networks
A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch
aac-datasets
Audio Captioning datasets for PyTorch.
abseil-cpp
Abseil Common Libraries (C++)
AcoustDSP
Acoustic DSP is a Python library which aims to implement several digital signal processing algorithms regarding acoustic analysis in one place.
Acoustic-Beamforming-Advanced
Scan-frequency Version for Acoustic Imaging, including the following methods: DAS, MUSIC, DAMAS, DAMAS2, DAMAS-FISTA, CLEAN-PSF, CLEAN-SC, FFT-NNLS, and FFT-DFISTA...
acoustic-interference-cancellation
acoustic interference (echo) cancellation project in summer internship
ERNIE
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
webrtc-3
webrtc ns aecm agc vad run on linux
runngezhang-jx's Repositories
runngezhang-jx/AudioFile
A simple C++ library for reading and writing audio files.
runngezhang-jx/basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
runngezhang-jx/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
runngezhang-jx/cvxpy
A Python-embedded modeling language for convex optimization problems.
runngezhang-jx/fucking-algorithm
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
runngezhang-jx/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
runngezhang-jx/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
runngezhang-jx/Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
runngezhang-jx/libsndfile
A C library for reading and writing sound files containing sampled audio data.
runngezhang-jx/lite.ai.toolkit
🛠 A lite C++ toolkit of awesome AI models, support ONNXRuntime, MNN, TNN, NCNN and TensorRT.
runngezhang-jx/MOSA-Net-Cross-Domain
runngezhang-jx/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
runngezhang-jx/nltk
NLTK Source
runngezhang-jx/noise-suppression-for-voice
Noise suppression plugin based on Xiph's RNNoise
runngezhang-jx/onnx
Open standard for machine learning interoperability
runngezhang-jx/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
runngezhang-jx/Phase-aware-Deep-Complex-UNet
[Not Official] Implementation DC-UNet, ICLR 2019
runngezhang-jx/pycorrector
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。
runngezhang-jx/QMixCAT
runngezhang-jx/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
runngezhang-jx/SEMamba
This is the official implementation of the SEMamba paper.
runngezhang-jx/SenseVoice
Multilingual Voice Understanding Model
runngezhang-jx/SonicSim
runngezhang-jx/speechbrain
A PyTorch-based Speech Toolkit
runngezhang-jx/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
runngezhang-jx/spleeter
Deezer source separation library including pretrained models.
runngezhang-jx/streamlit
Streamlit — A faster way to build and share data apps.
runngezhang-jx/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
runngezhang-jx/tree-of-thought-puzzle-solver
The Tree of Thoughts (ToT) framework for solving complex reasoning tasks using LLMs
runngezhang-jx/umap
Uniform Manifold Approximation and Projection