runngezhang's Stars
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference, low VRAM use, fast training, "infinite" ctx_len, and free sentence embedding.
josStorer/RWKV-Runner
An RWKV management and startup tool with full automation, only 8MB. It also provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Jamie-Stirling/RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
xiph/LPCNet
Efficient neural speech synthesis
HuijieL/Ren
Speeches by Ren Zhengfei
LAION-AI/audio-dataset
Audio Dataset for training CLAP and other models
ddlBoJack/Speech-Resources
Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome
Sato-Kunihiko/audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
ddlBoJack/Awesome-Speech-Pretraining
Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.
felixpatzelt/colorednoise
Python package to generate Gaussian (1/f)**beta noise (e.g. pink noise)
RookieJunChen/Inter-SubNet
The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.
YangangCao/TRUNet
Unofficial PyTorch implementation of "Real-Time Denoising and Dereverberation with Tiny Recurrent U-Net"
wangwei2009/spatial-temporal-LCMV
multi-channel microphone array noise reduction
nttcslab/dcase2023_task2_baseline_ae
ddlBoJack/MT4SSL
[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets
DCASE2023-Task7-Foley-Sound-Synthesis/dcase2023_task7_baseline
msalhab96/SNR-Estimation-Using-Deep-Learning
An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning
Okrio/tinyrecurrentunet
Real-Time De-noising and De-reverbing with Tiny Recurrent UNet
drammock/praat-semiauto
Praat scripts for streamlining manual measurements in acoustic analysis
ffxiong/stsubnet
kinggongzilla/DCASE2023_Task2
wilkinghoff/DCASE2023_task2
Submission for task 2 "First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring" of the DCASE challenge 2023 (https://dcase.community/challenge2023/task-first-shot-unsupervised-anomalous-sound-detection-for-machine-condition-monitoring)
marmoi/dcase2023_task4b_baseline
Baseline code for DCASE 2023 task 4 B
msalhab96/Listen-Attend-and-Spell
PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper
msalhab96/audesc
Audesc is an open-source library for descriptive audio analysis and parsing
thinkerchan/renzhengfei
Analysis of Ren Zhengfei's speeches
Moplast/moplast.github.io