runngezhang

runngezhang's Stars

BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language:Python12.4k 133 204844
josStorer/RWKV-Runner
A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
Language:TypeScript5.1k 43 356484
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language:Jupyter Notebook5k 61 375325
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
Language:Python4.1k 40 394293
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Language:C1.8k 105 158299
Jamie-Stirling/RetNet
An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"
Language:Python1.2k 13 26100
xiph/LPCNet
Efficient neural speech synthesis
Language:C1.1k 72 197295
HuijieL/Ren
任正非讲话
1k 74 0368
LAION-AI/audio-dataset
Audio Dataset for training CLAP and other models
Language:Python615 21 5753
ddlBoJack/Speech-Resources
语音方向实验室/公司/资源/实习等，欢迎推荐或自荐
484 20 1762
Sato-Kunihiko/audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
Language:Python213 9 674
ddlBoJack/Awesome-Speech-Pretraining
Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.
196 13 013
felixpatzelt/colorednoise
Python package to generate Gaussian (1/f)**beta noise (e.g. pink noise)
Language:Python190 4 719
RookieJunChen/Inter-SubNet
The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.
Language:Python91 4 812
YangangCao/TRUNet
unofficial PyTorch implementation of 《REAL-TIME DENOISING AND DEREVERBERATION WTIH TINY RECURRENT U-NET》
Language:Python88 3 620
wangwei2009/spatial-temporal-LCMV
multi-channel microphone array noise reduction
Language:MATLAB57 2 214
nttcslab/dcase2023_task2_baseline_ae
Language:Python53 4 615
ddlBoJack/MT4SSL
[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets
Language:Python42 4 23
DCASE2023-Task7-Foley-Sound-Synthesis/dcase2023_task7_baseline
Language:Python32 1 07
msalhab96/SNR-Estimation-Using-Deep-Learning
An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning
Language:Jupyter Notebook29 1 15
Okrio/tinyrecurrentunet
Real-Time De-noising and De-reverbing with Tiny Recurrent UNet
Language:Python29 1 012
drammock/praat-semiauto
Praat scripts for streamlining manual measurements in acoustic analysis
25 4 017
ffxiong/stsubnet
19 2 32
kinggongzilla/DCASE2023_Task2
Language:Python18 3 03
wilkinghoff/DCASE2023_task2
Submission for task 2 "First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring" of the DCASE challenge 2023 (https://dcase.community/challenge2023/task-first-shot-unsupervised-anomalous-sound-detection-for-machine-condition-monitoring)
Language:Python14 1 03
marmoi/dcase2023_task4b_baseline
Baseline code for DCASE 2023 task 4 B
Language:Python13 1 73
msalhab96/Listen-Attend-and-Spell
PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper
Language:Python10 2 02
msalhab96/audesc
Audesc is an open-source library for descriptive Audio analysis, and parsing
Language:Python2 1 01
thinkerchan/renzhengfei
speech analysis of renzhengfei
Language:JavaScript2 2 03
Moplast/moplast.github.io
Language:CSS1 1 04