Pinned Repositories
APC-SNR
Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch
CVHomework
计算机视觉作业:基于直方图的自适应阈值分割、利用聚类技术实现纹理图像分割、模板匹配技术、目标跟踪、背景建模、目标检测
DCCRN
implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch
HGCN
The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"
My-notes
:books:学习随笔
NutritionMaster
:fire:菜谱/食谱/针对慢性病的饮食推荐/病情诊断小游戏/菜品识别/卡路里获取
OldPeopleHome
:fire:智能养老院项目
RUI_SE
The official repo of "A Refining Underlying Information Framework for Speech Enhancement"
SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
TFonAndroid
tensorflow在android上的移植:car:(画风迁移和手写数字识别)
wangtianrui's Repositories
wangtianrui/OldPeopleHome
:fire:智能养老院项目
wangtianrui/HGCN
The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"
wangtianrui/APC-SNR
Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch
wangtianrui/My-notes
:books:学习随笔
wangtianrui/MindSpore4Speech
wangtianrui/Audio-Enhancement-via-ONMF
wangtianrui/DPCRN_DNS3
Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"
wangtianrui/FAcodec
Training code for FAcodec presented in NaturalSpeech3
wangtianrui/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
wangtianrui/RUI_SE
The official repo of "A Refining Underlying Information Framework for Speech Enhancement"
wangtianrui/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
wangtianrui/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
wangtianrui/asr_labs
ASR labs
wangtianrui/BigVGAN
Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training
wangtianrui/conditional-flow-matching
wangtianrui/EnCodec_Trainer
wangtianrui/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
wangtianrui/paper2gui
Convert AI papers to GUI,Make it easy and convenient for everyone to use artificial intelligence technology。让每个人都简单方便的使用前沿人工智能技术
wangtianrui/poolformer
PoolFormer: MetaFormer is Actually What You Need for Vision
wangtianrui/ProgRE
wangtianrui/QuadTreeAttention
QuadTree Attention for Vision Transformers (ICLR2022)
wangtianrui/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
wangtianrui/SDDNet
Coarse implement of the paper "A Simultaneous Denoising and Dereverberation Framework with Target Decoupling", On DNS-2020 dataset, the DNSMOS of first stage is 3.42 and second stage is 3.47.
wangtianrui/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
wangtianrui/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
wangtianrui/Uformer
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
wangtianrui/UniSpeech
wangtianrui/vits_chinese
vits chinese, tts chinese, tts mandarin 史上训练最简单,音质最好的语音合成系统,兼容性非常好的合成框架
wangtianrui/VoiceLDM
VoiceLDM: Text-to-Speech with Environmental Context
wangtianrui/voiceldm-data