Pinned Repositories
A-real-time-time-domain-speech-enhancement-model
AEC-ANS-AGC
AEC/ANS/AGC from WebRTC
asteroid
The PyTorch-based audio source separation toolkit for researchers. Current highlight: WHAMR results are now available.
AudioBSS
Blind source separation of audio recordings
awesome-vad
A curated list of awesome voice activity detection resources
Chinese-Synonyms
Chinese Synonyms: a toolkit for looking up Chinese synonyms
chinese_keybert
Minimal Chinese keyword extraction with BERT
essentia
C++ library for audio and music analysis, description and synthesis, including Python bindings
Realtime_AudioDenoise_EchoCancellation
Speech-Separation-Paper-Tutorial
A must-read paper list for speech separation based on neural networks
ROAD2018's Repositories
ROAD2018/Awesome-Speech-Pretraining
Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.
ROAD2018/awesome-talking-head-generation
ROAD2018/basic-pitch
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
ROAD2018/Bert-VITS2
VITS2 backbone with BERT
ROAD2018/CapsWriter-Offline
A simple but handy offline version of CapsWriter, a voice input tool for PC
ROAD2018/ccmusic-database.github.io
This is a multi-functional music data sharing platform for academic research. It contains a range of music data, such as audio recordings of traditional Chinese musical instruments and annotations for Chinese pop music, all freely available to MIR researchers.
ROAD2018/Chinese-Word-Vectors
100+ pre-trained Chinese word vectors
ROAD2018/cJSON
Ultralightweight JSON parser in ANSI C
ROAD2018/CLAP
Contrastive Language-Audio Pretraining
ROAD2018/cobra
On-device voice activity detection (VAD) powered by deep learning
ROAD2018/curl
A command line tool and library for transferring data with URL syntax, supporting DICT, FILE, FTP, FTPS, GOPHER, GOPHERS, HTTP, HTTPS, IMAP, IMAPS, LDAP, LDAPS, MQTT, POP3, POP3S, RTMP, RTMPS, RTSP, SCP, SFTP, SMB, SMBS, SMTP, SMTPS, TELNET, TFTP, WS and WSS. libcurl offers a myriad of powerful features
ROAD2018/fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
ROAD2018/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
ROAD2018/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as PTTS and music generation.
ROAD2018/funNLP
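Chinese and English sensitive words, language detection, Chinese and international mobile/phone number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, traditional/simplified Chinese conversion, English approximation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of run-together English text, various Chinese word vectors, company name collection, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, historical figures lexicon, poetry lexicon, medical lexicon, food and drink lexicon, legal lexicon, automotive lexicon, animal lexicon, Chinese chat corpus, Chinese rumor data, Baidu Chinese question-answering dataset, collection of sentence similarity matching algorithms, BERT resources, text generation and summarization tools, cocoNLP information extraction tool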
ROAD2018/keras-attention
Keras Attention Layer (Luong and Bahdanau scores).
ROAD2018/KnowledgeGraph
Knowledge graph: building a knowledge graph from scratch
ROAD2018/LiveChat
Code and Dataset for the paper "LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming" ACL 2023
ROAD2018/m3u8_To_MP4
Python downloader for saving m3u8 videos to local MP4 files.
ROAD2018/NeMo
NeMo: a toolkit for conversational AI
ROAD2018/noisy-student-training-asr
PyTorch implementation of Noisy Student Training for automatic speech recognition and automatic pronunciation error detection
ROAD2018/pyannote-onnx
ROAD2018/pyWhat
🐸 Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell you what it is! 🧙♀️
ROAD2018/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
ROAD2018/sematch
Semantic similarity framework for knowledge graphs
ROAD2018/SpeechMOS
Easy-to-Use Speech MOS predictors
ROAD2018/uie_pytorch
PyTorch implementation of the PaddleNLP UIE model
ROAD2018/vad1
Voice activity detector (VAD) for the browser with a simple API
ROAD2018/VisionGPT2
Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.
ROAD2018/Websocket-2
WSServer is a fast, configurable, and extendable WebSocket Server for UNIX systems written in C (C11).