Pinned Repositories
3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
ai_challenger_mt
AI Challenger 全球AI挑战赛-英中文本机器翻译-baseline(基于tensor2tensor搭建)
albert_zh
海量中文预训练ALBERT模型 Chinese version of ALBERT pre-trained model
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
audacity
Audio Editor
CNN_4_Verifycode
使用Keras搭建CNN模型,破解简单的网页验证码
crf_torch_onnx
可以转成onnx的torch版本的CRF
funNLP
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、历史名人词库、诗词词库、医学词库、饮食词库、法律词库、汽车词库、动物词库、中文聊天语料、中文谣言数据、百度中文问答数据集、句子相似度匹配算法集合、bert资源、文本生成&摘要相关工具、cocoNLP信息抽取工具
ImageClassification
a demo to use CNNs on Image classification tasks in keras
NER
基于tensorflow深度学习的中文的命名实体识别
jin1258804025's Repositories
jin1258804025/crf_torch_onnx
可以转成onnx的torch版本的CRF
jin1258804025/3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
jin1258804025/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
jin1258804025/bark
🔊 Text-Prompted Generative Audio Model
jin1258804025/Bert-VITS2
vits2 backbone with multilingual-bert
jin1258804025/ChatTTS
TTS
jin1258804025/Chinese-Names-Corpus
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
jin1258804025/CosyVoice
LLM based TTS model, providing inference/training/deployment full-stack ability.
jin1258804025/EasyBertVits2
文章から感情豊かな音声を生成する Bert-VITS2 を簡単に使えます。
jin1258804025/fish-speech
Brand new TTS solution
jin1258804025/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。
jin1258804025/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
jin1258804025/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
jin1258804025/MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
jin1258804025/MassTTS
a TTS demo for training new characters.
jin1258804025/megatts2
Unoffical implement of Megatts2
jin1258804025/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
jin1258804025/mistral-finetune
jin1258804025/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
jin1258804025/parler-tts
Inference and training library for high-quality TTS models.
jin1258804025/polyphone
Chinese polyphone disambiguation for Text-to-Speech application
jin1258804025/pycantonese
Cantonese Linguistics and NLP
jin1258804025/sherpa-onnx
Speech-to-text and text-to-speech using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go
jin1258804025/StyleTTS
Official Implementation of StyleTTS
jin1258804025/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
jin1258804025/tts-frontend-dataset
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
jin1258804025/Viphoneme
Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA
jin1258804025/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
jin1258804025/wetts
Production First and Production Ready End-to-End Text-to-Speech Toolkit
jin1258804025/yue2latex
Convert Cantonese to IPA or Pinyin or LaTeX format;将粤语转换为IPA或拼音或latex格式