Pinned Repositories
About-Mel-feature
Agently
🚀 A fast way to build LLM agent-based applications. 🤵 A lightweight framework that helps developers create amazing LLM-based applications. 🎭 You can use it to easily create an LLM-based agent instance with a role set and memory. ⚙️ You can use an Agently agent instance just like an async function and put it anywhere in your code.
aidatatang_200zh
Aidatatang_200zh is an open source Chinese Mandarin speech corpus released by DataTang Technology Co., Ltd (www.datatang.com).
Alibaba-MIT-Speech
Alibaba speech technology
audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
awesome-deep-learning-music
List of articles related to deep learning applied to music
bark
🔊 Text-prompted Generative Audio Model
modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
PaddleSpeech
Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.
paddlespeech_tts_cpp
PaddleSpeech TTS cpp
lym0302's Repositories
lym0302/paddlespeech_tts_cpp
PaddleSpeech TTS cpp
lym0302/bark
🔊 Text-prompted Generative Audio Model
lym0302/PaddleSpeech
Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.
lym0302/Agently
🚀 A fast way to build LLM agent-based applications. 🤵 A lightweight framework that helps developers create amazing LLM-based applications. 🎭 You can use it to easily create an LLM-based agent instance with a role set and memory. ⚙️ You can use an Agently agent instance just like an async function and put it anywhere in your code.
lym0302/aidatatang_200zh
Aidatatang_200zh is an open source Chinese Mandarin speech corpus released by DataTang Technology Co., Ltd (www.datatang.com).
lym0302/audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
lym0302/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
lym0302/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
lym0302/Bert-VITS2
vits2 backbone with multilingual-bert
lym0302/CommonCode
Save some common code
lym0302/deep-clustering
deep clustering method for single-channel speech separation
lym0302/deepcluster
Deep Clustering for Unsupervised Learning of Visual Features
lym0302/DeepClustering
Deep Clustering
lym0302/DeepSpeech
A TensorFlow implementation of Baidu's DeepSpeech architecture
lym0302/docker-kaldi-gstreamer-server
Dockerfile for kaldi-gstreamer-server.
lym0302/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
lym0302/FastGPT
FastGPT is a knowledge-based question answering system built on the LLM. It offers out-of-the-box data processing and model invocation capabilities. Moreover, it allows for workflow orchestration through Flow visualization, thereby enabling complex question and answer scenarios!
lym0302/git_test
lym0302/HarvestText
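Text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.) using unsupervised or weakly supervised methods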
lym0302/KWS_RUIM
Scripts used for the KWS project
lym0302/masr
Chinese speech recognition with pretrained models and high recognition accuracy; Mandarin Automatic Speech Recognition
lym0302/MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
lym0302/NMFLibrary
MATLAB library for non-negative matrix factorization (NMF): Version 1.8.0
lym0302/Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
lym0302/pytorch-lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
lym0302/resample
Resample audio from 8k to 16k, or to other sample rates
lym0302/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
lym0302/ttsdemo.github.io
lym0302/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
lym0302/Wave-U-Net
Implementation of the Wave-U-Net for audio source separation