AppalachianWine

S2S | TTS | ASR | KWS

Beijing, China

Pinned Repositories

algorithm-visualizer
:fireworks:Interactive Online Platform that Visualizes Algorithms from Code
Language: JavaScript
Algorithm_Interview_Notes-Chinese
2018/2019 / campus recruitment / spring recruitment / autumn recruitment / algorithms / Machine Learning / Deep Learning / Natural Language Processing (NLP) / C/C++ / Python / interview notes
Language: Python
aliyunpanshare
Aliyun Drive film and TV resource sharing; the latest TV series, variety shows, and movies are posted daily.
alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language: Jupyter Notebook
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python
audio-preprocess
Preprocess Audio for training
Language: Python
audio_crnn
Language: Python
free-programming-books-zh_CN
:books: Free Chinese-language computer programming books; contributions welcome
interview_internal_reference
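Latest 2019 roundup of technical interview questions from Alibaba, Tencent, Baidu, Meituan, Toutiao, and others, with answers and a compiled analysis by expert question setters.
Language: Python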
WebRtcNs_WavtoPcm
WebRTC_NS: noise suppression module ported from WebRTC.
Language: C

AppalachianWine's Repositories

AppalachianWine/aliyunpanshare
Aliyun Drive film and TV resource sharing; the latest TV series, variety shows, and movies are posted daily.
AppalachianWine/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language: Jupyter Notebook
AppalachianWine/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language: Python
AppalachianWine/audio-preprocess
Preprocess Audio for training
Language: Python
AppalachianWine/bilibot
A local chatbot fine-tuned by bilibili user comments.
AppalachianWine/bpemb
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
AppalachianWine/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Language: Jupyter Notebook
AppalachianWine/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
AppalachianWine/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
AppalachianWine/g2p-mix
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English
AppalachianWine/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python
AppalachianWine/GPTs
leaked prompts of GPTs
AppalachianWine/LangSegment
A multilingual (97 languages) tool for automatic recognition and segmentation of text content. It is intended for TTS and segments mixed-language text automatically.
AppalachianWine/LASER
Language-Agnostic SEntence Representations
AppalachianWine/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
AppalachianWine/LLMForEverybody
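LLM knowledge sharing that everyone can understand; a must-read before spring/autumn recruitment LLM interviews, so you can talk confidently with your interviewer.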
AppalachianWine/Make-An-Audio-2
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
Language: Python
AppalachianWine/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
AppalachianWine/multipa
Universal multilingual automatic speech transcription into IPA
AppalachianWine/my-tv
Android TV live-streaming app with built-in live sources.
AppalachianWine/open-tts-tracker
AppalachianWine/parler-tts
Inference and training library for high-quality TTS models.
Language: Python
AppalachianWine/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Language: Python
AppalachianWine/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
AppalachianWine/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
AppalachianWine/TeleSpeech-ASR
AppalachianWine/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
AppalachianWine/vector-quantize-pytorch
Vector (and Scalar) Quantization, in Pytorch
AppalachianWine/Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
AppalachianWine/WeTextProcessing
Text Normalization & Inverse Text Normalization