kznmft

kznmft's Stars

langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language:TypeScript51.3k 367 4.8k7.4k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python35.3k 294 1.1k4.3k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python34.1k 210 5.2k4.2k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python29.9k 243 5.2k4.5k
phidatahq/phidata
Build AI Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.
Language:Python15k 110 2452.1k
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
Language:TypeScript14.6k 106 3011.4k
fishaudio/fish-speech
Brand new TTS solution
Language:Python14.3k 97 4071.1k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language:Python12.4k 137 7091.3k
jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，同时支持语音识别转录、语音合成、字幕翻译。
Language:Python10.7k 69 5681.2k
LibreTranslate/LibreTranslate
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
Language:Python9.6k 93 408872
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language:Python8k 50 01.1k
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Language:Jupyter Notebook7.5k 78 189554
Plachtaa/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Language:Python4.7k 41 566712
argosopentech/argos-translate
Open-source offline translation library written in Python
Language:Python3.9k 53 267282
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language:Python2.4k 62 170264
pndurette/gTTS
Python library and CLI tool to interface with Google Translate's text-to-speech API
Language:Python2.3k 67 211362
coqui-ai/STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Language:C++2.3k 62 183277
IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance
Language:Python1.8k 22 432285
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
1.7k 42 21227
nidhaloff/deep-translator
A flexible free and unlimited python tool to translate between different languages in a simple way using multiple translators.
Language:Python1.6k 23 154183
gabrie30/ghorg
Quickly clone an entire org/users repositories into one directory - Supports GitHub, GitLab, Bitbucket, and more 🐇🥚
Language:Go1.6k 25 175167
savoirfairelinux/num2words
Modules to convert numbers to words. 42 --> forty-two
Language:Python825 38 207498
Helsinki-NLP/Opus-MT
Open neural machine translation models and web services
Language:Python616 17 8471
fe1ixxu/ALMA
State-of-the-art LLM-based translation models.
Language:Ruby425 13 6235
rhasspy/gruut
A tokenizer, text cleaner, and phonemizer for many human languages.
Language:Python279 8 3736
lucidrains/spear-tts-pytorch
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
Language:Python257 29 619
domesticatedviking/TextyMcSpeechy
Easily create text-to-speech models in any voice for rhasspy/piper. Make a text-to-speech model with your own voice recordings, or use thousands of RVC voices. Works offline on a Raspberry pi. Rapidly record custom datasets for any metadata.csv file and listen to your model as it is training.
Language:Shell234 5 118
rhasspy/piper-recording-studio
Local voice recording for creating Piper datasets
Language:JavaScript95 4 1623
inlife/nexrender-boilerplate
Boilerplate project for rendering a video using nexrender.
Language:JavaScript59 8 1518
austin-bowen/voicebox
Python text-to-speech library with built-in voice effects and support for multiple TTS engines
Language:Python16 2 53