kznmft's Stars
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
phidatahq/phidata
Build AI Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
fishaudio/fish-speech
Brand new TTS solution
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
jianchang512/pyvideotrans
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。
LibreTranslate/LibreTranslate
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Plachtaa/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
argosopentech/argos-translate
Open-source offline translation library written in Python
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
pndurette/gTTS
Python library and CLI tool to interface with Google Translate's text-to-speech API
coqui-ai/STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
IAHispano/Applio
A simple, high-quality voice conversion tool focused on ease of use and performance
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
nidhaloff/deep-translator
A flexible free and unlimited python tool to translate between different languages in a simple way using multiple translators.
gabrie30/ghorg
Quickly clone an entire org/users repositories into one directory - Supports GitHub, GitLab, Bitbucket, and more 🐇🥚
savoirfairelinux/num2words
Modules to convert numbers to words. 42 --> forty-two
Helsinki-NLP/Opus-MT
Open neural machine translation models and web services
fe1ixxu/ALMA
State-of-the-art LLM-based translation models.
rhasspy/gruut
A tokenizer, text cleaner, and phonemizer for many human languages.
lucidrains/spear-tts-pytorch
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
domesticatedviking/TextyMcSpeechy
Easily create text-to-speech models in any voice for rhasspy/piper. Make a text-to-speech model with your own voice recordings, or use thousands of RVC voices. Works offline on a Raspberry pi. Rapidly record custom datasets for any metadata.csv file and listen to your model as it is training.
rhasspy/piper-recording-studio
Local voice recording for creating Piper datasets
inlife/nexrender-boilerplate
Boilerplate project for rendering a video using nexrender.
austin-bowen/voicebox
Python text-to-speech library with built-in voice effects and support for multiple TTS engines