vyoz

vyoz's Stars

donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Language:Python290k 6.7k 32248.3k
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language:TypeScript75.3k 514 6.9k11k
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language:Python46.8k 451 9.4k8k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python41.4k 232 1.5k4.6k
babysor/MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
Language:Python35.9k 305 8865.2k
hiroi-sora/Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。
Language:Python29.9k 163 6543k
vbenjs/vue-vben-admin
A modern vue admin panel built with Vue3, Shadcn UI, Vite, TypeScript, and Monorepo. It's fast!
Language:Vue26.9k 255 3.1k7.3k
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language:Python26.6k 180 1294.9k
ente-io/ente
FOSS, End-to-End Encrypted Cloud
Language:Dart18.1k 54 997986
postalserver/postal
📮 A fully featured open source mail delivery platform for incoming & outgoing e-mail
Language:Ruby15.2k 213 1.7k1.1k
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
Language:Python14.9k 124 1.3k2k
voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
Language:Python8.9k 67 3691.2k
getumbrel/umbrel
A beautiful home server OS for self-hosting with an app store. Buy a pre-built Umbrel Home with umbrelOS, or install on a Raspberry Pi or any x86 system.
Language:TypeScript8.4k 90 964578
andrewyng/translation-agent
Language:Python5.2k 54 18628
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
Language:Jupyter Notebook4k 76 112221
signalwire/freeswitch
FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile software implementation that runs on any commodity hardware. From a Raspberry PI to a multi-core server, FreeSWITCH can unlock the telecommunications potential of any device.
Language:C3.9k 146 1.4k1.5k
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Language:Python3.4k 60 110345
linyiLYi/bilibot
A local chatbot fine-tuned by bilibili user comments.
Language:Python3.2k 21 29368
seetafaceengine/SeetaFace2
SeetaFace 2: open source, full stack face recognization toolkit.
Language:C++2.2k 104 105626
Delta-ML/delta
DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/
Language:Python1.6k 66 75288
UKPLab/EasyNMT
Easy to use, state-of-the-art Neural Machine Translation for 100+ languages
Language:Python1.2k 19 94121
1inch/shieldy
@shieldy_bot Telegram bot repository
Language:TypeScript885 41 116263
winterx/color4bg.js
Cool colorful backgrounds, generated by JS
Language:JavaScript721 5 1259
howl-anderson/Chinese_models_for_SpaCy
SpaCy 中文模型 | Models for SpaCy that support Chinese
Language:Jupyter Notebook661 31 37111
ai/audio-recorder-polyfill
MediaRecorder polyfill to record audio in Edge and Safari
Language:JavaScript589 18 6476
chenkui164/FastASR
这是一个用C++实现ASR推理的项目，它依赖很少，安装也很简单，推理速度很快，在树莓派4B等ARM平台也可以流畅的运行。支持的模型是由Google的Transformer模型中优化而来，数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时)，所以识别效果也很好，可以媲美许多商用的ASR软件。
Language:C499 24 7077
speechio/BigCiDian
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
Language:Python256 8 156
ShawnHymel/ei-keyword-spotting
Language:C167 13 1352
swsamleo/MLSTGCN
Graph Neural Network
Language:Python76 5 79
zkmkarlsruhe/language-identification
Spoken Language Identification on Common Voice and AudioSet using Deep Learning
Language:Python37 4 57