vyoz's Stars
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
babysor/MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
hiroi-sora/Umi-OCR
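Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in libraries for multiple languages.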
vbenjs/vue-vben-admin
A modern vue admin panel built with Vue3, Shadcn UI, Vite, TypeScript, and Monorepo. It's fast!
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
ente-io/ente
FOSS, End-to-End Encrypted Cloud
postalserver/postal
📮 A fully featured open source mail delivery platform for incoming & outgoing e-mail
eosphoros-ai/DB-GPT
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
voicepaw/so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
getumbrel/umbrel
A beautiful home server OS for self-hosting with an app store. Buy a pre-built Umbrel Home with umbrelOS, or install on a Raspberry Pi or any x86 system.
andrewyng/translation-agent
collabora/WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
signalwire/freeswitch
FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile software implementation that runs on any commodity hardware. From a Raspberry Pi to a multi-core server, FreeSWITCH can unlock the telecommunications potential of any device.
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
linyiLYi/bilibot
A local chatbot fine-tuned by bilibili user comments.
seetafaceengine/SeetaFace2
SeetaFace 2: open source, full stack face recognition toolkit.
Delta-ML/delta
DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/
UKPLab/EasyNMT
Easy to use, state-of-the-art Neural Machine Translation for 100+ languages
1inch/shieldy
@shieldy_bot Telegram bot repository
winterx/color4bg.js
Cool colorful backgrounds, generated by JS
howl-anderson/Chinese_models_for_SpaCy
Models for SpaCy that support Chinese
ai/audio-recorder-polyfill
MediaRecorder polyfill to record audio in Edge and Safari
chenkui164/FastASR
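A project that implements ASR inference in C++. It has few dependencies, is simple to install, and runs inference quickly; it also runs smoothly on ARM platforms such as the Raspberry Pi 4B. The supported models are optimized from Google's Transformer model, and the training data is either the open-source wenetspeech dataset (10,000+ hours) or an Alibaba private dataset (60,000+ hours), so recognition quality is good and comparable to many commercial ASR products.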
speechio/BigCiDian
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
ShawnHymel/ei-keyword-spotting
swsamleo/MLSTGCN
Graph Neural Network
zkmkarlsruhe/language-identification
Spoken Language Identification on Common Voice and AudioSet using Deep Learning