thunn's Stars
AppFlowy-IO/AppFlowy
Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.
nocodb/nocodb
🔥 🔥 🔥 Open Source Airtable Alternative
minio/minio
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT/ Claude application.
coollabsio/coolify
An open-source & self-hostable Heroku / Netlify / Vercel alternative.
makeplane/plane
🔥 🔥 🔥 Open Source JIRA, Linear, Monday, and Asana Alternative. Plane helps you track your issues, epics, and product roadmaps in the simplest way possible.
dokku/dokku
A docker-powered PaaS that helps you build and manage the lifecycle of applications
frappe/erpnext
Free and Open Source Enterprise Resource Planning (ERP)
instantdb/instant
Instant is a modern Firebase. We make you productive by giving your frontend a real-time database.
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
goshops-com/clipshare
An incredibly simple, open-source alternative to Loom that only requires S3-compatible storage—no servers needed
yl4579/PL-BERT
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
nyrahealth/CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
IIEleven11/StyleTTS2FineTune
supertone-inc/super-monotonic-align
zhenye234/xcodec
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
nii-yamagishilab/mos-finetune-ssl
sarulab-speech/UTMOSv2
UTokyo-SaruLab MOS Prediction System
xi-j/Mamba-TasNet
jzmzhong/Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
myshell-ai/DreamVoice
3loi/NaturalVoices
MTG/tape
TAPE: An End-to-End Timbre-Aware Pitch Estimator
iamanigeeit/present
IIEleven11/AudioDatasetMaker
Uses deepgram/whisper/custom models to create an LJSpeech dataset for voice model fine tuning