MarineHuang's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
ggml-org/llama.cpp
LLM inference in C/C++
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Plugins/Artifacts) and Thinking. One-click FREE deployment of your private ChatGPT/ Claude / DeepSeek application.
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
karanpratapsingh/system-design
Learn how to design systems at scale and prepare for system design interviews
kenjihiranabe/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
davisking/dlib
A toolkit for making real world machine learning and data analysis applications in C++
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
OpenNMT/CTranslate2
Fast inference engine for Transformer models
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
rogersce/cnpy
library to read/write .npy and .npz files in C/C++
NATSpeech/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Edresson/YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
cnlinxi/book-text-to-speech
A book about Text-to-Speech (TTS) in Chinese.
modelscope/FunCodec
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
iflytek/aiges
AI Serving framework loader
paulovcmedeiros/pyRobBot
Chat with GPT LLMs over voice, UI & terminal, all with access to the internet. Powered by OpenAI.
Lecrapouille/zipper
[Lib][Version 2.2.0][Functional] C++ wrapper around minizip compression library
ahk-d/transcribe-video-audio
An OpenAI's Whisper-based full-stack project to transcribe audio and video files using React & Django.
viking-man/IntroventsEnglishCorner
A spoken English education chatbot based on ChatGPT/whsiper and gTTS.社恐人士的英语角
soun059/SpokenLanguageAssessment
A spoken language assessment tool by which you can use your speech to determine how better are you in your english speaking capabalities.