Alexw1111's Stars
alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
wangzhaode/mnn-llm
LLM deployment project based on MNN.
patchy631/ai-engineering-hub
fishaudio/text-labeler
A simple SVS labeling tool
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
coaidev/coai
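🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 A next-generation one-stop AI solution for B2B and B2C use. Supports OpenAI, Midjourney, Claude, iFlytek Spark, Stable Diffusion, DALL·E, ChatGLM, Tongyi Qianwen, Tencent Hunyuan, 360 Zhinao, Baichuan AI, Volcano Ark, New Bing, Gemini, Moonshot, and other models. Features conversation sharing, custom presets, cloud sync, a model marketplace, elastic billing and subscription plans, image parsing, web search, model caching, and a rich, polished admin console with dashboard statistics.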
songquanpeng/one-api
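OpenAI API management & distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot (Wenxin Yiyan), iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys, ships as a single executable with a prebuilt Docker image, and deploys in one click, ready to use out of the box. It uses a single API for all LLMs and features an English UI.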
JusperLee/Apollo
Music repair method to convert lossy MP3 compressed music to lossless music.
KimberleyJensen/Mel-Band-Roformer-Vocal-Model
KitsuneX07/ComfyMSS
LetterLiGo/SafeEar
SafeEar: Content Privacy-Preserving Audio Deepfake Detection (Accepted by CCS 2024)
gpt-omni/mini-omni
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
Bin-Huang/chatbox
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
Golevka2001/Stereographic-Projection-of-Otto
Obtain multiple forms of otto via stereographic projection
etched-ai/open-oasis
Inference script for Oasis 500M
city96/ComfyUI-GGUF
GGUF Quantization support for native ComfyUI models
OpenT2S/LlamaVoice
LlamaVoice is a llama-based large voice generation model that supports both inference and training.
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
altera-al/project-sid
Refound-445/nonebot-plugin-nailongremove
Just for learning.
THUDM/GLM-4-Voice
GLM-4-Voice | End-to-end Chinese-English spoken dialogue model
taurusxin/ncmdump
Convert Netease Cloud Music ncm files to mp3/flac files.
Majjcom/ncmppGui
An extremely fast GUI tool for ncm conversion, written in C++
openai/sparse_autoencoder
SUC-DriverOld/MSST-WebUI
A WebUI for Music Source Separation Training inference, with UVR bundled in as well!
kyutai-labs/moshi
rhasspy/piper
A fast, local neural text to speech system
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
commaai/openpilot
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.