SolomonLeon's Stars
ggerganov/llama.cpp
LLM inference in C/C++
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
fishaudio/fish-speech
SOTA Open Source TTS
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
HMCL-dev/HMCL
A Minecraft Launcher which is multi-functional, cross-platform and popular
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
lllyasviel/Paints-UNDO
Understand Human Behavior to Align True Needs
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
EasyTier/EasyTier
A simple, decentralized mesh VPN with WireGuard support.
huanghanzhilian/c-shopping
A beautiful shopping platform developed with Next.js, tailored for various devices including Desktop, Tablet, and Phone. 基于Nextjs开发同时适配Desktop、Tablet、Phone多种设备的精美购物平台
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Spu7Nix/SPWN-language
A language for Geometry Dash triggers
RQLuo/MixTeX-Latex-OCR
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
winterx/color4bg.js
Cool colorful backgrounds, generated by JS
6drf21e/ChatTTS_Speaker
ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview
0x5446/api4sensevoice
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.
lovemefan/SenseVoice.cpp
Port of Funasr's Sense-voice model in C/C++
Dan-wanna-M/formatron
Formatron empowers everyone to control the format of language models' output with minimal overhead.
Jellyfish042/uncheatable_eval
Evaluating LLMs with Dynamic Data
lovemefan/fsmn-vad
A enterprise-grade Voice Activity Detector from modelscope and funasr.
HaringProGit/Revelation
A realistic shaderpack for Minecraft: Java Edition
revolunet/webaudio-wav-stream-player
instantly play remote wav streams using fetch API + WebAudio
eric-ai-lab/Screen-Point-and-Read
Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"
luke358/podcasts_pro
Podcasts Pro is a podcast application. You can use podcast RSS links to subscribe to your favorite podcasts.
lovemefan/CT-Transformer-punctuation
A enterprise-grade Chinese-English code switch punctuator from funasr.
lovemefan/campplus
A open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx
sabin-prisma/kudos
Kudos application