SolomonLeon

SolomonLeon's Stars

ggerganov/llama.cpp
LLM inference in C/C++
Language:C++69.8k 557 4.2k10.1k
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Language:TypeScript56.3k 392 5.5k8.3k
fishaudio/fish-speech
SOTA Open Source TTS
Language:Python17.8k 110 4721.3k
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
Language:Python14.9k 105 175950
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.9k 106 612906
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python8.9k 85 642856
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
Language:Shell7.5k 42 772460
HMCL-dev/HMCL
A Minecraft Launcher which is multi-functional, cross-platform and popular
Language:Java7.1k 94 2.2k689
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Language:Python5.6k 36 590471
lllyasviel/Paints-UNDO
Understand Human Behavior to Align True Needs
Language:Python3.6k 20 64318
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Language:Python3.2k 101 123289
EasyTier/EasyTier
A simple, decentralized mesh VPN with WireGuard support.
Language:Rust2.2k 28 255214
huanghanzhilian/c-shopping
A beautiful shopping platform developed with Next.js, tailored for various devices including Desktop, Tablet, and Phone. 基于Nextjs开发同时适配Desktop、Tablet、Phone多种设备的精美购物平台
Language:JavaScript2.1k 25 9315
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Language:Python1.6k 17 34149
Spu7Nix/SPWN-language
A language for Geometry Dash triggers
Language:Rust1.1k 20 6663
RQLuo/MixTeX-Latex-OCR
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
Language:Python984 2 2456
winterx/color4bg.js
Cool colorful backgrounds, generated by JS
Language:JavaScript623 5 1246
6drf21e/ChatTTS_Speaker
ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview
Language:Python567 6 1130
0x5446/api4sensevoice
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.
Language:Python272 5 1641
lovemefan/SenseVoice.cpp
Port of Funasr's Sense-voice model in C/C++
Language:C196 4 2513
Dan-wanna-M/formatron
Formatron empowers everyone to control the format of language models' output with minimal overhead.
Language:Python169 1 206
Jellyfish042/uncheatable_eval
Evaluating LLMs with Dynamic Data
Language:Jupyter Notebook73 2 24
lovemefan/fsmn-vad
A enterprise-grade Voice Activity Detector from modelscope and funasr.
Language:Python69 3 47
HaringProGit/Revelation
A realistic shaderpack for Minecraft: Java Edition
Language:GLSL672
revolunet/webaudio-wav-stream-player
instantly play remote wav streams using fetch API + WebAudio
Language:JavaScript43 4 911
eric-ai-lab/Screen-Point-and-Read
Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"
Language:Python24 2 02
luke358/podcasts_pro
Podcasts Pro is a podcast application. You can use podcast RSS links to subscribe to your favorite podcasts.
Language:Dart20 1 14
lovemefan/CT-Transformer-punctuation
A enterprise-grade Chinese-English code switch punctuator from funasr.
Language:Python19 1 22
lovemefan/campplus
A open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx
Language:Python9 2 12
sabin-prisma/kudos
Kudos application
Language:TypeScript21