terencewlc

HK

terencewlc's Stars

raoenhui/live2d-example
看板娘案例
Language:HTML6018
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Language:Python4.5k474
cline/cline
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Language:TypeScript21.9k1.9k
JusperLee/SonicSim
Language:Python21325
openai/simple-evals
Language:Python2.2k186
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
Language:Python43.2k4.8k
Jonseed/ComfyUI-Detail-Daemon
A port of muerrilla's sd-webui-Detail-Daemon as a node for ComfyUI, to adjust sigmas that control detail.
Language:Python49615
microsoft/OmniParser
A simple screen parsing tool towards pure vision based GUI agent
Language:Jupyter Notebook5.5k430
vikhyat/moondream
tiny vision language model
Language:Jupyter Notebook6.6k528
CyberAgentAILab/TANGO
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
Language:Python983114
anliyuan/Ultralight-Digital-Human
一个超轻量级、可以在移动端实时运行的数字人模型
Language:Python1.4k211
OrionStarAI/Orion
Orion-14B is a family of models includes a 14B foundation LLM, and a series of models: a chat model, a long context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model. Orion-14B 系列模型包括一个具有140亿参数的多语言基座大模型以及一系列相关的衍生模型，包括对话模型，长文本模型，量化模型，RAG微调模型，Agent微调模型等。
Language:Python79157
THUDM/GLM-4-Voice
GLM-4-Voice | 端到端中英语音对话模型
Language:Python2.6k206
ArdaGnsrn/ollama-php
This is a PHP library for Ollama. Ollama is an open-source project that serves as a powerful and user-friendly platform for running LLMs on your local machine. It acts as a bridge between the complexities of LLM technology and the desire for an accessible and customizable AI experience.
Language:PHP8512
pengzhendong/streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
Language:Python15518
SpenserCai/ComfyUI-FunAudioLLM
Comfyui custom node for FunAudioLLM include CosyVoice and SenseVoice
Language:Python645
microsoft/BitNet
Official inference framework for 1-bit LLMs
Language:C++12.6k880
liou666/polyglot
🤖️ Cross-platform AI language practice app （跨平台AI语言练习应用）
Language:TypeScript2.6k271
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Language:Python2.7k185
OwO-Network/DeepLX
Powerful Free DeepL API, No Token Required
Language:Go7k554
llmware-ai/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
Language:Python8.4k1.5k
harry0703/AudioNotes
快速提取音视频内容，整理成一份结构化的markdown笔记
Language:Python1.2k132
v3ucn/GPT-SoVITS-V2
GPT-SoVITS-V2模型，合并了官方的一些PR，包含但不限于:参考音频自动填充，字幕同步，SillyTavern酒馆接入等功能
Language:Python7910
SillyTavern/SillyTavern
LLM Frontend for Power Users.
Language:JavaScript9.4k2.6k
Eikanya/Live2d-model
Live2d model collection
Language:Mathematica2.2k708
pudding0503/live2d-package
一个 Live2D 模型端 Web 端整合包与使用教程。
Language:JavaScript134
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language:Python9.7k1.3k
abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation(UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
Language:Python2.5k187
CerebriumAI/examples
Examples for Cerebrium Serverless GPUs
Language:Python45461
FriendsOfPHP/Goutte
Goutte, a simple PHP Web Scraper
Language:PHP9.3k1k