terencewlc's Stars
raoenhui/live2d-example
Website mascot (kanban musume) example
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
cline/cline
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
JusperLee/SonicSim
openai/simple-evals
All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
Jonseed/ComfyUI-Detail-Daemon
A port of muerrilla's sd-webui-Detail-Daemon as a node for ComfyUI, to adjust sigmas that control detail.
microsoft/OmniParser
A simple screen parsing tool for pure vision-based GUI agents
vikhyat/moondream
tiny vision language model
CyberAgentAILab/TANGO
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
anliyuan/Ultralight-Digital-Human
An ultra-lightweight digital human model that can run in real time on mobile devices
OrionStarAI/Orion
Orion-14B is a family of models that includes a multilingual foundation LLM with 14 billion parameters and a series of derived models: a chat model, a long-context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model.
THUDM/GLM-4-Voice
GLM-4-Voice | End-to-end Chinese-English spoken dialogue model
ArdaGnsrn/ollama-php
This is a PHP library for Ollama. Ollama is an open-source project that serves as a powerful and user-friendly platform for running LLMs on your local machine. It acts as a bridge between the complexities of LLM technology and the desire for an accessible and customizable AI experience.
pengzhendong/streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
SpenserCai/ComfyUI-FunAudioLLM
ComfyUI custom node for FunAudioLLM, including CosyVoice and SenseVoice
microsoft/BitNet
Official inference framework for 1-bit LLMs
liou666/polyglot
🤖️ Cross-platform AI language practice app
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
OwO-Network/DeepLX
Powerful Free DeepL API, No Token Required
llmware-ai/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
harry0703/AudioNotes
Quickly extracts audio and video content and organizes it into structured Markdown notes
v3ucn/GPT-SoVITS-V2
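GPT-SoVITS-V2 model with some official PRs merged, including but not limited to: automatic reference audio filling, subtitle synchronization, and SillyTavern integration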
SillyTavern/SillyTavern
LLM Frontend for Power Users.
Eikanya/Live2d-model
Live2D model collection
pudding0503/live2d-package
A web integration package and usage tutorial for Live2D models.
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
abus-aikorea/voice-pro
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube downloading, vocal isolation (UVR5), Text-to-Speech (Edge-TTS), and multi-language translation. Perfect for content creators and developers.
CerebriumAI/examples
Examples for Cerebrium Serverless GPUs
FriendsOfPHP/Goutte
Goutte, a simple PHP Web Scraper