terencewlc

Pinned Repositories

AudioNotes
Quickly extract audio and video content and organize it into structured Markdown notes
Language:Python
ChatTTS
A generative speech model for daily dialogue.
Language:Python
coolify
An open-source & self-hostable Heroku / Netlify / Vercel alternative.
Language:PHP
CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Language:Python
crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Language:Python
DeepLX
Powerful Free DeepL API, No Token Required
Language:Go
draw-a-ui
Draw a mockup and generate HTML for it
Language:TypeScript
dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Language:Python
EasyVtuber
tha3, but runs at 40 fps on a 3080 with virtual webcam support
Language:Python
edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Language:Python

terencewlc's Repositories

terencewlc/DeepLX
Powerful Free DeepL API, No Token Required
Language:Go
terencewlc/FastGPT
FastGPT is a knowledge-based platform built on LLMs. It offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without extensive setup or configuration.
Language:TypeScript
terencewlc/FreeAskInternet
FreeAskInternet is a completely free, private, and locally running search aggregator and answer generator that uses multiple LLMs and needs no GPU. The user asks a question; the system runs a multi-engine search, passes the combined search results to an LLM, and generates an answer based on them. It is entirely free to use.
Language:Python
terencewlc/GLM-4-Voice
GLM-4-Voice | End-to-end Chinese-English speech dialogue model
terencewlc/GPT-SoVITS-V2
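GPT-SoVITS-V2 model with some official PRs merged, including but not limited to: automatic reference-audio filling, subtitle synchronization, and SillyTavern integration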
terencewlc/Hunyuan3D-1
terencewlc/Live2d-model
Live2d model collection
Language:Batchfile
terencewlc/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
terencewlc/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
terencewlc/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), a knowledge base (file upload / knowledge management / RAG), multi-modal features (Vision/TTS), and a plugin system. One-click FREE deployment of your private ChatGPT/Claude application.
terencewlc/melty
Open source AI code editor. To get access to the packaged version:
terencewlc/moondream
tiny vision language model
terencewlc/morphic
An AI-powered search engine with a generative UI
Language:TypeScript
terencewlc/ollama-php
This is a PHP library for Ollama. Ollama is an open-source project that serves as a powerful and user-friendly platform for running LLMs on your local machine. It acts as a bridge between the complexities of LLM technology and the desire for an accessible and customizable AI experience.
terencewlc/OmniParser
A simple screen parsing tool toward a pure-vision-based GUI agent
terencewlc/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
terencewlc/OpenHands
🙌 OpenHands: Code Less, Make More
terencewlc/Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
terencewlc/Perplexica
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.
Language:TypeScript
terencewlc/polyglot
🤖️ Cross-platform AI language practice app
terencewlc/self-hosted-ai-starter-kit
The Self-hosted AI Starter Kit is an open-source template that quickly sets up a local AI environment. Curated by n8n, it provides essential tools for creating secure, self-hosted AI workflows.
terencewlc/SillyTavern
LLM Frontend for Power Users.
terencewlc/simple-evals
terencewlc/stable-diffusion-webui
Stable Diffusion web UI
terencewlc/streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
terencewlc/teable
✨ The Next Gen Airtable Alternative: No-Code Postgres
terencewlc/twilio-php
A PHP library for communicating with the Twilio REST API and generating TwiML.
terencewlc/Ultralight-Digital-Human
An ultra-lightweight digital human model that can run in real time on mobile devices
terencewlc/Umi-OCR
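Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in multilingual libraries.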
terencewlc/VideoChat
Real-time voice-interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, with voice cloning support and first-packet latency as low as 3 seconds.