Pinned Repositories
AudioNotes
Quickly extract audio and video content and organize it into structured Markdown notes
ChatTTS
A generative speech model for daily dialogue.
coolify
An open-source & self-hostable Heroku / Netlify / Vercel alternative.
CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
DeepLX
Powerful Free DeepL API, No Token Required
draw-a-ui
Draw a mockup and generate HTML for it
dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
EasyVtuber
tha3, but runs at 40 fps on a 3080 with virtual webcam support
edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
terencewlc's Repositories
terencewlc/DeepLX
Powerful Free DeepL API, No Token Required
terencewlc/FastGPT
FastGPT is a knowledge-based platform built on LLMs. It offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without extensive setup or configuration.
terencewlc/FreeAskInternet
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generator using MULTI LLMs, with no GPU needed. The user asks a question, and the system runs a multi-engine search, passes the combined search results to an LLM, and generates an answer based on them. It's all FREE to use.
terencewlc/GLM-4-Voice
GLM-4-Voice | End-to-end Chinese-English spoken dialogue model
terencewlc/GPT-SoVITS-V2
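GPT-SoVITS-V2 model with some PRs from the official repository merged in, including but not limited to: automatic reference audio filling, subtitle synchronization, and SillyTavern integration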
terencewlc/Hunyuan3D-1
terencewlc/Live2d-model
Live2d model collection
terencewlc/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
terencewlc/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
terencewlc/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG), Multi-Modals (Vision/TTS) and a plugin system. One-click FREE deployment of your private ChatGPT / Claude application.
terencewlc/melty
Open source AI code editor. To get access to the packaged version:
terencewlc/moondream
tiny vision language model
terencewlc/morphic
An AI-powered search engine with a generative UI
terencewlc/ollama-php
This is a PHP library for Ollama. Ollama is an open-source project that serves as a powerful and user-friendly platform for running LLMs on your local machine. It acts as a bridge between the complexities of LLM technology and the desire for an accessible and customizable AI experience.
terencewlc/OmniParser
A simple screen parsing tool towards pure vision based GUI agent
terencewlc/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) across 100+ datasets.
terencewlc/OpenHands
🙌 OpenHands: Code Less, Make More
terencewlc/Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
terencewlc/Perplexica
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.
terencewlc/polyglot
🤖️ Cross-platform AI language practice app
terencewlc/self-hosted-ai-starter-kit
The Self-hosted AI Starter Kit is an open-source template that quickly sets up a local AI environment. Curated by n8n, it provides essential tools for creating secure, self-hosted AI workflows.
terencewlc/SillyTavern
LLM Frontend for Power Users.
terencewlc/simple-evals
terencewlc/stable-diffusion-webui
Stable Diffusion web UI
terencewlc/streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
terencewlc/teable
✨ The Next Gen Airtable Alternative: No-Code Postgres
terencewlc/twilio-php
A PHP library for communicating with the Twilio REST API and generating TwiML.
terencewlc/Ultralight-Digital-Human
An ultra-lightweight digital human model that can run in real time on mobile devices
terencewlc/Umi-OCR
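OCR software, free, open source, and offline. Supports screenshot capture / batch image import, PDF document recognition, excluding watermarks / headers and footers, and scanning / generating QR codes. Includes built-in multilingual libraries.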
terencewlc/VideoChat
Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with first-packet latency as low as 3 seconds.