Pinned Repositories
01
The open-source language model computer
ADeus
An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI.
agent-zero
Agent Zero AI framework
aidea
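AIdea is an all-in-one app that supports GPT as well as Chinese large language models such as Tongyi Qianwen and Wenxin Yiyan, plus Stable Diffusion text-to-image, image-to-image, SDXL 1.0, super-resolution, and image colorization.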
aidea-docker
This project is a one-click deployment package for the AIdea project, based on docker compose.
aidea-server
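AIdea is an all-in-one app that supports GPT as well as Chinese large language models such as Tongyi Qianwen and Wenxin Yiyan, plus Stable Diffusion text-to-image, image-to-image, SDXL 1.0, super-resolution, and image colorization.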
aifs
Local semantic search. Stupidly simple.
alibabacloud-bailian-speech-demo
Sample Repository for the AlibabaCloud Bailian Speech SDK
api4sensevoice
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.
autogen
A programming framework for agentic AI. Join our Discord: https://discord.gg/pAbnFJrkgZ
zhengwayne's Repositories
zhengwayne/agent-zero
Agent Zero AI framework
zhengwayne/alibabacloud-bailian-speech-demo
Sample Repository for the AlibabaCloud Bailian Speech SDK
zhengwayne/api4sensevoice
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.
zhengwayne/awesome-llm-apps
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini, and open-source models.
zhengwayne/BiRefNet
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
zhengwayne/ChatTTS-ui
A simple local web interface that uses ChatTTS to synthesize text into speech and also exposes an API for external use.
zhengwayne/ComfyUI-segment-anything-2
ComfyUI nodes to use segment-anything-2
zhengwayne/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
zhengwayne/esp32-camera
zhengwayne/ESP32_AI_LLM
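This project uses the ESP32 and ESP32-S3 to connect to large models such as iFlytek Spark, Doubao, and ChatGPT for voice conversation. It supports voice wake-up, continuous dialogue, and music playback, and an attached display shows the conversation content in real time.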
zhengwayne/fairseq2
FAIR Sequence Modeling Toolkit 2
zhengwayne/flux
Official inference repo for FLUX.1 models
zhengwayne/GitHub-Chinese-Top-Charts
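:cn: GitHub Chinese Top Charts, with separate "Software | Resources" rankings for each language to pinpoint good Chinese projects. Take what you need and learn efficiently.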
zhengwayne/GraphRAG-Ollama-UI
GraphRAG using Ollama with Gradio UI and Extra Features
zhengwayne/HokkienTranslation
zhengwayne/IMAGDressing
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing
zhengwayne/IOPaint
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
zhengwayne/maybe
The OS for your personal finances
zhengwayne/MinerU
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
zhengwayne/MixTeX
MixTeX: multimodal LaTeX, Chinese-English (ZhEn), and table OCR. It performs efficient CPU-based inference locally and offline on Windows.
zhengwayne/openai-realtime-console
React app for inspecting, building and debugging with the Realtime API
zhengwayne/pipecat
Open Source framework for voice and multimodal conversational AI
zhengwayne/public-apis
A collective list of free APIs
zhengwayne/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
zhengwayne/RealtimeTTS
Converts text to speech in realtime
zhengwayne/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
zhengwayne/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
zhengwayne/VideoLingo
Netflix-level subtitle cutting, translation, alignment, and even dubbing - a one-click, fully automated AI video subtitle team
zhengwayne/voiceapi
Streaming ASR and TTS based on FastAPI + sherpa-onnx
zhengwayne/wiseflow
Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and uploads them to the database.