Pinned Repositories
Agent-S
Agent S: an open agentic framework that uses computers like a human
agenta
The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
AI-Vtuber
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。
Bert-VITS2
vits2 backbone with bert
browser-use
Open-Source Web Automation library with any LLM
Chat2DB
🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
CleanS2S
High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体!
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
rvspoc-s2311-llama2
Submission repo for s2311. ref: rvspoc.org
jcyao's Repositories
jcyao/Agent-S
Agent S: an open agentic framework that uses computers like a human
jcyao/agenta
The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
jcyao/AI-Vtuber
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。
jcyao/browser-use
Open-Source Web Automation library with any LLM
jcyao/Chat2DB
🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
jcyao/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
jcyao/CleanS2S
High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体!
jcyao/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
jcyao/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
jcyao/fast-voice-assistant
⚡ Insanely fast AI voice assistant with <500ms response times
jcyao/lobe-chat
🤖 Lobe Chat - an open-source, extensible (Function Calling), high-performance chatbot framework. It supports one-click free deployment of your private ChatGPT/LLM web application.
jcyao/FantasticSql-baseline
一个改来改去的baseline
jcyao/FinRobot
FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀
jcyao/FunAudioLLM-APP
jcyao/I-ViT
[ICCV 2023] I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference
jcyao/keyword-spot
端到端语音唤醒工具箱,从模型训练到模型推理。
jcyao/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
jcyao/MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
jcyao/metahuman-stream
Real time interactive streaming digital human
jcyao/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
jcyao/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
jcyao/nlp-engineering
专注于Python/C++/CUDA、ML/DL/RL和NLP/KG/DS/LLM领域的技术分享。
jcyao/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
jcyao/pipecat
Open Source framework for voice and multimodal conversational AI
jcyao/rasa
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
jcyao/robotframework
Generic automation framework for acceptance testing and RPA
jcyao/rtvi-web-demo
Example UI implementing the RTVI web client
jcyao/swift
Fast voice assistant powered by Groq, Cartesia, and Vercel.
jcyao/vad
Voice activity detector (VAD) for the browser with a simple API
jcyao/wewe-rss
🤗更优雅的微信公众号订阅方式,支持私有化部署、微信公众号RSS生成(基于微信读书)v2.x