Suntion3's Stars
langchain-ai/langsmith-cookbook
LLM-Red-Team/doubao-free-api
🚀 豆包大模型逆向API【特长:超强联网搜索】,零配置部署,多路token支持,仅供测试,如需商用请前往官方开放平台。
AIDC-AI/ali-langengine
Alibaba LangEngine is an AI application development framework written in Java.
Shubhamsaboo/awesome-llm-apps
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.
google-gemini/cookbook
Examples and guides for using the Gemini API
google-gemini/multimodal-live-api-web-console
A react-based starter app for using the Multimodal Live API over websockets with Gemini
MLT-OSS/open-assistant-api
The Open Assistant API is a ready-to-use, open-source, self-hosted agent/gpts orchestration creation framework, supporting customized extensions for LLM, RAG, function call, and tools capabilities. It also supports seamless integration with the openai/langchain sdk.
datastax/astra-assistants-api
Drop in replacement for the OpenAI Assistants API
ahmad2b/postbot3000
PostBot 3000 is an open-source project that shows how to build a powerful AI agent and stream responses and generate artifacts. This project makes it easier for anyone looking to implement similar solutions.
nirbar1985/ai-travel-agent
AI Travel Agent
Henry-23/VideoChat
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.
fishaudio/fish-speech
SOTA Open Source TTS
adorosario/openai-realtime-with-customgpt-poc
POC Using OpenAI Realtime API with CustomGPT for RAG And Twilio Voice
ALucek/openai-realtime-rag
Fork of OpenAI's Realtime Console, adapted for RAG
zhayujie/chatgpt-on-wechat
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
Tencent-RTC/trtc-conversation-ai-example
trtc conversation ai exammple
notedit/trtc-ai-api-check
trtc ai api check
openai/openai-realtime-console
React app for inspecting, building and debugging with the Realtime API
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
facebookresearch/spiritlm
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
kyutai-labs/moshi
AgoraIO-Community/Agora-AIGCService-Example
78/xiaozhi-esp32
Build your own AI friend
THUDM/GLM-4-Voice
GLM-4-Voice | 端到端中英语音对话模型
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
openai/openai-realtime-api-beta
Node.js + JavaScript reference client for the Realtime API (beta)
livekit/agents
Build real-time multimodal AI applications 🤖🎙️📹
TEN-framework/TEN-Agent
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatible with popular workflow platforms like Dify and Coze.
dsa/fast-voice-assistant
⚡ Insanely fast AI voice assistant with <500ms response times
datawhalechina/tiny-universe
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe