seasky0524's Stars
datvodinh/rag-chatbot
Chat with multiple PDFs locally
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—foundation models
jianjianai/ms-copilot-play
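A Cloudflare Worker acceleration service for Microsoft Copilot. Microsoft Copilot is a powerful AI based on OpenAI GPT-4 that can use Bing search to answer questions. With a simple deployment, you get fast access from within China to nearly all features of the authentic Microsoft Copilot: chat, notebook, plugins, image generation, sharing, and more.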
yakami129/VirtualWife
VirtualWife is a virtual digital human project that supports Bilibili live streaming and works with openai and ollama
WEIFENG2333/AsrTools
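✨ AsrTools: Smart speech-to-text tool | Efficient batch processing | User-friendly interface | No GPU required | Supports SRT/TXT output | Turn your audio into accurate text in an instant!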
Doriandarko/o1-engineer
o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalities such as code generation, file editing, and project planning to streamline your development workflow.
3b1b/manim
Animation engine for explanatory math videos
imfile-io/imfile-desktop
A full-featured download manager.
abus-aikorea/voice-pro
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech, and Translation.
CoatiSoftware/Sourcetrail
Sourcetrail - free and open-source interactive source explorer
raphael-baena/DTLR
Handwritten Text Recognition and Character Detection
Relsoul/whisper-win-gui
Real-time speech recognition based on whisper, with web and desktop clients
lamm-mit/PDF2Audio
yamadashy/repomix
📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, and Gemini.
punnerud/Local_Knowledge_Graph
roboflow/supervision
We write your reusable computer vision tools. 💜
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
YUANZHUO-BNU/metahuman_overview
A curated collection of digital human resources
Kedreamix/PaddleAvatar
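Have you ever imagined interacting with your own virtual human? With PaddleAvatar, you can turn your images, audio, and video into a realistic digital human video and interact with it. PaddleAvatar is a digital human generation tool built on the PaddlePaddle deep learning framework; using many of Paddle's toolkits, it synthesizes your digital images, audio, and video into a realistic digital human video. PaddleAvatar also supports further development, for example using natural language processing to turn the digital human video into a complete human-computer interaction system, so that you can hold real conversations and interactions with your virtual self. You can use PaddleAvatar digital human videos in many settings, such as games, education, and virtual reality. PaddleAvatar gives you a digital world for free creation where your imagination can run free!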
fishaudio/fish-speech
Brand new TTS solution
1234567Yang/cf-proxy-ex
Cloudflare super proxy: a free serverless proxy built on a Cloudflare Worker. Includes a DuckDuckGo proxy (usable for AI chat, including GPT-4o/Claude 3), an OpenAI/ChatGPT proxy, GitHub acceleration, and an online web proxy.
PowerHouseMan/ComfyUI-AdvancedLivePortrait
starpig1129/AI-Data-Analysis-MultiAgent
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, data analysis, visualization, and report writing. Perfect for researchers and data scientists seeking to enhance their workflow and productivity.
yunwei37/Prompt-Engineering-Guide-zh-CN
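🐙 A comprehensive collection of guides, papers, lectures, notebooks, and resources on prompt engineering (updated automatically and continuously)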
QuivrHQ/quivr
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
pontusab/cursor.directory
Find the best cursor rules for your framework and language
Huanshere/VideoLingo
Netflix-level subtitle cutting, translation, alignment, and even dubbing: a one-click, fully automated AI video subtitle team
baihengaead/wifi-crack-tool
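WiFi password brute-force cracking tool with a graphical interface; supports WPA/WPA2/WPA3, multiple concurrent instances, automatic cracking, custom password lists, and automatic generation of password dictionaries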
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image