Seitk's Stars
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Aider-AI/aider
aider is AI pair programming in your terminal
goldbergyoni/javascript-testing-best-practices
📗🌐 🚢 Comprehensive and exhaustive JavaScript & Node.js testing best practices (July 2023)
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
exo-explore/exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
fishaudio/fish-speech
SOTA Open Source TTS
KwaiVGI/LivePortrait
Bring portraits to life!
Doriandarko/claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This framework enables Claude to generate and manage its own tools, continuously expanding its capabilities through conversation. Available both as a CLI and a modern web interface.
budtmo/docker-android
Android-in-Docker solution with noVNC support and video recording
baptisteArno/typebot.io
💬 Typebot is a powerful chatbot builder that you can self-host.
modelscope/agentscope
Start building LLM-empowered multi-agent applications in an easier way.
BuilderIO/ai-shell
A CLI that converts natural language to shell commands.
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Netflix/maestro
Maestro: Netflix’s Workflow Orchestrator
browserbase/stagehand
An AI web browsing framework focused on simplicity and extensibility.
ictnlp/LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
iterative/datachain
ETL, Analytics, Versioning for Unstructured Data
heshengtao/comfyui_LLM_party
LLM Agent Framework in ComfyUI. Includes MCP server, Omost, GPT-sovits, ChatTTS, GOT-OCR2.0, and FLUX prompt nodes, provides access to Feishu and discord, and adapts to all LLMs with openai / aisuite-style interfaces, such as o1, ollama, gemini, grok, qwen, GLM, deepseek, kimi, and doubao. Also adapted to local LLMs, VLMs, and gguf models such as llama-3.3, with graphRAG / RAG linkage.
kkangert/kspider
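Kspider is a crawler platform that lets you define crawling workflows graphically, so you can build a crawler without writing any code. Kspider is not limited to crawling; it can also be used for web automation testing, with more features waiting for you to explore.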
memfreeme/memfree
MemFree - Hybrid AI Search Engine & AI Page Generator
raznem/parsera
Lightweight library for scraping websites with LLMs
nkasmanoff/pi-card
Raspberry Pi Voice Assistant
yeates/PromptFix
[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo
ohayonguy/PMRF
Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration
kkangert/kspider-ui
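Kspider is a crawler platform that lets you define crawling workflows graphically, so you can build a crawler without writing any code. Kspider is not limited to crawling; it can also be used for web automation testing, with more features waiting for you to explore.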
user1342/Tomato
LLM steganography with minimum-entropy coupling - Hiding encrypted messages in natural language.