422339238's Stars
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
richards199999/Thinking-Claude
Let your Claude able to think
ahujasid/blender-mcp
Sanster/IOPaint
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
CopilotKit/open-mcp-client
TencentARC/VideoPainter
Any-length Video Inpainting and Editing with Plug-and-Play Context Control
nanobrowser/nanobrowser
Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.
browser-use/browser-use
Make websites accessible for AI agents
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
yuruotong1/autoMate
Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves
AgentDeskAI/browser-tools-mcp
Monitor browser logs directly from Cursor and other MCP compatible IDEs.
eastlondoner/cursor-tools
Give Cursor Agent an AI Team and Advanced Skills
refly-ai/refly
🎨 Refly is an open-source AI-native creation engine. Its intuitive free-form canvas interface combines multi-threaded dialogues, artifacts, AI knowledge base integration, chrome extension clip & save, contextual memory, intelligent search, WYSIWYG AI editor and more, empowering you to effortlessly transform ideas into production-ready content.
Plachtaa/seed-vc
zero-shot voice conversion & singing voice conversion, with real-time support
bloc97/Anime4K
A High-Quality Real Time Upscaler for Anime Video
k4yt3x/video2x
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
SparkAudio/Spark-TTS
Spark-TTS Inference Code
camel-ai/owl
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
mannaandpoem/OpenManus
No fortress, purely open ground. OpenManus is Coming.
GuijiAI/HeyGem.ai
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
SSShooter/mind-elixir-core
⚗ Mind Elixir is a JavaScript, framework-agnostic mind map core.
penpot/penpot
Penpot: The open-source design tool for design and code collaboration
souzatharsis/podcastfy
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
redotvideo/examples
A collection of example projects built with Revideo
redotvideo/revideo
Create Videos with Code
BandarLabs/gitpodcast
Convert any git repository into an engaging podcast
271374667/VideoFusion
一站式短视频拼接软件 无依赖,点击即用,自动去黑边,自动帧同步,自动调整分辨率,批量变更视频为横屏/竖屏
chrischoy/WhisperChain
Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what you said!
wfql1024/MultiWeChatManager
懒得点?懒得扫码?那就交给它!🛠️ 这是一款专为 微信多开(未来也可以支持其他平台!!) 而设计的 自动化管理工具,支持 多号一键登录、全局多开、自启动登录、防撤回 等功能,是让你省心的好工具!🚀