cslovewl's Stars
guobao2333/DeepLX-Serverless
DeepL Free API for Serverless
snailuncle/autojsDemo
VikParuchuri/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
kijai/ComfyUI-LivePortraitKJ
ComfyUI nodes for LivePortrait
KwaiVGI/LivePortrait
Bring portraits to life!
yinkaisheng/Python-UIAutomation-for-Windows
🐍Python 3 wrapper of Microsoft UIAutomation. Support UIAutomation for MFC, WindowsForm, WPF, Modern UI(Metro UI), Qt, IE, Firefox, Chrome ...
openatx/uiautomator2
Android Uiautomator2 Python Wrapper
lucasg/Dependencies
A rewrite of the old legacy software "depends.exe" in C# for Windows devs to troubleshoot DLL load dependency issues.
IVGSZ/Flash-VStream
This is the official implementation of "Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams"
ZHO-ZHO-ZHO/ComfyUI-AuraSR-ZHO
AuraSR in ComfyUI for img & video
ZHO-ZHO-ZHO/ComfyUI-UltraEdit-ZHO
ComfyUI UltraEdit(Diffusers)
niedev/RTranslator
Open source real-time translation app for Android that runs locally
PeterH0323/Streamer-Sales
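Streamer-Sales (Top Seller): a sales-livestreamer LLM 🛒🎁 that presents products based on their given features, with the aim of sparking the viewer's desire to buy. 🚀⭐ Includes a detailed data generation pipeline❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, ASR (speech-to-text) 🎙️, a Vue-based frontend 🍍, a FastAPI backend 🗝️, and Docker Compose packaging and deployment 🐋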
TeamWiseFlow/wiseflow
Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and uploads them to the database.
unilei/image-watermark-tool
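A tool that adds watermarks to images on your local device. Images are never sent to any server; all processing happens in the local browser. Well suited to protecting sensitive documents (such as ID cards, driver's licenses, and passports).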
pipecat-ai/pipecat
Open Source framework for voice and multimodal conversational AI
DigitalPhonetics/IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages!
CosmosShadow/gptpdf
Using GPT to parse PDF
vscode-reborn-ai/vscode-reborn-ai
Code with AI in VSCode, but you get to choose the AI.
cdb-boop/ComfyUI-Bringing-Old-Photos-Back-to-Life
Bringing Old Photos Back to Life in ComfyUI.
worm128/AI-YinMei
AI YinMei: an AI livestream host (VTuber)
DAMO-NLP-SG/WebDesignAgent
WebDesignAgent : Towards Effortless Website Creation
FareedKhan-dev/AI-text-to-video-model-from-scratch
In this blog, we will build a small scale text-to-video model from scratch. We will input a text prompt, and our trained model will generate a video based on that prompt.
fishaudio/fish-speech
Brand new TTS solution
T8RIN/ImageToolbox
🖼️ Image Toolbox is a powerful app for advanced image manipulation. It offers dozens of features, from basic tools like crop and draw to filters, OCR, and a wide range of image processing options
BinNong/meet-libai
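Li Bai 👤 was an outstanding poet of the Tang dynasty whose poetry holds an important place in literary history. In recent years, with the rapid development of digital technology and artificial intelligence, the ways traditional culture is popularized and promoted have also faced innovation and change. Research on Li Bai's poetry, in China and abroad, is already quite thorough, but its digital and intelligent popularization still falls short. This project therefore aims to build a Li Bai knowledge graph, combine it with large-model training to produce a specialized AI agent, and promote Li Bai's culture in the form of a generative conversational application.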
CH563/shot-easy-website
Take screenshots online and compress images in the browser with WebAssembly
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
run-llama/llama_deploy
Deploy your agentic workflows to production
Kwai-Kolors/Kolors
Kolors Team