Pinned Repositories
Comfyui-Aix-NodeMap
ComfyUI's latest node organization and annotations, continuously updated and supported by the AIX team.
nougat-latex-ocr
Codebase for fine-tuning / evaluating nougat-based image2latex generation models
Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
stable-diffusion-webui
Stable Diffusion web UI
TEN-Agent
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API and RTC, featuring weather checks, web search, vision, and RAG capabilities.
Umi-OCR
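OCR software, free, open-source, and offline. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in multilingual libraries.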
video-subtitle-extractor
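A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction.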
ComfyUI-Crystools
A powerful set of tools for ComfyUI
ComfyUI_EchoMimic
You can use EchoMimic in ComfyUI
Universalcow's Repositories
Universalcow/Comfyui-Aix-NodeMap
ComfyUI's latest node organization and annotations, continuously updated and supported by the AIX team.
Universalcow/stable-diffusion-webui
Stable Diffusion web UI
Universalcow/video-subtitle-extractor
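A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction.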
Universalcow/TEN-Agent
TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API and RTC, featuring weather checks, web search, vision, and RAG capabilities.
Universalcow/Umi-OCR
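OCR software, free, open-source, and offline. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in multilingual libraries.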
Universalcow/nougat-latex-ocr
Codebase for fine-tuning / evaluating nougat-based image2latex generation models
Universalcow/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!