Pinned Repositories
adetailer
Auto detecting, masking and inpainting with detection model.
AI-Vtuber
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain(本地/llm)/chatglm/text-generation-webui/闻达/千问/kimi】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。
aimoneyhunter
ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English version for more insights.
chineseocr_lite
超轻量级中文ocr,支持竖排文字识别, 支持ncnn推理 , dbnet(1.7M) + crnn(6.3M) + anglenet(1.5M) 总模型仅10M
CloudGamePlatform
云电脑云游戏平台整体解决方案,支持windows、Android、OSX/IOS平台
DocTr-ncnn
ncnn demo of (文档矫正)DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction
fastllm
纯c++实现,无第三方依赖的大模型库,支持CUDA加速,目前支持国产大模型ChatGLM-6B,MOSS; 可以在安卓设备上流畅运行ChatGLM-6B
ncnn-android-styletransfer
The style transfer android example
Pluralistic-Inpainting
CVPR 2019: "Pluralistic Image Completion"
Realtime-Voice-Clone-Chinese
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
hubin858130's Repositories
hubin858130/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
hubin858130/chinese-llm-benchmark
中文大模型能力评测榜单:目前已囊括115个大模型,覆盖chatgpt、gpt4o、百度文心一言、阿里通义千问、讯飞星火、商汤senseChat、minimax等商用模型, 以及百川、qwen2、glm4、yi、书生internLM2、llama3等开源大模型,多维度能力评测。不仅提供能力评分排行榜,也提供所有模型的原始输出结果!
hubin858130/claude-dev
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, and more with your permission every step of the way.
hubin858130/ComfyUI-CogVideoX-MZ
CogVideoX-5B 4-bit quantization model
hubin858130/ComfyUI-Fluxtapoz
Nodes for image juxtaposition for Flux in ComfyUI
hubin858130/comfyui-liveportrait
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control
hubin858130/ComfyUI-Molmo
Generate detailed image descriptions and analysis using Molmo models in ComfyUI.
hubin858130/ComfyUI-UniAnimate-W
图片跳舞
hubin858130/continue
⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains
hubin858130/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
hubin858130/DeepFakeDefenders
Image forgery recognition algorithm
hubin858130/DH_live
每个人都能用的数字人
hubin858130/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
hubin858130/GLM-4-Voice
GLM-4-Voice | 端到端中英语音对话模型
hubin858130/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
hubin858130/h5-Dooring
H5 Page Maker, H5 Editor, LowCode. Make H5 as easy as building blocks. | 让H5制作像搭积木一样简单, 轻松搭建H5页面, H5网站, PC端网站,LowCode平台.
hubin858130/In-Context-LoRA
Official repository of In-Context LoRA for Diffusion Transformers
hubin858130/JoyHallo
JoyHallo: Digital human model for Mandarin
hubin858130/MaskGCT-ComfyUI
TTS
hubin858130/melty
Open source AI code editor. To download the packaged app:
hubin858130/MiniMates
The fastest digital human algorithm, now on your desktop.
hubin858130/moshi
语音交互
hubin858130/podlm-public
hubin858130/Rope-Live
Customized fork of Rope Deepfake software featuring live streaming capabilities and support for Deepfacelive models
hubin858130/RTranslator
Open source real-time translation app for Android that runs locally
hubin858130/sd-scripts
lora训练
hubin858130/SDXL_EcomID_ComfyUI
换脸
hubin858130/seed-vc
State-of-the-Art zero-shot voice conversion & singing voice conversion with in context learning
hubin858130/TANGO
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
hubin858130/void
AICoder