chuck-ma

chuck-ma's Stars

sayakpaul/diffusers-torchao
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
Language:Python2057
lllyasviel/IC-Light
More relighting!
Language: Python · Stars: 4.9k · Forks: 333
alimama-creative/FLUX-Controlnet-Inpainting
Language:Python23315
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Language: Python · Stars: 7.8k · Forks: 722
TIGER-AI-Lab/AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks"
Language:Jupyter Notebook46234
cjh0613/tencent-sensitive-words
Tencent's offline sensitive-word list
Stars: 1.1k · Forks: 245
instantX-research/InstantStyle-Plus
InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation 🔥
Language:Python392
yuanyang1991/birefnet_tensorrt
BiRefNet Inference using tensorrt
Language:Python3
ZhengPeng7/BiRefNet
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Language: Python · Stars: 1.1k · Forks: 81
Fanghua-Yu/SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
Language: Python · Stars: 4.3k · Forks: 375
GuijiAI/ReHiFace-S
Real Time High-Fidelity Faceswap
Language:Python24153
eolinker/apinto
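An API gateway written in Go. It supports a range of plugins, can be extended by users, and works plug-and-play. It also helps enterprises manage API services quickly and improves their stability and security.
Language: Go · Stars: 1.4k · Forks: 201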
KwaiVGI/LivePortrait
Bring portraits to life!
Language: Python · Stars: 12k · Forks: 1.3k
YaoFANGUK/video-subtitle-remover
AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos, producing output files at the original resolution. Runs locally; no third-party API is required.
Language: Python · Stars: 4k · Forks: 526
GuijiAI/duix.ai
Language: C++ · Stars: 4.5k · Forks: 647
clash-verge-rev/clash-verge-rev
Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)
Language: TypeScript · Stars: 34.1k · Forks: 2.6k
Kwai-Kolors/Kolors
Kolors Team
Language: Python · Stars: 3.7k · Forks: 237
palxiao/poster-design
A beautiful, powerful online poster designer and image editor, modeled on Gaoding Design (稿定设计). It suits many scenarios: poster generation, e-commerce product images, long article images, and video or WeChat official-account covers.
Language: Vue · Stars: 3.6k · Forks: 561
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language: Python · Stars: 5.2k · Forks: 529
BadToBest/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language: Python · Stars: 2.5k · Forks: 300
lovell/sharp
High performance Node.js image processing, the fastest module to resize JPEG, PNG, WebP, AVIF and TIFF images. Uses the libvips library.
Language: JavaScript · Stars: 29k · Forks: 1.3k
fishaudio/fish-speech
Brand new TTS solution
Language: Python · Stars: 12.7k · Forks: 955
andrewyng/translation-agent
Language: Python · Stars: 4.7k · Forks: 533
zh-plus/openlrc
Transcribe and translate audio into LRC subtitle files using Whisper and LLMs (GPT, Claude, et al.).
Language:Python43828
2noise/ChatTTS
A generative speech model for daily dialogue.
Language: Python · Stars: 31.1k · Forks: 3.4k
Eddycrack864/Ultimate-Vocal-Remover-5.6-for-Google-Colab
Ultimate Vocal Remover for Google Colab
Language:Python3611
MC-E/ReVideo
Language:Python3078
YaoFANGUK/video-subtitle-extractor
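A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating SRT files. Text recognition runs locally; no third-party API is required. It is a deep-learning-based subtitle extraction framework that covers subtitle region detection and subtitle content extraction.
Language: Python · Stars: 5.8k · Forks: 639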
Tencent/HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Language: Python · Stars: 3.3k · Forks: 285
idiap/coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python38233