ymzlygw

AIGC Engineer in Japan, major in VC, TTS, NLP..etc.

CTWTokyo

Pinned Repositories

-ControlNet-Poses-pose-depot
A collection of ControlNet poses.
Language:Astro00
-SAMplayground
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
Language:Python0 0 00
3d-photo-inpainting
[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting
Language:Python0 0 00
AI-Paint-Tool-krita-ai-diffusion
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
Language:Python0 0 00
animate-your-word
Official implementations for paper: Dynamic Typography: Bringing Text to Life via Video Diffusion Prior
Language:Python00
awesome-pretrained-stylegan2
A collection of pre-trained StyleGAN 2 models to download
Language:Python1 0 00
espnet
End-to-End Speech Processing Toolkit
Language:Python1 0 00
face_recognition
The world's simplest facial recognition api for Python and the command line
Language:Python1 0 00
stylegan2_justtinpinkney
StyleGAN2 - Official TensorFlow Implementation with practical improvements
Language:Python1 0 00

ymzlygw's Repositories

ymzlygw/face_recognition
The world's simplest facial recognition api for Python and the command line
Language:Python1 0 00
ymzlygw/-ControlNet-Poses-pose-depot
A collection of ControlNet poses.
Language:Astro00
ymzlygw/animate-your-word
Official implementations for paper: Dynamic Typography: Bringing Text to Life via Video Diffusion Prior
Language:Python00
ymzlygw/AnyText
ymzlygw/B-LoRA-for-style-content
Implicit Style-Content Separation using B-LoRA
ymzlygw/CartoonSegmentation
Instance segmentation for cartoon/anime characters and some visual techniques building around it.
ymzlygw/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
ymzlygw/coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
ymzlygw/Depth-Anything
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
ymzlygw/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
ymzlygw/EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
ymzlygw/excalidraw-canvas-html
Virtual whiteboard for sketching hand-drawn like diagrams
Language:TypeScript0 0
ymzlygw/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
ymzlygw/IMAGDressing
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing
ymzlygw/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
ymzlygw/InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
ymzlygw/LLaMA-Factory-for-finetune
Unify Efficient Fine-Tuning of 100+ LLMs
ymzlygw/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
ymzlygw/manga-color-BasicPBC
Official Implementation of "Learning Inclusion Matching for Animation Paint Bucket Colorization"
ymzlygw/MMD--SystemAnimatorOnline
XR Animator, AI-based Full Body Motion Capture and Extended Reality (XR) solution, powered by System Animator Online
Language:JavaScript
ymzlygw/OCR-SAM
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting
ymzlygw/ollama-local-startup-llm-model
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
ymzlygw/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
ymzlygw/sketch2manga
Apply screentone to line drawings or colored illustrations with diffusion models.
ymzlygw/stable-diffusion-webui-2024-2-13
Stable Diffusion web UI
Language:Python
ymzlygw/style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
Language:Python0 0
ymzlygw/ToonCrafter
a research paper for generative cartoon interpolation
Language:Python0 0
ymzlygw/ultimate-upscale-for-automatic1111
ymzlygw/Upscale-all-models-database
An open and free database for AI models
ymzlygw/UpScale-APISR
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)