mmediting: A Python repository from OpenMMLab

OpenMMLab website | OpenMMLab platform


🚀 What's New

New release MMEditing v1.0.0rc7 [07/04/2023]:

Support DiffuserWrapper.
Support ControlNet (training and inference).
Support PyTorch 2.0 (33+ models compile successfully with the 'inductor' backend).

MMEditing now supports all the tasks, models, metrics, and losses in MMGeneration and unifies the interfaces of all components based on MMEngine 😍.

Please refer to changelog.md for details and release history.

Please refer to the migration documents to migrate from MMEditing 0.x to the new 1.x version.

🔝Back to top

📖 Introduction

MMEditing is an open-source image and video editing and generation toolbox based on PyTorch. It is a part of the OpenMMLab project.

Currently, MMEditing supports multiple image and video generation and editing tasks.


The main branch is best used with Python 3.8+ and PyTorch 1.9+.

✨ Major features

State of the Art

MMEditing provides state-of-the-art generative models to process, edit and synthesize images and videos.

Powerful and Popular Applications

MMEditing supports popular and contemporary image restoration, text-to-image, 3D-aware generation, inpainting, matting, super-resolution and generation applications. Specifically, MMEditing supports GAN interpolation, GAN projection, GAN manipulation and many other popular GAN applications. It's time to play with your GANs!

New Modular Design for Flexible Combination

We decompose the editing framework into different modules, so one can easily construct a customized editor framework by combining them. Specifically, a new design for complex loss modules is proposed for customizing the links between modules, which allows flexible combinations among different modules. (Tutorial for losses)

Efficient Distributed Training

With the support of MMSeparateDistributedDataParallel, distributed training for dynamic architectures can be easily implemented.
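
To make the modular design described above more concrete, here is a minimal sketch of config-driven composition in plain Python. It is a simplified stand-in for illustration only, not MMEngine's or MMEditing's actual implementation: the `Registry` class, the `MODULES` instance, and the `Upsampler` and `Editor` classes are all hypothetical names invented for this example.

```python
from typing import Any, Callable, Dict, List


class Registry:
    """A toy name-to-class registry (illustrative only, not MMEngine's Registry)."""

    def __init__(self, name: str) -> None:
        self.name = name
        self._modules: Dict[str, Callable[..., Any]] = {}

    def register(self, cls):
        """Class decorator that records `cls` under its class name."""
        self._modules[cls.__name__] = cls
        return cls

    def build(self, cfg: Dict[str, Any]):
        """Instantiate the class named by cfg['type'] with the remaining keys."""
        kwargs = dict(cfg)  # copy so the caller's config is not mutated
        cls = self._modules[kwargs.pop("type")]
        return cls(**kwargs)


MODULES = Registry("modules")


@MODULES.register
class Upsampler:
    """Hypothetical module: multiplies a size by a fixed scale."""

    def __init__(self, scale: int) -> None:
        self.scale = scale

    def __call__(self, size: int) -> int:
        return size * self.scale


@MODULES.register
class Editor:
    """Hypothetical container: builds its stages from nested configs."""

    def __init__(self, stages: List[Dict[str, Any]]) -> None:
        self.stages = [MODULES.build(stage) for stage in stages]

    def __call__(self, size: int) -> int:
        for stage in self.stages:
            size = stage(size)
        return size


# Swapping or reordering modules only requires editing this config.
config = dict(
    type="Editor",
    stages=[dict(type="Upsampler", scale=2), dict(type="Upsampler", scale=4)],
)
editor = MODULES.build(config)
print(editor(16))  # 16 * 2 * 4 = 128
```

The point of the pattern is that a new combination of modules is expressed as data (the `config` dictionary) rather than as new code, which is the kind of flexibility the paragraph above refers to.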


🙌 Contributing

More and more community contributors are joining us to make our repo better. Some recent projects contributed by the community include:

GLIDE is contributed by @Taited.
Restormer is contributed by @AlexZou14.
SwinIR is contributed by @Zdafeng.

Projects is open to make it easier for everyone to add projects to MMEditing.

We appreciate all contributions to improve MMEditing. Please refer to CONTRIBUTING.md in MMCV and CONTRIBUTING.md in MMEngine for more details about the contributing guidelines.


🛠️ Installation

MMEditing depends on PyTorch, MMEngine and MMCV. Below are quick steps for installation.

Step 1. Install PyTorch following official instructions.

Step 2. Install MMCV with MIM.

pip3 install openmim
# wait for more pre-compiled pkgs to release
mim install 'mmcv>=2.0.0'

Step 3. Install MMEditing from source.

git clone https://github.com/open-mmlab/mmediting.git
cd mmediting
pip3 install -e .

Please refer to installation for more detailed instructions.
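
After installing, it can be useful to confirm that the environment meets the minimum versions mentioned in this README (PyTorch 1.9+ and MMCV 2.0.0+). The following is a minimal sketch using only the Python standard library. It assumes the packages were installed under the distribution names `torch` and `mmcv`; the helper names `numeric_prefix`, `meets_minimum`, and `check_environment` are hypothetical and not part of MMEditing. It compares only the leading numeric part of each version, so a pre-release such as `2.0.0rc4` is treated as `2.0.0`.

```python
import re
from importlib import metadata

# Minimums taken from this README; distribution names are assumptions.
REQUIREMENTS = {"torch": "1.9", "mmcv": "2.0.0"}


def numeric_prefix(version: str) -> tuple:
    """Return the leading dotted integers, e.g. '2.0.0rc4' -> (2, 0, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    return tuple(int(part) for part in match.group().split(".")) if match else ()


def meets_minimum(installed: str, minimum: str) -> bool:
    """Compare numeric prefixes, padding the shorter one with zeros."""
    a, b = numeric_prefix(installed), numeric_prefix(minimum)
    width = max(len(a), len(b))
    a = a + (0,) * (width - len(a))
    b = b + (0,) * (width - len(b))
    return a >= b


def check_environment(requirements=REQUIREMENTS) -> dict:
    """Map each distribution name to 'ok', 'missing', or 'too old (<version>)'."""
    report = {}
    for name, minimum in requirements.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        if meets_minimum(installed, minimum):
            report[name] = "ok"
        else:
            report[name] = f"too old ({installed})"
    return report


if __name__ == "__main__":
    for name, status in check_environment().items():
        print(f"{name}: {status}")
```

A report of `missing` for either entry suggests repeating the corresponding installation step above.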

Getting Started

Please see quick run and inference for the basic usage of MMEditing.


📊 Model Zoo

Supported algorithms

Conditional GANs: SNGAN/Projection GAN (ICLR'2018), SAGAN (ICML'2019), BIGGAN/BIGGAN-DEEP (ICLR'2018)
Unconditional GANs: DCGAN (ICLR'2016), WGAN-GP (NeurIPS'2017), LSGAN (ICCV'2017), PGGAN (ArXiv'2017), PGGAN (ICLR'2018), SinGAN (ICCV'2019), StyleGANV1 (CVPR'2019), StyleGANV2 (CVPR'2019), StyleGANV3 (NeurIPS'2021)
Image Restoration: SwinIR (ICCVW'2021), NAFNet (ECCV'2022), Restormer (CVPR'2022)
Image Super-Resolution: SRCNN (TPAMI'2015), SRResNet&SRGAN (CVPR'2016), EDSR (CVPR'2017), ESRGAN (ECCV'2018), RDN (CVPR'2018), DIC (CVPR'2020), TTSR (CVPR'2020), GLEAN (CVPR'2021), LIIF (CVPR'2021), Real-ESRGAN (ICCVW'2021)
Video Super-Resolution: EDVR (CVPR'2018), TOF (IJCV'2019), TDAN (CVPR'2020), BasicVSR (CVPR'2021), IconVSR (CVPR'2021), BasicVSR++ (CVPR'2022), RealBasicVSR (CVPR'2022)
Video Interpolation: TOFlow (IJCV'2019), CAIN (AAAI'2020), FLAVR (CVPR'2021)
Image Colorization: InstColorization (CVPR'2020)
Image Translation: Pix2Pix (CVPR'2017), CycleGAN (ICCV'2017)
Inpainting: Global&Local (ToG'2017), DeepFillv1 (CVPR'2018), PConv (ECCV'2018), DeepFillv2 (CVPR'2019), AOT-GAN (TVCG'2019)
Matting: DIM (CVPR'2017), IndexNet (ICCV'2019), GCA (AAAI'2020)
Text-to-Image: ControlNet (2023), GLIDE (NeurIPS'2021), Disco-Diffusion (2022), Stable-Diffusion (2022)
3D-aware Generation: EG3D (CVPR'2022)

Please refer to model_zoo for more details.


🤝 Acknowledgement

MMEditing is an open-source project with contributions from researchers and engineers at various colleges and companies. We hope the toolbox and benchmark can serve the growing research community by providing a flexible toolkit to reimplement existing methods and develop new ones.

We appreciate all the contributors who implement their methods or add new features, as well as the users who give valuable feedback. Thank you all!


🖊️ Citation

If MMEditing is helpful to your research, please cite it as follows.

@misc{mmediting2022,
    title = {{MMEditing}: {OpenMMLab} Image and Video Editing Toolbox},
    author = {{MMEditing Contributors}},
    howpublished = {\url{https://github.com/open-mmlab/mmediting}},
    year = {2022}
}


🎫 License

This project is released under the Apache 2.0 license. If you are using our code for commercial purposes, please check LICENSES carefully.


🏗️ OpenMMLab Family

MMEngine: OpenMMLab foundational library for training deep learning models.
MMCV: OpenMMLab foundational library for computer vision.
MIM: MIM installs OpenMMLab packages.
MMClassification: OpenMMLab image classification toolbox and benchmark.
MMDetection: OpenMMLab detection toolbox and benchmark.
MMDetection3D: OpenMMLab's next-generation platform for general 3D object detection.
MMRotate: OpenMMLab rotated object detection toolbox and benchmark.
MMSegmentation: OpenMMLab semantic segmentation toolbox and benchmark.
MMOCR: OpenMMLab text detection, recognition, and understanding toolbox.
MMPose: OpenMMLab pose estimation toolbox and benchmark.
MMHuman3D: OpenMMLab 3D human parametric model toolbox and benchmark.
MMSelfSup: OpenMMLab self-supervised learning toolbox and benchmark.
MMRazor: OpenMMLab model compression toolbox and benchmark.
MMFewShot: OpenMMLab few-shot learning toolbox and benchmark.
MMAction2: OpenMMLab's next-generation action understanding toolbox and benchmark.
MMTracking: OpenMMLab video perception toolbox and benchmark.
MMFlow: OpenMMLab optical flow toolbox and benchmark.
MMEditing: OpenMMLab image and video editing toolbox.
MMGeneration: OpenMMLab image and video generative models toolbox.
MMDeploy: OpenMMLab model deployment framework.
