PaddleHub: A Python repository from leishufei

English | 简体中文

Introduction

PaddleHub aims to provide developers with rich, high-quality, and directly usable pre-trained models.
【No need for deep learning background, no data and training process】，you can use AI models quickly and enjoy the dividends of the artificial intelligence era.
Covers 4 major categories of CV, NLP, Audio, and Video, and supports one-click prediction, one-click service deployment and transfer learning
All models are OPEN SOURCE, FREE for download and use in offline scenario.

Recent updates

2020.12.1: Release 2.0-beta-1 version, migrate ERNIE, RoBERTa, BERT to dynamic graph mode. Add text classification fine-tuen task based on large-scale pre-trained models.
2020.11.20: Release 2.0-beta version, fully migrate the dynamic graph programming mode, and upgrade the service deployment Serving capability; add 1 hand key point detection model, 12 image animation models, 3 image editing models, 3 speech synthesis models, syntax Analyzing one, the total number of pre-trained models reaches 【182】.
2020.10.09: Added 4 new OCR multi-language series models, 4 image editing models, and the total number of pre-trained models reached 【162】.
2020.09.27: 6 new text generation models and 1 image segmentation model were added, and the total number of pre-trained models reached 【154】.
2020.08.13: Released v1.8.1, added a segmentation model, and supports EMNLP2019-Sentence-BERT as a text matching task network. The total number of pre-training models reaches 【147】.
2020.07.29: Release v1.8.0, new AI couplets and AI writing poems, jieba word cutting, text data LDA, semantic similarity calculation, new target detection, short video classification model, ultra-lightweight Chinese and English OCR, new pedestrian detection, vehicle Industrial-grade models such as detection and animal recognition support VisualDL visualization training, and the total number of pre-training models reaches 【135】.

Features

【Abundant Pre-trained Models】: 180+ pre-trained models covering the four major categories of CV, NLP, Audio, and Video, all open source downloads, and can be run offline.
【Quick Model Prediction】: Model calls can be realized through a one-line command line or a minimalist Python API to quickly experience the model effect.
【Model As Service】: A one-line command to build deep learning model API service deployment capabilities.
【Ten Lines of Code for Transfer Learning】: Ten lines of code complete the transfer-learning task of image classification and text classification.
【PIP installation 】: Support PIP quick installation and use.
【Cross-platform Compatibility】: Can run on Linux, Windows, MacOS and other operating systems.

Visualization Demo

Text Recognition

Contains ultra-lightweight Chinese and English OCR models, high-precision Chinese and English, multilingual German, French, Japanese, Korean OCR recognition.

Face Detection

Including face detection, mask face detection, multiple algorithms are optional.

Image Editing

4x super resolution effect, multiple super resolution models are optional.
Colorization models can be used to repair old grayscale photos.

SuperResolution	Restoration

Object Detection

Pedestrian detection, vehicle detection, and more industrial-grade ultra-large-scale pretrained models are provided.

Key Point Detection

Supports body, face and hands key point detection for single or multiple person.

Image Segmentation

Contains excellent portrait cutout model, ACE2P human body analysis world champion model.

Image Animation

Contains image style transfer models with Hayao Miyazaki and Makoto Shinkai styles, etc.

Image Classification

Including animal classification, dish classification, wild animal product classification, multiple algorithms are available.

Text Generation

Including AI poem writing, AI couplets, AI love words, AI hidden poems, multiple algorithms are available.

Syntax Analysis

Leading Chinese syntactic analysis model release by Baidu NLP.

Sentiment Analysis

SOTA Chinese sentiment analysis model released by Baidu NLP.

Text Review

Contains the review of Chinese pornographic text, and multiple algorithms are available.

Speech Synthesis

TTS speech synthesis algorithm, multiple algorithms are available
Input: Life was like a box of chocolates, you never know what you're gonna get.
The synthesis effect is as follows:

deepvoice3	fastspeech	transformer

Video Classification

Short video classification trained via large-scale video dataset, supports 3000+ tag types prediction.
Example: Input a short video of swimming, the algorithm can output the result of "swimming"

===Key Points===

All the above pre-trained models are all open source, and the number of models is continuously updated. Welcome Star to pay attention.

Welcome to join PaddleHub technical group

If you have any questions during the use of the model, you can join the official WeChat group to get more efficient questions and answers, and fully communicate with developers from all walks of life. We look forward to your joining.

If you fail to scan the code, please add WeChat 15711058002 and note "Hub", the operating class will invite you to join the group.

Documentation Tutorial

PIP Installation
Quick Start
Rich Pre-trained Models 182
- Boutique Featured Models
- Computer Vision 126
- Natural Language Processing 48
- Audio 3
  - Speech Synthesis 3
- Video 5
  - Video Classification 5
Deploy
Advanced documentation
- Command Line Interface Usage
- How to Load Customized Dataset
Community
License
Contribution

License

The release of this project is certified by the Apache 2.0 license.

Contribution

We welcome you to contribute code to PaddleHub, and thank you for your feedback.

Many thanks to Austendeng for fixing the SequenceLabelReader
Many thanks to cclauss optimizing travis-ci check
Many thanks to 奇想天外，Contributed a demo of mask detection
Many thanks to mhlwsk，Contributed the repair sequence annotation prediction demo
Many thanks to zbp-xxxp，Contributed modules for viewing pictures and writing poems
Many thanks to zbp-xxxp and 七年期限,Jointly contributed to the Mid-Autumn Festival Special Edition Module
Many thanks to livingbody，Contributed models for style transfer based on PaddleHub's capabilities and Mid-Autumn Festival WeChat Mini Program

leishufei/PaddleHub