JesseWong333/pytorch_deeplearning

Python

Introduction

Universal pytorch train & test framework, which is developed for fast implementation, research and deploy

Getting Started

基础设计实现参照, Thanks for their brilliant works

pytorch cycle-gan
fairseq
mmdetection

TODO LIST

易用性

使用文档及范例
提供更多的常用基础组件的实现，比如efficient Net, resnet FPN 等，常用的loss, 如dice loss, centre loss, arc face等, anchor方法等一系列相关东西

This project is still under development.

The interface is unstable.

Now including:

C^2TD: a fast and robust text detector via centre line regression and centre border probability[.config\c2td.py]
CRNN & GRCNN: only inference
pixel-link: Codes are from https://git.iyunxiao.com/DeepVision/pytorch-ocr-framework
im2latex: 少量没有解耦. Codes are from https://git.iyunxiao.com/DeepVision/pytorch-ocr-framework
Unet segmentation
To be continue: Transformers

更新历史

初步的框架设计，统一的训练、推理抽象。完成1-5点
增加文字识别模型resnet的backbone添加进来，并将beamsearch后处理及文字后处理整合到ctc_decode.py后处理，但是这部分代码一直在变动，后处理的代码应该再考虑怎么更好加进来
单例推理预处理部分，这里是参考了注册机制，每一个模型的预处理注册一个预处理函数，但是感觉这样不太好，很多预处理的代码有重复部分，可能要考虑怎么抽取出来，用参数配置的方式
增加im2latext及pixellink训练代码
visdom子进程启动。可视化显示loss. 等待"支持训练时评估"功能完成后，接入更多的可视化展示
增加OCR通用识别模型，检测，识别dataset
训练预处理可配置pipe_line (常用预处理操作)
增加evaluators模块供训练时评估 (目前支持dataset的, 临时文件夹评估待完成)