Overview:

This repository is forked from qingswu/CTPN. CTPN specific layers are integrated into the standard BVLC Caffe (caffe_bvlc) and Intel optimized Caffe (caffe_intel). Performance was only tested on CPU.

Changes to Caffe distributions:

Original implementation of lstm along with its header file are moved into directory "bk_layers". The functionality of lstm is provided by CTPN specific lstm_layer.
CTPN specific layers, transpose and reverse, are copied into layers folder and enabled in both distributions.
For Intel optimized Caffe, concat layer is locked to Caffe engine. Thus concat layer will have no MKL/MKL-DNN support.

How to run demo.py

Build any/both of two caffe distributions

# cd caffe_bvlc [or caffe_intel]
# mkdir release
# cd release
# cmake -D CPU_ONLY=ON -D BLAS=mkl .. <== for BVLC Caffe. Please make sure Intel MKL has been installed.
# cmake ..                            <== for Intel optimized Caffe
# make -j$(nproc)
# make install                        <== this directory "install" ({INSTALL}) will be used in the following step

Build supporting package for the demo program

2.1 Modify the Makefile in {CTPN_ROOT} and add header file search directory where contains numpy/arrayobject.h. Change "-I/usr/local/lib/python2.7/dist-packages/numpy/core/include" to the path in your environment.

2.2 Run make to build supporting package.
Run demo program

# wget http://textdet.com/downloads/ctpn_trained_model.caffemodel -P models
# export PYTHONPATH={INSTALL}/python
# python tools/demo.py --no-gpu

Troubleshooting

On some environments, running the demo program may invoke an error indicating libmpi.so.12 is missing. In this case, please set environment variable LD_LIBRARY_PATH to {CTPN_ROOT}/caffe_intel/external/mlsl/l_mlsl_/intel64/lib.

export LD_LIBRARY_PATH={CTPN_ROOT}/caffe_intel/external/mlsl/l_mlsl_<version>/intel64/lib

The following contents are kept from original repository

CUDA 8.0 compatible version

1 . Updated caffe to current version, keeping the files that the official version doesn't have.

2 . Small fix in CTPN code to adapt to the new caffe.

git clone --recursive https://github.com/qingswu/CTPN.git
...compile caffe following official steps
# and goto root folder, compile cython code
make
# download model
wget http://textdet.com/downloads/ctpn_trained_model.caffemodel -P models/
# run the demo
./tools/demo.py

Detecting Text in Natural Image with Connectionist Text Proposal Network

The codes are used for implementing CTPN for scene text detection, described in:

Z. Tian, W. Huang, T. He, P. He and Y. Qiao: Detecting Text in Natural Image with
Connectionist Text Proposal Network, ECCV, 2016.

Online demo is available at: textdet.com

These demo codes (with our trained model) are for text-line detection (without side-refiement part).

Required hardware

You need a GPU. If you use CUDNN, about 1.5GB free memory is required. If you don't use CUDNN, you will need about 5GB free memory, and the testing time will slightly increase. Therefore, we strongly recommend to use CUDNN.

It's also possible to run the program on CPU only, but it's extremely slow due to the non-optimal CPU implementation.

Required softwares

Python2.7, cython and all what Caffe depends on.

How to run this code

Clone this repository with git clone https://github.com/tianzhi0549/CTPN.git. It will checkout the codes of CTPN and Caffe we ship.
Install the caffe we ship with codes bellow.
- Install caffe's dependencies. You can follow this tutorial. Note: we need Python support. The CUDA version we need is 7.0.
- Enter the directory caffe.
- Run cp Makefile.config.example Makefile.config.
- Open Makefile.config and set WITH_PYTHON_LAYER := 1. If you want to use CUDNN, please also set CUDNN := 1. Uncomment the CPU_ONLY :=1 if you want to compile it without GPU.
  
  Note: To use CUDNN, you need to download CUDNN from NVIDIA's official website, and install it in advance. The CUDNN version we use is 3.0.
- Run make -j && make pycaffe.
After Caffe is set up, you need to download a trained model (about 78M) from Google Drive or our website, and then populate it into directory models. The model's name should be ctpn_trained_model.caffemodel.
Now, be sure you are in the root directory of the codes. Run make to compile some cython files.
Run python tools/demo.py for a demo. Or python tools/demo.py --no-gpu to run it under CPU mode.

License

The codes are released under the MIT License.

jingxu10/CTPN