odnl: A Python repository from lilhope

Thanks to the amazing features of MXNet, a GPU with 6GB of memory is enough.

Download a pre-trained ImageNet model (VGG or ResNet-101) and put it in the /model folder, or use script/get_pretrained_model.sh.
Download the dataset (the refcoco version), unzip it, and put it in the data folder.
Download the MSCOCO dataset (http://mscoco.org/) with script/get_coco.sh. After it has downloaded successfully, create a symbolic link with the following commands (this saves disk space):

	cd ./data/images/
	ln -s mscoco YourPathtoMSCOCO train2014 image

Download the pre-trained Facebook word2vec model.
Uhhh, that is a lot of data to download. I'll write a script to make this easier (:
Run make in the root folder. This builds some Cython functions for RCNN and the MSCOCO toolkit.
python train_end2end.py

Some good and not-so-good results are listed below (the red rectangle is what the model predicted, the green rectangle is the ground truth):

working project

There are some techniques that may improve my model. I've added them to the working project, as listed below:

Use ROIAlign instead of RoIPooling (the CPU forward and backward passes are implemented; I'm working on the GPU code).
Encode sentences with a CNN, which seems more efficient for short text.
Use a dilated CNN to extract image features.
Add a demo. I think an online demo is needed.
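
To make the first item concrete, here is a minimal pure-Python sketch of the difference between the two operators. It is not the code from this repository: the function names (bilinear_sample, roi_align, roi_pool), the 2x2 output size, and the 2x2 sampling grid per bin are all assumptions chosen for illustration. RoIPooling rounds the box to whole feature-map cells and takes a max, while ROIAlign keeps the fractional coordinates and averages bilinearly interpolated samples.

```python
import math


def bilinear_sample(feature, y, x):
    """Sample a 2-D feature map (list of rows) at a fractional (y, x) point."""
    h, w = len(feature), len(feature[0])
    # Clamp the point so it stays inside the map.
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(math.floor(y)), int(math.floor(x))
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    dy, dx = y - y0, x - x0
    top = feature[y0][x0] * (1 - dx) + feature[y0][x1] * dx
    bottom = feature[y1][x0] * (1 - dx) + feature[y1][x1] * dx
    return top * (1 - dy) + bottom * dy


def roi_align(feature, roi, out_size=2, samples=2):
    """Illustrative ROIAlign: average bilinear samples in each output bin.

    roi is (x1, y1, x2, y2) in feature-map coordinates and may be fractional.
    """
    x1, y1, x2, y2 = roi
    bin_h = (y2 - y1) / out_size
    bin_w = (x2 - x1) / out_size
    out = []
    for i in range(out_size):
        row = []
        for j in range(out_size):
            total = 0.0
            # Regular grid of sample points inside bin (i, j), no rounding.
            for sy in range(samples):
                for sx in range(samples):
                    yy = y1 + (i + (sy + 0.5) / samples) * bin_h
                    xx = x1 + (j + (sx + 0.5) / samples) * bin_w
                    total += bilinear_sample(feature, yy, xx)
            row.append(total / (samples * samples))
        out.append(row)
    return out


def roi_pool(feature, roi, out_size=2):
    """Illustrative RoIPooling: round the box to whole cells, then max-pool."""
    h, w = len(feature), len(feature[0])
    x1, y1, x2, y2 = [int(round(v)) for v in roi]  # quantization step
    roi_h = max(y2 - y1 + 1, 1)
    roi_w = max(x2 - x1 + 1, 1)
    out = []
    for i in range(out_size):
        hs = min(max(y1 + (i * roi_h) // out_size, 0), h)
        he = min(max(y1 + math.ceil((i + 1) * roi_h / out_size), 0), h)
        row = []
        for j in range(out_size):
            ws = min(max(x1 + (j * roi_w) // out_size, 0), w)
            we = min(max(x1 + math.ceil((j + 1) * roi_w / out_size), 0), w)
            cells = [feature[y][x] for y in range(hs, he) for x in range(ws, we)]
            row.append(max(cells) if cells else 0.0)
        out.append(row)
    return out


if __name__ == "__main__":
    # Toy 4x4 feature map whose value is x + 10 * y.
    fmap = [[float(x + 10 * y) for x in range(4)] for y in range(4)]
    box = (0.4, 0.4, 2.4, 2.4)
    print("roi_align:", roi_align(fmap, box))
    print("roi_pool: ", roi_pool(fmap, box))
```

On the toy map the two functions return different values for the same fractional box, because roi_pool snaps the box to integer cells before pooling. That misalignment is the usual motivation for switching to ROIAlign.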

This implementation is 90% based on mxnet-faster-rcnn. Thanks to its authors for that fast and concise implementation.