This repository is not active
arachid1/text-extraction-MSER
Final goal: Information retrieval of scanned PDF files by keywords but also semantic connections to those keywords. Tools: Extraction using MSER, Recognition using Tesseract and semantization using Word2vec
Python