/text-extraction-MSER

Final goal: Information retrieval of scanned PDF files by keywords but also semantic connections to those keywords. Tools: Extraction using MSER, Recognition using Tesseract and semantization using Word2vec

Primary LanguagePython

This repository is not active