/OCR-In-Django-And-Tesseract

A Django based web application that takes files in Image, PDF and text formats from users and extracts the textual content of these files using OCR (Optical Character Recognition), summarizes the content of files using NLTK library using Page Rank algorithm and Natural Language Processing

Primary LanguageHTML

Stargazers