๐ ๏ธ Built with open-sourced Tesseract OCR Engine (WebAssembly JavaScript port). An offline in-browser setup.
A Fusion of OCR Technology (Tesseract.js) & Web Speech API ๐ฃ๐ฌ. Non-mobile responsive ๐ฑ๐ซ.
- Image-to-Text: Input image file for Text Extraction with Tesseract-OCR
- PDF-to-Image-to-Text: Input PDF File to render each page image for Text Extraction with PDF.js
- Text-to-Speech: Voice generation with Web API SpeechSynthesis
- Export extracted text to local .txt file.
Article One :: Build A Text-To-Speech App Using Client-Side JavaScript
Article Two :: Build An Image & PDF Text Extraction Tool with Tesseract OCR Using Client-side JavaScript
โ Join me on ๐ Medium at ~ ฮพ(๐หถโโกโ) @geek-cc
๐ฎ Please buy me a Taco! ๐
- ยฉ Copyright - This work is licensed under Apache License
- Original library at Tesseract.js v2 and its co-creator/maintainer Jerome Wu