MILE lab, IISc

Medical Intelligence and Language Engineering (MILE) Lab, Department of Electrical Engineering (EE), Indian Institute of Science (IISc)

Bangalore, India

Pinned Repositories

DegradedWordsKannada
Benchmarking dataset of degraded word images (with character splits) in Kannada along with their associated ground truth Unicode text
Language:Shell2 2 00
Kannada-OCR-test-images-with-ground-truth
This Kannada OCR benchmarking dataset contains 250 images, carefully chosen to have various kinds of recognition challenges. Some of the pages have italics and bold characters. Some of them have Halegannada poems and text; others are letterpress-printed pages, where the vowel modifiers appear as separate symbols and do not touch the consonants they go with. Some pages have interspersed English words; still others have tables with a lot of numeric data. In addition, there are old pages containing either a lot of broken characters or many words with two or more characters merged into a single connected component.
Language:Shell3 2 02
KonkaniDocumentsInKannadaScript
OCR dataset of Konkani documents printed using Kannada script along with groundtruth text
1 3 00
m2repo
0 1 00
MergedSymbolsKannada
Benchmarking dataset of merged symbols in Kannada along with their associated ground truth Unicode text
Language:Shell2 2 00
MILE-OCR-Engine
Language:C++2 2 43
MILE-Transliterator
A browser plugin to Google Chrome, which instantly transliterates a website present in any Indic script to Kannada. This plugin exploits the Unicode block parallelism and also uses a rule-based approach to transliterate web pages to Kannada. This enables a polyglot user to read online documents in other Indic scripts through Kannada script. Currently, it supports transliteration from Tamil, Telugu, Malayalam, Bangla, Gujarati, Odiya, Punjabi, Sanskrit and Hindi pages. The quality of transliteration was scored by 45 users on a scale of 1 to 5 and a mean opinion score of 4.6 has been achieved.
Language:JavaScript0 2 00
ocr-web-app
OCR web-application
Language:TypeScript4 2 280
SanskritPagesUsingKannadaScript
OCR dataset of scanned images of Sanskrit text printed using Kannada script along with groundtruth text
2 2 00
TuluDocuments
OCR dataset of scanned pages of Tulu books along with groundtruth text
2 2 00

MILE lab, IISc's Repositories

MILE-IISc/ocr-web-app
OCR web-application
Language:TypeScript4 2 280
MILE-IISc/Kannada-OCR-test-images-with-ground-truth
This Kannada OCR benchmarking dataset contains 250 images, carefully chosen to have various kinds of recognition challenges. Some of the pages have italics and bold characters. Some of them have Halegannada poems and text; others are letterpress-printed pages, where the vowel modifiers appear as separate symbols and do not touch the consonants they go with. Some pages have interspersed English words; still others have tables with a lot of numeric data. In addition, there are old pages containing either a lot of broken characters or many words with two or more characters merged into a single connected component.
Language:Shell3 2 02
MILE-IISc/DegradedWordsKannada
Benchmarking dataset of degraded word images (with character splits) in Kannada along with their associated ground truth Unicode text
Language:Shell2 2 00
MILE-IISc/MergedSymbolsKannada
Benchmarking dataset of merged symbols in Kannada along with their associated ground truth Unicode text
Language:Shell2 2 00
MILE-IISc/MILE-OCR-Engine
Language:C++2 2 43
MILE-IISc/SanskritPagesUsingKannadaScript
OCR dataset of scanned images of Sanskrit text printed using Kannada script along with groundtruth text
2 2 00
MILE-IISc/TuluDocuments
OCR dataset of scanned pages of Tulu books along with groundtruth text
2 2 00
MILE-IISc/KonkaniDocumentsInKannadaScript
OCR dataset of Konkani documents printed using Kannada script along with groundtruth text
1 3 00
MILE-IISc/m2repo
0 1 00
MILE-IISc/MILE-OCR-Model
Language:Java0 2 00
MILE-IISc/MILE-Transliterator
A browser plugin to Google Chrome, which instantly transliterates a website present in any Indic script to Kannada. This plugin exploits the Unicode block parallelism and also uses a rule-based approach to transliterate web pages to Kannada. This enables a polyglot user to read online documents in other Indic scripts through Kannada script. Currently, it supports transliteration from Tamil, Telugu, Malayalam, Bangla, Gujarati, Odiya, Punjabi, Sanskrit and Hindi pages. The quality of transliteration was scored by 45 users on a scale of 1 to 5 and a mean opinion score of 4.6 has been achieved.
Language:JavaScript0 2 00
MILE-IISc/AndroidScannerDemo
ScanLibrary is an android document scanning library built on top of OpenCV, using the app you will be able to select the exact edges and crop the document accordingly from the selected 4 edges and change the perspective transformation of the cropped image.
Language:C++1 0
MILE-IISc/angular-sample
Language:TypeScript2 0
MILE-IISc/CRNN
Convolutional recurrent neural network for scene text recognition or OCR in Keras
Language:Python1 0
MILE-IISc/MILE-OCR-API
Language:Java2 01
MILE-IISc/wikiclean
A Java Wikipedia markup to plain text converter
Language:Java0 0

MILE lab, IISc

Pinned Repositories

DegradedWordsKannada

Kannada-OCR-test-images-with-ground-truth

KonkaniDocumentsInKannadaScript

m2repo

MergedSymbolsKannada

MILE-OCR-Engine

MILE-Transliterator

ocr-web-app

SanskritPagesUsingKannadaScript

TuluDocuments

MILE lab, IISc's Repositories

MILE-IISc/ocr-web-app

MILE-IISc/Kannada-OCR-test-images-with-ground-truth

MILE-IISc/DegradedWordsKannada

MILE-IISc/MergedSymbolsKannada

MILE-IISc/MILE-OCR-Engine

MILE-IISc/SanskritPagesUsingKannadaScript

MILE-IISc/TuluDocuments

MILE-IISc/KonkaniDocumentsInKannadaScript

MILE-IISc/m2repo

MILE-IISc/MILE-OCR-Model

MILE-IISc/MILE-Transliterator

MILE-IISc/AndroidScannerDemo

MILE-IISc/angular-sample

MILE-IISc/CRNN

MILE-IISc/MILE-OCR-API

MILE-IISc/wikiclean