commonssibi

Pinned Repositories

2013-Summer-Camp
2013 Summer Camp preparation materials
00
argetlam_wiki
This is simple tool to show stats about edit-a-thones of Wikipedia Articles
0 2 00
crabgrass-core
Crabgrass is a software libre web application designed for social networking, group collaboration and network organizing. Our goal is to create communication tools that are tailored specifically to meet the needs of bottom up grassroots organizing.
Language:Ruby00
epapertowordpress
Convertion of a Tamil Epaper to its equivalent wordpress format
Language:Python00
Frequency-Analysis-of-tamil-letters
Frequency Analysis of tamil letters
Language:Gnuplot0 2 01
gdcmdtools
Google drive command-line tools
Language:Python00
Presentations
My presentations in various events
20
PunjabiLexicon-
To scrap Punjabi data from Digital dictionaries of South Asia
Language:Python43
TamilLexicon
To scrap data from Digital dictionaries of South Asia
12
tesseract-ocr
Git mirror of tesseract-ocr SVN/CVS from google code/sourceforge
Language:C++10

commonssibi's Repositories

commonssibi/PunjabiLexicon-
To scrap Punjabi data from Digital dictionaries of South Asia
Language:Python43
commonssibi/Presentations
My presentations in various events
20
commonssibi/TamilLexicon
To scrap data from Digital dictionaries of South Asia
12
commonssibi/tesseract-ocr
Git mirror of tesseract-ocr SVN/CVS from google code/sourceforge
Language:C++10
commonssibi/2013-Summer-Camp
2013 Summer Camp preparation materials
00
commonssibi/argetlam_wiki
This is simple tool to show stats about edit-a-thones of Wikipedia Articles
0 2 00
commonssibi/crabgrass-core
Crabgrass is a software libre web application designed for social networking, group collaboration and network organizing. Our goal is to create communication tools that are tailored specifically to meet the needs of bottom up grassroots organizing.
Language:Ruby00
commonssibi/epapertowordpress
Convertion of a Tamil Epaper to its equivalent wordpress format
Language:Python00
commonssibi/Frequency-Analysis-of-tamil-letters
Frequency Analysis of tamil letters
Language:Gnuplot0 2 01
commonssibi/gdcmdtools
Google drive command-line tools
Language:Python00
commonssibi/Google-OCR-for-Tamil-Testing-
Testing the Google OCR for Tamil with various inputs
2 0
commonssibi/korkai
A corpus builder for Tamil by analyzing wordpress, blogger, wikipedia dumps
Language:Go
commonssibi/OCR4wikisource
OCR for WikiSource using Google Drive OCR
Language:Python
commonssibi/ocropy
Python-based OCR package using recurrent neural networks.
Language:Python
commonssibi/open-tamil
Open Source Tamil Tools
Language:Python
commonssibi/Panchayat
Code used for article creation in wikipedia
Language:JavaScript
commonssibi/pdf2pages
Language:Python
commonssibi/Post-Processing-for-Google-Tamil-OCR
A project aimed at correcting mistakes in a Google Tamil OCR-ed file
commonssibi/PostOCRCorrectionforTamil
The project is aimed to supplement the OCR4Wikisource by devicing a mechanism to perform a post uploading cleansing by finding out the common errors which occur
commonssibi/quassel
Quassel IRC
Language:C++
commonssibi/Scrap-a-news-site
How to scrap static a news website
Language:Python
commonssibi/ScribusCTL
Official Scribus collaboration area for Complex Text Layout (RTL, CJK, Indic, OpenType, Math, CSS, ...)
Language:C++
commonssibi/Tesseract-findings
Tesseract data analysis from google and VietOCR
commonssibi/tesseract-georgian
A set of data files that can be used to train tesseract-ocr to read Georgian script (ქართული ენა)
Language:Python
commonssibi/Tesseract-testings
Training sets and other settings for Tesseract
commonssibi/Tickets-in-Tamil-
commonssibi/wikipedia-extractor
Extracts and cleans text from Wikipedia database dump and stores output in a number of files of similar size in a given directory. This is a mirror of the script by Giuseppe Attardi.
Language:Python