Pinned Repositories
2013-Summer-Camp
2013 Summer Camp preparation materials
argetlam_wiki
This is simple tool to show stats about edit-a-thones of Wikipedia Articles
crabgrass-core
Crabgrass is a software libre web application designed for social networking, group collaboration and network organizing. Our goal is to create communication tools that are tailored specifically to meet the needs of bottom up grassroots organizing.
epapertowordpress
Convertion of a Tamil Epaper to its equivalent wordpress format
Frequency-Analysis-of-tamil-letters
Frequency Analysis of tamil letters
gdcmdtools
Google drive command-line tools
Presentations
My presentations in various events
PunjabiLexicon-
To scrap Punjabi data from Digital dictionaries of South Asia
TamilLexicon
To scrap data from Digital dictionaries of South Asia
tesseract-ocr
Git mirror of tesseract-ocr SVN/CVS from google code/sourceforge
commonssibi's Repositories
commonssibi/PunjabiLexicon-
To scrap Punjabi data from Digital dictionaries of South Asia
commonssibi/Presentations
My presentations in various events
commonssibi/TamilLexicon
To scrap data from Digital dictionaries of South Asia
commonssibi/tesseract-ocr
Git mirror of tesseract-ocr SVN/CVS from google code/sourceforge
commonssibi/2013-Summer-Camp
2013 Summer Camp preparation materials
commonssibi/argetlam_wiki
This is simple tool to show stats about edit-a-thones of Wikipedia Articles
commonssibi/crabgrass-core
Crabgrass is a software libre web application designed for social networking, group collaboration and network organizing. Our goal is to create communication tools that are tailored specifically to meet the needs of bottom up grassroots organizing.
commonssibi/epapertowordpress
Convertion of a Tamil Epaper to its equivalent wordpress format
commonssibi/Frequency-Analysis-of-tamil-letters
Frequency Analysis of tamil letters
commonssibi/gdcmdtools
Google drive command-line tools
commonssibi/Google-OCR-for-Tamil-Testing-
Testing the Google OCR for Tamil with various inputs
commonssibi/korkai
A corpus builder for Tamil by analyzing wordpress, blogger, wikipedia dumps
commonssibi/OCR4wikisource
OCR for WikiSource using Google Drive OCR
commonssibi/ocropy
Python-based OCR package using recurrent neural networks.
commonssibi/open-tamil
Open Source Tamil Tools
commonssibi/Panchayat
Code used for article creation in wikipedia
commonssibi/pdf2pages
commonssibi/Post-Processing-for-Google-Tamil-OCR
A project aimed at correcting mistakes in a Google Tamil OCR-ed file
commonssibi/PostOCRCorrectionforTamil
The project is aimed to supplement the OCR4Wikisource by devicing a mechanism to perform a post uploading cleansing by finding out the common errors which occur
commonssibi/quassel
Quassel IRC
commonssibi/Scrap-a-news-site
How to scrap static a news website
commonssibi/ScribusCTL
Official Scribus collaboration area for Complex Text Layout (RTL, CJK, Indic, OpenType, Math, CSS, ...)
commonssibi/Tesseract-findings
Tesseract data analysis from google and VietOCR
commonssibi/tesseract-georgian
A set of data files that can be used to train tesseract-ocr to read Georgian script (ქართული ენა)
commonssibi/Tesseract-testings
Training sets and other settings for Tesseract
commonssibi/Tickets-in-Tamil-
commonssibi/wikipedia-extractor
Extracts and cleans text from Wikipedia database dump and stores output in a number of files of similar size in a given directory. This is a mirror of the script by Giuseppe Attardi.