Pinned Repositories
Aadhaar-Card-OCR
Extract text information from Aadhaar Card using tesseract-ocr :sunglasses:
AndroidPassportReader
Android application to read passports with MRZ
AntSynDistinction
Integrating Distributional Lexical Contrast into Word Embeddings for Antonym-Synonym Distinction
AspectExtraction
Aspect Extraction
AstroKundli
BiFlaG
Codes for the paper Bipartite Flat-Graph Network for Nested Named Entity Recognition
C-OCR
C-OCR是携程自研的OCR项目,主要包括身份证、护照、火车票、签证等旅游相关证件、材料的识别。 项目包含4个部分,拒识、检测、识别、后处理。
clinvoc
Tools for working with clinical vocabularies (such as ICD 9, ICD 10, HCPCS, etc.)
DocumentScan
((Work in Progress)) Document/ID scanner capable of performing template alignment and OCR on driver's licenses with a defined template using OpenCV and Pytesseract
idhruvc's Repositories
idhruvc/DocumentScan
((Work in Progress)) Document/ID scanner capable of performing template alignment and OCR on driver's licenses with a defined template using OpenCV and Pytesseract
idhruvc/Aadhaar-Card-OCR
Extract text information from Aadhaar Card using tesseract-ocr :sunglasses:
idhruvc/AspectExtraction
Aspect Extraction
idhruvc/AstroKundli
idhruvc/BiFlaG
Codes for the paper Bipartite Flat-Graph Network for Nested Named Entity Recognition
idhruvc/daniel_fintoc2019
Our participation to the "Financial Document Structure Extraction" task 2019
idhruvc/Dependency-Based-Sentence-Embeddings
Novel model for sentence embedding based on dependency parse-trees
idhruvc/GA_Project_5_Capstone_Multiclass_Legal_Text_Classification_BERT
GA Project 5 (Capstone Project): Using Neural Networks (BERT) with Legal NLP for Contract Clause Classification in real-life clauses
idhruvc/GFTE
A GCN-based table structure recognition method
idhruvc/grove
A Software as a Service (SaaS) log collection framework.
idhruvc/huey
A UI for DuckDB
idhruvc/image-super-resolution
🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.
idhruvc/image-table-ocr
Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.
idhruvc/language-models-are-knowledge-graphs-pytorch
Language models are open knowledge graphs ( non official implementation )
idhruvc/Log_Analysis
Log analysis project aimed at finding and predicting anomalies in logs
idhruvc/logai
LogAI - An open-source library for log analytics and intelligence
idhruvc/LogAnalyser_dataSet
This repository is to containi all the data sets within our log analyser module.
idhruvc/LogPPT
Log Parsing with Prompt-based Few-shot Learning (ICSE 2023, Technical Track)
idhruvc/LogSummary
A toolkit and datasets for LogSummary
idhruvc/matano
Open source security data lake for threat hunting, detection & response, and cybersecurity analytics at petabyte scale on AWS
idhruvc/Paraphrase-any-question-with-T5-Text-To-Text-Transfer-Transformer-
Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided
idhruvc/PdfPigSvmRegionClassifier
Proof of concept of a simple SVM Region Classifier using PdfPig and Accord.Net. The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
idhruvc/PLELog
Implementation of PLELog in ICSE 2021 accepted paper:Semi-supervised Log-based Anomaly Detection via Probabilistic Label Estimation.
idhruvc/pylogsentiment
Sentiment analysis in system logs.
idhruvc/sbd_adjudicatory_dec
idhruvc/spade
idhruvc/table-parser-opencv
Extract tables from images or PDFs and convert them to Excel files
idhruvc/table-structure
Cleaned research code for predicting table structure from an image.
idhruvc/text-to-insight
A GPT-assisted tool that translates natural language to SQL queries, tabular data, and graphs.
idhruvc/Zircolite
A standalone SIGMA-based detection tool for EVTX, Auditd and Sysmon for Linux logs