s0v1x's Stars
firmai/industry-machine-learning
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
AmanSavaria1402/TableNet
TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images In modern times, more and more number of people are sharing their documents as photos taken from smartphones. A lot of these documents contain lots of information in one or more tables. These tables often contain very important information and extracting this information from the image is a task of utmost importance. In modern times, information extraction from these tables is done manually, which requires a lot of effort and time and hence is very inefficient. Therefore, having an end-to-end system that given only the document image, can recognize and localize the tabular region and also recognizing the table structure (columns) and then extract the textual information from the tabular region automatically will be of great help since it will make our work easier and much faster. TableNet is just that. It is an end-to-end deep learning model that can localize the tabular region in a document image, understand the table structure and extract text data from it given only the document image. Earlier state-of-the-art deep learning methods took the two problems, that is, table detection and table structure recognition (recognizing rows and columns in the table) as separate and treated them separately. However, given the interdependence of the two tasks, TableNet considers them as two related sub-problems and solves them using a single neural network. Thus, also making it relatively lightweight and less compute intensive solution.
HuaizhengZhang/Active-Learning-as-a-Service
A scalable & efficient active learning/data selection system for everyone.
zzak00/job-scraper
benouinirachid/patterns-finder
Simple, Fast, Powerful and Easily extensible python package for extracting patterns from text, with over than 60 predefined Regular Expressions.
lavis-nlp/spert
PyTorch code for SpERT: Span-based Entity and Relation Transformer
flairNLP/flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
huggingface/neuralcoref
✨Fast Coreference Resolution in spaCy with Neural Networks
lxucs/coref-hoi
PyTorch implementation of the end-to-end coreference resolution model with different higher-order inference methods.
lxucs/coref-ee
mandarjoshi90/coref
BERT for Coreference Resolution
explosion/confection
:candy: Confection: the sweetest config system for Python
explosion/spacy-course
👩🏫 Advanced NLP with spaCy: A free online course
explosion/sense2vec
🦆 Contextually-keyed word vectors
explosion/projects
🪐 End-to-end NLP workflows from prototype to production
wookayin/gpustat
📊 A simple command-line utility for querying and monitoring GPU status
pre-commit/pre-commit
A framework for managing and maintaining multi-language pre-commit hooks.
daqcri/DeepER
End-to-End Deep Entity Resolution
zhao1701/extending-deep-ER
This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs on benchmark datasets under a variety of conditions and also tests a number of extensions designed to improve DeepER's accuracy.
alvarobartt/investpy
Financial Data Extraction from Investing.com with Python
microsoft/nlp-recipes
Natural Language Processing Best Practices & Examples
cerlymarco/tsmoothie
A python library for time-series smoothing and outlier detection in a vectorized way.
optionalCTF/gsuite-enum
A simple tool for enumerating GSuite email addresses.
TheSpeedX/PROXY-List
Get PROXY List that gets updated everyday
initstring/cloud_enum
Multi-cloud OSINT tool. Enumerate public resources in AWS, Azure, and Google Cloud.
inukshuk/anystyle
Fast citation reference parsing
pwn0sec/PwnXSS
PwnXSS: Vulnerability (XSS) scanner exploit
explosion/prodigy-recipes
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
0xsha/CloudBrute
Awesome cloud enumerator
google-research/multilingual-t5