PonteIneptique's Stars
sicara/easy-few-shot-learning
Ready-to-use code and tutorial notebooks to boost your way into few-shot learning for image classification.
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
tesseract-ocr/tesstrain
Train Tesseract LSTM with make
sbrunner/deskew
Library used to deskew a scanned document
uvipen/Hierarchical-attention-networks-pytorch
Hierarchical Attention Networks for document classification
yinboc/prototypical-network-pytorch
A re-implementation of "Prototypical Networks for Few-shot Learning"
slaysd/pytorch-sentiment-analysis-classification
PyTorch tutorials on sentiment analysis classification (RNN, LSTM, Bi-LSTM, LSTM+Attention, CNN)
ExtractTable/ExtractTable-py
Python library to extract tabular data from images and scanned PDFs
ankanbhunia/Handwriting-Transformers
Handwriting-Transformers (ICCV21)
morningmoni/HiLAP
Code for paper "Hierarchical Text Classification with Reinforced Label Assignment" EMNLP 2019
charlesdedampierre/BunkaTopics
🗺️ Data Cleaning and Textual Data Visualization 🗺️
william-sy/sonoff-zbdongle-p-fw-mac-Lin
JacobTyo/Valla
NathanGodey/headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https://arxiv.org/abs/2309.08351)
pharos-alexandria/ocr-greek_cursive
Training files for Greek cursive script (in early print)
OpenITI/acdc_train
Automatic Collation for Diversifying Corpora
michmech/tei-dictionary.xsl
An XSLT stylesheet for TEI-encoded dictionaries
WissamAntoun/SlurmTUI
Terminal UI for monitoring SLURM jobs
bnagy/ruzicka
Freymat/from_eScriptorium_to_Passim_and_back
Pipeline for ground truth creation to train text recognition models. Extracts OCR results from eScriptorium, prepares them for alignment with passim, and imports the valid alignments back into eScriptorium.
CTDave001/pytorch-handwriting-synthesis
Handwriting generation and handwriting synthesis as described in Alex Graves's paper https://arxiv.org/abs/1308.0850. PyTorch implementation.
gabays/grobid
Automatic XML TEI encoding of catalogues using GROBID technologies
hellrich/hyperwords
Modified version of Omer Levy's hyperwords word embedding tool that allows for weighted downsampling as well as resource-friendly training of models.
ThoraHagen/Encyc-Transformation
XSLT files transforming the custom markup of historical German encyclopedias to (slightly modified) TEI Lex-0 markup.
apertium/apertium-oci-fra
Apertium translation pair for Occitan and French
Badar-e-Alam/Content-and-Style-Aware-Generation-of-Text-Line-Images-for-Handwriting-Recognition
mromanello/DTS-validator
DTS validator is a suite of tests to validate implementations of the DTS API.
PieterBeullens/translators-touch
liao961120/PPMI
Construct Word Embeddings with PPMI & SVD
Mejans/data-occitan