AindriyaBarua/PDF-mining

Scraped the web to build an Indian NER dataset, augmented the CoNLL data with Indian data, fine-tuned BERT, used weighted fastText embeddings with unsupervised KNN to classify reports, and extracted data from PDFs
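
The report-classification step can be pictured roughly as follows. This is a minimal sketch, not the repository's code. It assumes the weighting is IDF-style and the neighbour search uses cosine similarity, neither of which the description confirms. The `WORD_VECTORS` table is a hypothetical stand-in for real fastText vectors, and the helper names (`idf_weights`, `embed`, `nearest`) are invented for illustration.

```python
import math
from collections import Counter

# Hypothetical 2-d word vectors standing in for fastText embeddings.
WORD_VECTORS = {
    "revenue": [0.9, 0.1],
    "profit": [0.8, 0.2],
    "audit": [0.7, 0.3],
    "patient": [0.1, 0.9],
    "diagnosis": [0.2, 0.8],
    "clinic": [0.3, 0.7],
}


def idf_weights(docs):
    """Smoothed inverse document frequency per word (assumed weighting scheme)."""
    n = len(docs)
    df = Counter(word for doc in docs for word in set(doc))
    return {w: math.log((1 + n) / (1 + c)) + 1.0 for w, c in df.items()}


def embed(doc, weights):
    """Weighted average of the word vectors of a tokenised document."""
    dim = len(next(iter(WORD_VECTORS.values())))
    total = [0.0] * dim
    weight_sum = 0.0
    for word in doc:
        if word in WORD_VECTORS:
            w = weights.get(word, 1.0)
            for i, x in enumerate(WORD_VECTORS[word]):
                total[i] += w * x
            weight_sum += w
    return [x / weight_sum for x in total] if weight_sum else total


def cosine(a, b):
    """Cosine similarity, returning 0.0 for zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def nearest(query, corpus, weights, k=2):
    """Indices of the k corpus documents most similar to the query."""
    q = embed(query, weights)
    scored = [(cosine(q, embed(doc, weights)), i) for i, doc in enumerate(corpus)]
    scored.sort(reverse=True)
    return [i for _, i in scored[:k]]


# Toy tokenised "reports": two financial, two medical.
corpus = [
    ["revenue", "profit"],
    ["audit", "revenue"],
    ["patient", "diagnosis"],
    ["clinic", "patient"],
]
weights = idf_weights(corpus)
neighbours = nearest(["profit", "audit"], corpus, weights, k=2)
print(neighbours)
```

With real data, the toy table would be replaced by vectors from a trained fastText model, and the neighbours of each report would indicate the group it belongs to.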

Jupyter Notebook
