ArneDefauw

Pinned Repositories

BERT_doc_classification
Document classification with BERT
Language:Python4 2 02
bert_document_classification
architectures and pre-trained models for long document classification.
Language:Python0 0 00
BERT_NER
NER with BERT
Language:Python0 1 10
cache-conda-envs
Speed up your builds by caching Anaconda environments on GitHub Actions
Language:Python0 0 00
CVDD-PyTorch
A PyTorch implementation of Context Vector Data Description (CVDD), a method for Anomaly Detection on text.
Language:Python0 0 00
Demo
Demo repo for tutotial articles on Opensource.com
0 0 00
diffgram
Training Data (Data Labeling, Annotation, Catalog, Workflow) for all Data Types (Image, Video, 3D, Text, Geo, Audio, more) at scale.
Language:Python0 0 00
dkpro-cassis
UIMA CAS processing library written in Python
Language:Python0 0 00
doc_classification_tfidf
Language:Python0 1 00
DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
Language:Python0 0 00

ArneDefauw's Repositories

ArneDefauw/BERT_doc_classification
Document classification with BERT
Language:Python4 2 02
ArneDefauw/BERT_NER
NER with BERT
Language:Python0 1 10
ArneDefauw/cache-conda-envs
Speed up your builds by caching Anaconda environments on GitHub Actions
Language:Python0 0 00
ArneDefauw/Demo
Demo repo for tutotial articles on Opensource.com
0 0 00
ArneDefauw/diffgram
Training Data (Data Labeling, Annotation, Catalog, Workflow) for all Data Types (Image, Video, 3D, Text, Geo, Audio, more) at scale.
Language:Python0 0 00
ArneDefauw/dkpro-cassis
UIMA CAS processing library written in Python
Language:Python0 0 00
ArneDefauw/doc_classification_tfidf
Language:Python0 1 00
ArneDefauw/DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
Language:Python0 0 00
ArneDefauw/fake_news_semantics
Code for the paper "Do Sentence Interactions Matter ? Leveraging Sentence Level Representations for Fake News Classification"
Language:Python0 0
ArneDefauw/FakeNewsCorpusSpanish
The Spanish Fake News Corpus contains a collection of 971 news divided into 491 real news and 480 fake news. The corpus covers news from 9 different topics: Science, Sport, Economy, Education, Entertainment, Politics, Health, Security, and Society
0 0
ArneDefauw/files2rouge
Calculating ROUGE score between two files (line-by-line)
Language:Perl1 0
ArneDefauw/ganbert-pytorch
Enhancing the BERT training with Semi-supervised Generative Adversarial Networks in Pytorch/HuggingFace
Language:Jupyter Notebook0 0
ArneDefauw/ilastik-napari
ilastik plugin for napari
Language:Python
ArneDefauw/Legal-Docs-Large-MLTC
Multi Label Text Classification for Legal documents. Work on mono-lingual and multilingual parallel data
Language:Jupyter Notebook0 0
ArneDefauw/lmtc-eurlex57k
Large-Scale Multi-Label Text Classification on EU Legislation
ArneDefauw/mlm-scoring
Python library & examples for Masked Language Model Scoring (ACL 2020)
Language:Python0 0
ArneDefauw/multi-eurlex
MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer
Language:Python0 0
ArneDefauw/multilingual-fake-news
The code related to the paper
Language:Python0 0
ArneDefauw/Multimodal-Toolkit
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
Language:Python0 0
ArneDefauw/neural-document-aligner
Document aligner which uses neural technologies to search matches across bilingual documents
Language:Python0 0
ArneDefauw/Nimbus
Language:Python0 0
ArneDefauw/question_generator
An NLP system for generating reading comprehension questions
Language:Python0 0
ArneDefauw/quick-tips
Language:Jupyter Notebook0 0
ArneDefauw/spatialdata
An open and universal framework for processing spatial omics data
Language:Python0 0
ArneDefauw/spatialdata-io
Language:Python
ArneDefauw/TopicalChange
Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.
Language:Python0 0
ArneDefauw/trafilatura
Web scraping library and command-line tool for text discovery and extraction (main content, metadata, comments)
Language:Python0 0
ArneDefauw/Voice-Privacy-Challenge-2020
Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf
Language:Shell0 0
ArneDefauw/word2word
Easy-to-use word-to-word translations for 3,564 language pairs.
Language:Python0 0
ArneDefauw/wordfreq
Access a database of word frequencies, in various natural languages.
Language:Python0 0