BramVanroy
👋 My name is Bram and I work on natural language processing and machine translation (evaluation), but I also spend a lot of time in this open-source world 🌍
@CCL-KULeuven @INLBelgium
Pinned Repositories
astred
An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. Useful, for instance, for comparing a translation with the original text, finding differences and similarities between two translations, or seeing how a machine translation differs from a reference translation.
bert-for-inference
A small repo showing how to easily use BERT (or other transformers) for inference
bicorpus-preprocessing
fietje-2
An open, efficient LLM for Dutch
mai-simplification-nl-2023
Sentence-Level Text Simplification for Dutch
mateo-demo
MAchine Translation Evaluation Online (MATEO)
multilingual-text-to-amr
spacy-extreme
An example of how to use spaCy for extremely large files without running into memory issues
spacy_conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.
spacy_download
Download and load spaCy models on-the-fly
BramVanroy's Repositories
BramVanroy/spacy_conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.
BramVanroy/fietje-2
An open, efficient LLM for Dutch
BramVanroy/mateo-demo
MAchine Translation Evaluation Online (MATEO)
BramVanroy/mai-simplification-nl-2023
Sentence-Level Text Simplification for Dutch
BramVanroy/multilingual-text-to-amr
BramVanroy/dutch-instruction-datasets
BramVanroy/llm-finetuning
BramVanroy/xml_query
BramVanroy/alignment-handbook
Robust recipes to align language models with human and AI preferences
BramVanroy/clin34-benchmarks
BramVanroy/CommonCrawl-CreativeCommons
BramVanroy/evaluate
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
BramVanroy/aclpub2
BramVanroy/bitsandbytes
8-bit CUDA functions for PyTorch
BramVanroy/bramvanroy.github.io
BramVanroy/clarin-spf
A Python package that interacts with the CLARIN SPF API to retrieve the 'logged in' cookies needed to interact with the APIs of services that require authentication.
BramVanroy/clin33-zero-shot
BramVanroy/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
BramVanroy/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
BramVanroy/distilabel
Distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency
BramVanroy/distilabel-helm-instruct-adaptable-evaluation-criteria
A repo that implements Stanford CRFM's HELM Instruct with adaptable evaluation criteria
BramVanroy/llama.cpp
LLM inference in C/C++
BramVanroy/lm-evaluation-harness
BramVanroy/mt-benchmarks
BramVanroy/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
BramVanroy/outlines
Generative Model Programming
BramVanroy/penman
PENMAN notation (e.g. AMR) in Python
BramVanroy/ReinWeb
BramVanroy/sacrebleu
Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons
BramVanroy/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.