wietsedv
PhD student at the University of Groningen, working on larger language models for smaller languages
University of Groningen
Pinned Repositories
acl-anthology
Data and software for building the ACL Anthology.
bertje
BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s layers? A closer look at the NLP pipeline in monolingual and multilingual models"
CEPEND
Contingent Event Pair Extractor for Natural Disasters
common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
docker-calibre-server
Automatically updating slim Debian image with the latest Calibre Server
dtw-numba
A faster Python DTW library with no precompiled code
dumb
A Benchmark for Smart Evaluation of Dutch Models (EMNLP 2023)
gpt2-recycle
As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)
low-resource-adapt
Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High" (ACL Findings 2021)
xpos
Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)
wietsedv's Repositories
wietsedv/bertje
BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s layers? A closer look at the NLP pipeline in monolingual and multilingual models"
wietsedv/gpt2-recycle
As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)
wietsedv/xpos
Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)
wietsedv/docker-calibre-server
Automatically updating slim Debian image with the latest Calibre Server
wietsedv/dumb
A Benchmark for Smart Evaluation of Dutch Models (EMNLP 2023)
wietsedv/low-resource-adapt
Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High" (ACL Findings 2021)
wietsedv/dtw-numba
A faster Python DTW library with no precompiled code
wietsedv/acl-anthology
Data and software for building the ACL Anthology.
wietsedv/CEPEND
Contingent Event Pair Extractor for Natural Disasters
wietsedv/common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
wietsedv/conda-forge-pinning-feedstock
A conda-smithy repository for conda-forge-pinning.
wietsedv/dumbench
wietsedv/espnet
End-to-End Speech Processing Toolkit
wietsedv/espnet-feedstock
A conda-smithy repository for espnet.
wietsedv/nixpkgs
Nix Packages collection & NixOS
wietsedv/NLP-NL
wietsedv/pan19-cross-domain-authorship-attribution
wietsedv/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
wietsedv/mojo
The Mojo Programming Language
wietsedv/staged-recipes
A place to submit conda recipes before they become fully fledged conda-forge feedstocks
wietsedv/WP24