stefan-it
Researcher, M.Sc Computational Linguistics, Former student @ The Center for Information and Language Processing (CIS), LMU Munich
Near Munich, Germany
Pinned Repositories
capsnet-nlp
CapsNet for NLP
europeana-bert
BERT and ELECTRA models trained on Europeana Newspapers
fine-tuned-berts-seq
Fine-tuned Transformers compatible BERT models for Sequence Tagging
flair-experiments
Experiments with Zalando's flair library
gc4lm
GC4LM: A Colossal (Biased) language model for German
hmByT5
Upcoming Historical Multilingual and Monolingual ByT5 Models
nmt-en-vi
Neural Machine Translation system for English to Vietnamese (IWSLT'15 English-Vietnamese data)
turkish-bert
Turkish BERT/DistilBERT, ELECTRA and ConvBERT models
ukrainian-electra
Ukrainian ELECTRA model
xlm-v-experiments
Experiments for XLM-V Transformers Integration
stefan-it's Repositories
stefan-it/turkish-bert
Turkish BERT/DistilBERT, ELECTRA and ConvBERT models
stefan-it/flair-experiments
Experiments with Zalando's flair library
stefan-it/xlm-v-experiments
Experiments for XLM-V Transformers Integration
stefan-it/ukrainian-electra
Ukrainian ELECTRA model
stefan-it/delpher-lm
Language Model for Historic Dutch (Delpher Corpus)
stefan-it/hmBench
hmBench: Fine-Tuning, Evaluating & Benchmarking of Historic Language Models on NER Datasets
stefan-it/hmByT5
Upcoming Historical Multilingual and Monolingual ByT5 Models
stefan-it/germeval-ner-t5
Evaluating German T5 Models on GermEval 2014 (NER)
stefan-it/hmTEAMS
Historical Multilingual TEAMS Models
stefan-it/blbooks-lms
Pretrained Language Models on British Library Corpus
stefan-it/hetzner-gpu-server
My cheatsheet for Hetzner GPU Server Setup
stefan-it/stefan-it
stefan-it/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
stefan-it/adapters
A Unified Library for Parameter-Efficient and Modular Transfer Learning
stefan-it/awesome-huggingface
🤗 A list of wonderful open-source projects & applications integrated with Hugging Face libraries.
stefan-it/tecb-de
German Text Embedding Clustering Benchmark
stefan-it/api-inference-community
stefan-it/ASP
PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxiv.org/pdf/2210.14698.pdf
stefan-it/autotrain-advanced
🤗 AutoTrain Advanced
stefan-it/autotrain-flair-mobie
Example Repository for using AutoTrain with the Flair Library on the MobIE NER Dataset
stefan-it/binder
stefan-it/bpemb
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
stefan-it/charmen-electra
stefan-it/CleanCoNLL
The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03.
stefan-it/co-funer
Experiments on CO-Fun NER Dataset
stefan-it/georgian-ner
Resources about Named Entity Recognition for Georgian
stefan-it/hub-docs
Frontend components, documentation and information hosted on the Hugging Face website.
stefan-it/oxLSTM
Resources about open source xLSTM implementations
stefan-it/tanl
Structured Prediction as Translation between Augmented Natural Languages
stefan-it/TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo