tannonk
PhD Student in Computational Linguistics at the University of Zurich.
University of ZurichZurich, Switzerland
Pinned Repositories
FS2019-TextMining
Repository for course mini-project
llm_inference
LLM inference with HuggingFace (experimental)
muss
Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".
two-headed-master
Development of ASR for ArchiMob, a spoken corpus of Swiss German.
20Minuten
BLESS
Code for the EMNLP 2023 paper "BLESS: Benchmarking Large Language Models on Sentence Simplification"
multilingual-instruction-tuning
Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"
SimpleFUDGE
Code for the paper "Target-Level Sentence Simplification as Controlled Paraphrasing" (TSAR 2022)
specific_hospo_respo
Code for hospitality review response generation
understanding-ctx-aug
Code for the 2023 ACL Findings paper, Uncovering Hidden Consequences of Pre-training Objectives in Sequence-to-Sequence Models (Kew & Sennrich, 2023)
tannonk's Repositories
tannonk/llm_inference
LLM inference with HuggingFace (experimental)
tannonk/FS2019-TextMining
Repository for course mini-project
tannonk/muss
Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".
tannonk/understanding_control_tokens
tannonk/3b1b_videos
Code for the manim-generated scenes used in 3blue1brown videos
tannonk/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
tannonk/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
tannonk/fudge
tannonk/GeDi
GeDi: Generative Discriminator Guided Sequence Generation
tannonk/GPTScore
Source Code of Paper "GPTScore: Evaluate as You Desire"
tannonk/K2T
tannonk/langchain
⚡ Building applications with LLMs through composability ⚡
tannonk/LENS
tannonk/lm-evaluation-harness-de
A framework for few-shot evaluation of autoregressive language models.
tannonk/mixmatch
Mireshghallah et al., 2022, Mix and Match: Learning-free Controllable Text Generation using Energy Language Models
tannonk/multilingual-modeling
Adapting BLOOM model to support a new unseen language
tannonk/NAR-PMI
Code for Findings of ACL2021 paper: A non-autoregressive edit-based approach to controllable text simplification.
tannonk/prompting_exercise
LLM prompt engineering exercise for ML4NLP2024
tannonk/PyPaperBot
PyPaperBot is a Python tool for downloading scientific papers using Google Scholar, Crossref, and SciHub.
tannonk/resources
PyMC3 educational resources
tannonk/salsa
Success and Failure Linguistic Simplification Annotation 💃
tannonk/textplot
(Mental) maps of texts with kernel density estimation and force-directed networks.
tannonk/transformer-contributions-nmt
tannonk/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
tannonk/trl
Train transformer language models with reinforcement learning.
tannonk/ts-explore
Source code for Text Simplification Evaluation papers at ACL findings and CTTS workshop.
tannonk/TS_annotation_tool
Annotation Tool for Text Simplification Corpora
tannonk/UniEval
Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation
tannonk/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
tannonk/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs