Pinned Repositories
academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
CopperMT
[ACL 2021, Findings] Cognate Prediction Per Machine Translation
CustomScLM
EtymDB
[LREC 2020] EtymDB, an Etymological DataBase (v2.1)
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
historical-semantic-change
Code for the L'Change paper "Caveats of Measuring Semantic Change of Cognates and Borrowings using Multilingual Word Embeddings"
lighteval
PLexGen
[LT4HALA 2020] Phonetic lexicon generator and sound change applier
PyExt
Several simple extensions that add some nifty features to Python
lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
clefourrier's Repositories
clefourrier/EtymDB
[LREC 2020] EtymDB, an Etymological DataBase (v2.1)
clefourrier/CopperMT
[ACL 2021, Findings] Cognate Prediction Per Machine Translation
clefourrier/PLexGen
[LT4HALA 2020] Phonetic lexicon generator and sound change applier
clefourrier/lighteval
clefourrier/CustomScLM
clefourrier/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
clefourrier/historical-semantic-change
Code for the L'Change paper "Caveats of Measuring Semantic Change of Cognates and Borrowings using Multilingual Word Embeddings"
clefourrier/PyExt
Several simple extensions that add some nifty features to Python
clefourrier/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
clefourrier/acl-2020-virtual-conference
Repository for the ACL 2020 virtual conference website (work in progress)
clefourrier/bertviz
Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert, XLNet, RoBERTa, CTRL, etc.)
clefourrier/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
clefourrier/blog
Public repo for HF blog posts
clefourrier/clefourrier
clefourrier/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
clefourrier/dojo-clean-code
clefourrier/evaluate
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
clefourrier/GPU-Puzzles
Solve puzzles. Learn CUDA.
clefourrier/helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
clefourrier/historical_texts
BigScience working group on language models for historical texts
clefourrier/interpretability-tutorial-emnlp2020
Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"
clefourrier/kill-the-newsletter
Convert email newsletters into Atom feeds
clefourrier/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
clefourrier/ml_timeline
clefourrier/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
clefourrier/selected-romance-sound-correspondences
Selected romance sound correspondances for Latin to a couple children.
clefourrier/stations
List of stations and associated metadata
clefourrier/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
clefourrier/twitter-alt-bot
Twitter alt bot
clefourrier/whisper