lewtun

LLM Research and Engineering @ Hugging Face

@huggingface
Bern, Switzerland

Pinned Repositories

alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python 4.8k 112 137418
course
The Hugging Face course on Transformers
Language: MDX 2.3k 52 159770
diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
Language: Jupyter Notebook 3.7k 96 28407
setfit
Efficient few-shot learning with Sentence Transformers
Language: Jupyter Notebook 2.3k 22 323225
trl
Train transformer language models with reinforcement learning.
Language: Python 10.4k 77 1.3k 1.3k
dl4phys
Deep learning for particle physicists
Language: Jupyter Notebook 25 3 07
dslectures
Course materials for introductory data science
Language: Jupyter Notebook 56 4 512
hepml
Practical machine learning for physicists
Language: Jupyter Notebook 15 3 05
notebooks
Jupyter notebooks for the Natural Language Processing with Transformers book
Language: Jupyter Notebook 3.9k 63 1011.2k

lewtun's Repositories

lewtun/dslectures
Course materials for introductory data science
Language: Jupyter Notebook 56 4 512
lewtun/hepml
Practical machine learning for physicists
Language: Jupyter Notebook 15 3 05
lewtun/awesome-rlhf
A curated list of resources dedicated to Reinforcement Learning from Human Feedback (RLHF).
8 4 00
lewtun/chatty-lms
A Hugging Face Space to compare various dialogue-prompted language models
5 1 00
lewtun/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python 3 0 0
lewtun/deepcode
Machine learning on source code for the Rocket platform
Language: Jupyter Notebook 1 2 0
lewtun/pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences
Language: Python 1 1 0
lewtun/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python 1 0 0
lewtun/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language: Jupyter Notebook 0 0
lewtun/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Language: Python 0 0
lewtun/chatgpt-failures
Failure archive for ChatGPT and similar models
Language: Python 0 0
lewtun/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
lewtun/distilabel
⚗️ AI Feedback framework for scalable LLM alignment
Language: Python 0 0
lewtun/easyllm
Language: Jupyter Notebook 0 01
lewtun/FLAN
Language: Python 0 0
lewtun/following-instructions-human-feedback
0 0
lewtun/google-research
Google Research
Language: Jupyter Notebook 0 0
lewtun/langchain
⚡ Building applications with LLMs through composability ⚡
Language: Python 0 01
lewtun/language-model-agents
Experiments with generating open-source language model assistants
Language: Jupyter Notebook 0 0
lewtun/lm-evaluation-harness
A framework for few-shot evaluation of language models.
Language: Python 0 0
lewtun/MixEval
The official evaluation suite and dynamic data release for MixEval.
Language: Python 0 0
lewtun/Open-Assistant
Language: Python 0 0
lewtun/orpo
Official repository for ORPO
Language: Shell 0 0
lewtun/rlhf-interface
Language: Python 0 0
lewtun/self-instruct
Aligning pretrained language models with instruction data generated by themselves.
Language: Python 0 0
lewtun/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
Language: Python 0 0
lewtun/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language: Python 0 0
lewtun/SubjQA
A question-answering dataset with a focus on subjective information
1 0
lewtun/trl
Train transformer language models with reinforcement learning.
Language: Python 0 0
lewtun/WebShop
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Language: Python 0 0