RobinQrtz's Stars
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
susumuota/nano-askllm
Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
allenai/open-instruct
arcee-ai/PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
pytorch/torchtitan
A native PyTorch Library for large model training
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
allenai/bff
allenai/papermage
library supporting NLP and CV research on scientific papers
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
NbAiLab/tokenizer-benchmark
Benchmark for Scandinavian Tokenizers
databricks/megablocks
LumiOpen/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
welfare-state-analytics/riksdagen-corpus
Swedish parliamentary proceedings - Riksdagens protokoll 1867-today
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
huggingface/datablations
Scaling Data-Constrained Language Models
hplt-project/data-analytics-tool
Data Analytics Tool
catie-aq/flashT5
A fast implementation of T5/UL2 in PyTorch using Flash Attention
ltgoslo/ltg-bert
LTG-Bert
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
stas00/ml-engineering
Machine Learning Engineering Open Book
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
arcee-ai/mergekit
Tools for merging pretrained large language models.
fastai/numerical-linear-algebra
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
karpathy/llama2.c
Inference Llama 2 in one file of pure C
ggerganov/llama.cpp
LLM inference in C/C++
karpathy/makemore
An autoregressive character-level language model for making more things
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
DigitalPhonetics/IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages!
google-research/deduplicate-text-datasets