Pinned Repositories
axolotl
Go ahead and axolotl questions
Data-Generation-with-OpenAI
Generating synthetic data for model training using the OpenAI API (see the data-generation sketch after this list).
flash-attention
Fast and memory-efficient exact attention
gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
llm.c
LLM training in simple, raw C/CUDA
LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
LoRA-from-scratch
Low-Rank Adaptation of LLMs implemented using PyTorch (see the LoRA sketch after this list).
minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization (see the BPE sketch after this list).
RLHF-trainer
Supervised fine-tuning and alignment of LLMs with RLHF (reward modeling and PPO).
Transformer-from-scratch
Transformer from scratch using PyTorch [https://arxiv.org/pdf/1706.03762] (see the attention sketch after this list).
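
For Data-Generation-with-OpenAI, a minimal sketch of calling the OpenAI chat completions API to produce synthetic training examples. The model name, prompts, and helper function here are illustrative assumptions, not code from the repository.

    # Minimal synthetic-data sketch using the official openai Python SDK (v1+).
    # Model name and prompts are placeholders; adapt them to your setup.
    from openai import OpenAI

    client = OpenAI()  # expects OPENAI_API_KEY in the environment

    def generate_example(topic: str) -> str:
        # Ask the model for a single Q&A pair about `topic`.
        resp = client.chat.completions.create(
            model="gpt-4o-mini",  # placeholder model id
            messages=[
                {"role": "system", "content": "You write one Q&A pair for instruction tuning."},
                {"role": "user", "content": f"Write a question and answer about {topic}."},
            ],
        )
        return resp.choices[0].message.content

    samples = [generate_example(t) for t in ["tokenization", "quantization"]]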
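
For the LoRA and LoRA-from-scratch entries, a minimal PyTorch sketch of the core idea: a frozen pretrained linear layer plus a trainable low-rank update scaled by alpha / r. Class and attribute names are illustrative and are not taken from loralib.

    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        def __init__(self, in_features, out_features, r=8, alpha=16):
            super().__init__()
            self.base = nn.Linear(in_features, out_features, bias=False)
            self.base.weight.requires_grad_(False)  # frozen pretrained weight
            # Low-rank factors: B starts at zero so the initial update is zero.
            self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)
            self.lora_B = nn.Parameter(torch.zeros(out_features, r))
            self.scaling = alpha / r

        def forward(self, x):
            # y = x W^T + scaling * x (B A)^T
            return self.base(x) + self.scaling * (x @ self.lora_A.T @ self.lora_B.T)

Only lora_A and lora_B receive gradients, which is what keeps LoRA fine-tuning cheap relative to updating the full weight matrix.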
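
For minbpe, a minimal sketch of the BPE training loop: repeatedly count adjacent token pairs and merge the most frequent pair into a new id. The function name and the starting id of 256 (one id per raw byte) are assumptions for illustration and do not mirror minbpe's API.

    from collections import Counter

    def bpe_train(ids, num_merges, first_new_id=256):
        merges = {}
        for i in range(num_merges):
            pairs = Counter(zip(ids, ids[1:]))
            if not pairs:
                break
            pair = pairs.most_common(1)[0][0]  # most frequent adjacent pair
            new_id = first_new_id + i
            merges[pair] = new_id
            # Replace every occurrence of the pair with the new token id.
            out, j = [], 0
            while j < len(ids):
                if j < len(ids) - 1 and (ids[j], ids[j + 1]) == pair:
                    out.append(new_id)
                    j += 2
                else:
                    out.append(ids[j])
                    j += 1
            ids = out
        return ids, merges

    # Example: train on the raw UTF-8 bytes of a string.
    ids, merges = bpe_train(list("aaabdaaabac".encode("utf-8")), num_merges=3)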
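
For Transformer-from-scratch, a minimal PyTorch sketch of scaled dot-product attention as defined in the referenced paper, softmax(Q K^T / sqrt(d_k)) V. The function signature is an illustrative assumption, not the repository's API.

    import math
    import torch

    def scaled_dot_product_attention(q, k, v, mask=None):
        # q, k, v: (batch, heads, seq_len, head_dim)
        scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
        if mask is not None:
            scores = scores.masked_fill(mask == 0, float("-inf"))
        weights = torch.softmax(scores, dim=-1)
        return weights @ v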
lightmatmul's Repositories
lightmatmul/Transformer-from-scratch
Transformer from scratch using PyTorch [https://arxiv.org/pdf/1706.03762]
lightmatmul/axolotl
Go ahead and axolotl questions
lightmatmul/Data-Generation-with-OpenAI
Generating synthetic data for model training using the OpenAI API.
lightmatmul/flash-attention
Fast and memory-efficient exact attention
lightmatmul/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
lightmatmul/llm.c
LLM training in simple, raw C/CUDA
lightmatmul/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
lightmatmul/LoRA-from-scratch
Low-Rank Adaptation of LLMs implemented using PyTorch
lightmatmul/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
lightmatmul/RLHF-trainer
Supervised fine-tuning and alignment of LLMs with RLHF (reward modeling and PPO).
lightmatmul/sft-toolkit
lightmatmul/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.