llm_code: A Python repository from ailian8025

glm-6b llama-7b bloom-1b 7b gpt2-0.125b 1.5b

import torch
torch.cuda.memory_allocated()/1024/1024
torch.cuda.max_memory_allocated()/1024/1024
torch.cuda.memory_summary()

peft是技巧性微调

PEFT approaches only fine-tune a small number of (extra) model parameters while freezing most parameters of the pretrained LLMs

ailian8025/llm_code