Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.
Python · Apache-2.0
Issues
Where is tokenizer.model? tokenizer path
#488 opened by andreamigliorati - 0
Questions about the dataset and training method
#489 opened by feizhuqwq - 6
Running into StopIteration with single node multi GPU pretraining against the redpajama sample
#465 opened by cabal-daniel - 5
Using Llama 3 through lit-llama
#487 opened by fireyanci - 1
Why can't the generate function be used twice?
#485 opened by WyGongya - 0
Converting from lit-llama to HF checkpoint?
#484 opened by jacqueline-he - 0
It seems the hash of the training data is lost, so it's impossible to resume fine-tuning after stopping
#483 opened by drazdra - 0
OSError: Not found: "checkpoints/lit-llama/tokenizer.model": No such file or directory Error #2
#481 opened by anirudhitagi - 0
TPU Training
#480 opened by kathir-ks - 1
Mistral Model
#458 opened by PierreColombo - 1
RuntimeError: Expected x1.dtype() == cos.dtype() to be true, but got false. (Could this error message be improved? If so, please report an enhancement request to PyTorch.)
#472 opened by JerryDaHeLian - 0
Issue with Rotary Embedding Initialization when the number of devices is > 1
#479 opened by diegodoimo - 0
Beam search generation
#477 opened by hellowoe23 - 1
Ban some tokens
#474 opened by AnnaKholkina - 0
RuntimeError: cutlassF: no kernel found to launch!
#471 opened by xvanQ - 0
How to quantize LLaMA during fine-tuning?
#470 opened by sfarzi - 0
How to convert HF weights of the 70B model to lit-llama weights?
#469 opened by sfarzi - 3
Looking for LLaMA 2?
#452 opened by carmocca - 2
Full fine-tuning on Alpaca dataset with 4 L40s GPUs fails 8 hours into the training job with index_copy_
#468 opened by cabal-daniel - 0
How to hold a conversation with a fine-tuned model?
#464 opened by Harsh-raj - 0
[question] nan loss value and run time error
#463 opened by nevermet - 0
[question] assert lora_path.is_file() error
#462 opened by nevermet - 2
[question] error message while finetuning
#460 opened by nevermet - 0
Question about 'validating...' from lora.py
#461 opened by nevermet - 0
When I fine-tuned the model, an error occurred during decoding: IndexError: Out of range: piece id is out of range.
#457 opened by HypherX - 3
lightning llama
#436 opened by sri9s - 2
How can I run inference with different prompts in a Jupyter Notebook, loading the model and tokenizer only once?
#450 opened by Vinter8848 - 0
Merely adding a linear layer to LLaMA, without any computation, degrades performance
#451 opened by YUCHEN005 - 1
(documentation) How do I know if generate.py is running on the GPU / GPU configuration
#449 opened by maathieu - 1
ValueError: Precision 'bf16-true' is invalid
#447 opened by AlphaGoMK - 3
Multiple GPUs for full fine-tuning
#446 opened by qiqiApink - 7
No response after training an epoch
#443 opened by Dylandtt - 2
LORA: RuntimeError: GET was unable to find an engine to execute this computation
#441 opened by LamOne1 - 0
Converting Adapter to Hugging Face format
#440 opened by LamOne1 - 1
Support conversion to Hugging Face format
#439 opened by LamOne1 - 1
Getting AssertionError when saving an FSDP-strategy-trained model with 16-mixed precision
#434 opened by JimenezBarreroDavid - 0
Combine adapter weights with the base model
#433 opened by wxl-lxw - 0
Question about FlashAttention and KV-cache
#430 opened by KnowingNothing