erfanzar/EasyDeL

Accelerate, Optimize performance with streamlined training and serving options with JAX.

PythonApache-2.0

Issues

How to do sequence classification training ?
#169 opened a month ago by sparsh35
4
Custom dataset preprocessing
#171 opened a month ago by ayukh
4
Nan losses with Gemma 1 DPO training on Kaggle TPU
#170 opened 2 months ago by defdet
3
TPU v4-32 set-up not working
#166 opened 3 months ago by s-smits
13
Issue saving and converting the Gemma 2 model after training
#168 opened 3 months ago by sparsh35
2
Import error EasyDeL libraries examples/flash_attention_training_example.py
#165 opened 4 months ago by s-smits
6
oom when llama2-7b sft
#163 opened 5 months ago by kuangdao
5
EasyDeL
#164 opened 5 months ago by kuangdao
1
TPU-v3 Kaggle not working after update
#161 opened 6 months ago by s-smits
5
NaN loss in ORPOTrainer with legacy_sharded_vanilla
#156 opened 6 months ago by nyl199310
9
value error using flash attention
#158 opened 6 months ago by heydaari
1
Falcon-11B: Dict key mismatch; expected keys: ['input_layernorm', 'mlp', 'self_attention']; dict: {'self_attention': {'query_key_value': {'kernel': Array
#154 opened 6 months ago by s-smits
9
Logging into wandb.ai
#157 opened 6 months ago by heydaari
2
Out of Memory issue in new easydel version.
#155 opened 6 months ago by nyl199310
6
[Feature Request] Add support for tiiuae/falcon-11B
#152 opened 6 months ago by s-smits
4
checkpoint's size is increasing everytime.
#134 opened 6 months ago by IvoryTower800
3
Import Error
#150 opened 6 months ago by heydaari
1
Mosaic kernels cannot be automatically partitioned. Please wrap the call in a shard_map or xmap
#149 opened 6 months ago by nyl199310
3
AssertionError: Precision DEFAULT requested together with quantization.
#147 opened 7 months ago by peterniu19
5
Kaggle training examples don't work
#140 opened 7 months ago by jcole75
14
Out of memory for serving example
#142 opened 7 months ago by xu3kev
3
Can't load checkpoints continue training
#148 opened 7 months ago by IvoryTower800
7
training does not start using latest easydel
#146 opened 7 months ago by IvoryTower800
6
load_in_8bit doesn't work on Kaggle TPU
#143 opened 7 months ago by IvoryTower800
2
'LoraWeight' object has no attribute 'tolist'
#145 opened 7 months ago by defdet
4
Please provide support for LLama3 or provide example on how to serve it using Easydel
#144 opened 7 months ago by jchauhan
4
How to reduce TPU RAM when finetuning?
#131 opened 8 months ago by IvoryTower800
8
Attention Mask for Packed Sequences (via Attention Bias)
#129 opened 8 months ago by xingyaoww
3
Transformers-like API for inference
#128 opened 8 months ago by Froggy111
19
Unable to Load EasyDeL State
#133 opened 8 months ago by w11wo
6
Error converting easydel checkpoint to huggingface model.
#132 opened 8 months ago by IvoryTower800
2
a question about how to increase batch size.
#125 opened 8 months ago by IvoryTower800
6
How to continue training from a previous saved easydel checkpoint?
#126 opened 8 months ago by IvoryTower800
9
Training with Ring Attention Failed
#120 opened 8 months ago by IvoryTower800
10
Docs site is broken https://erfanzar.github.io
#121 opened 8 months ago by nigh8w0lf
1
Install from git not working
#118 opened 8 months ago by sr5434
8
Training in kaggle's TPU is failing
#117 opened 9 months ago by saidineshpola
5
Error while finetuning Tinyllama on Kaggle TPU
#104 opened 9 months ago by jchauhan
5
Output Differs from Hugging Face Transformer Result and EasyDel Results
#116 opened 9 months ago by jchauhan
4
[Urgent] Exception while load AdaptLLM/medicine-chat, variant of llama
#114 opened 9 months ago by jchauhan
6
Support HuggingFaceH4/zephyr-7b-beta serving using EasyDel
#110 opened 9 months ago by jchauhan
1
Easydel support on TPU v4.8 - getting exception
#111 opened 9 months ago by jchauhan
1
Example shown on https://pypi.org/project/EasyDeL/ to finetune tinyllama raise exception on kaggle
#105 opened 9 months ago by jchauhan
3
GPT2 (150M model) support on Tv2.8. Example scripts goes out of memory
#108 opened 9 months ago by jchauhan
1
Exception while running any model - einops.EinopsError: Error while processing rearrange-reduction pattern "b (c n) d -> b c n d".
#106 opened 9 months ago by jchauhan
1
None of the examples scripts works, that used to work earlier. Please test your examples again and update docs
#109 opened 9 months ago by jchauhan
1
QLoRA Finetune Example
#103 opened 9 months ago by sr5434
11
Error while training GPT2 on the kaggle
#98 opened 10 months ago by jchauhan
3
Potential regression causing resource exhausted after recent commit
#99 opened 10 months ago by yhavinga
3
ValueError: `params` cannot be accessed from model when the model is created with `_do_init=False`.
#96 opened 10 months ago by jchauhan
2