Issues
- 4
How to do sequence classification training ?
#169 opened by sparsh35 - 4
Custom dataset preprocessing
#171 opened by ayukh - 3
Nan losses with Gemma 1 DPO training on Kaggle TPU
#170 opened by defdet - 13
TPU v4-32 set-up not working
#166 opened by s-smits - 2
- 6
- 5
oom when llama2-7b sft
#163 opened by kuangdao - 1
- 5
TPU-v3 Kaggle not working after update
#161 opened by s-smits - 9
NaN loss in ORPOTrainer with legacy_sharded_vanilla
#156 opened by nyl199310 - 1
value error using flash attention
#158 opened by heydaari - 9
Falcon-11B: Dict key mismatch; expected keys: ['input_layernorm', 'mlp', 'self_attention']; dict: {'self_attention': {'query_key_value': {'kernel': Array
#154 opened by s-smits - 2
Logging into wandb.ai
#157 opened by heydaari - 6
Out of Memory issue in new easydel version.
#155 opened by nyl199310 - 4
[Feature Request] Add support for tiiuae/falcon-11B
#152 opened by s-smits - 3
checkpoint's size is increasing everytime.
#134 opened by IvoryTower800 - 1
Import Error
#150 opened by heydaari - 3
Mosaic kernels cannot be automatically partitioned. Please wrap the call in a shard_map or xmap
#149 opened by nyl199310 - 5
- 14
Kaggle training examples don't work
#140 opened by jcole75 - 3
Out of memory for serving example
#142 opened by xu3kev - 7
Can't load checkpoints continue training
#148 opened by IvoryTower800 - 6
training does not start using latest easydel
#146 opened by IvoryTower800 - 2
load_in_8bit doesn't work on Kaggle TPU
#143 opened by IvoryTower800 - 4
'LoraWeight' object has no attribute 'tolist'
#145 opened by defdet - 4
Please provide support for LLama3 or provide example on how to serve it using Easydel
#144 opened by jchauhan - 8
How to reduce TPU RAM when finetuning?
#131 opened by IvoryTower800 - 3
- 19
Transformers-like API for inference
#128 opened by Froggy111 - 6
Unable to Load EasyDeL State
#133 opened by w11wo - 2
- 6
a question about how to increase batch size.
#125 opened by IvoryTower800 - 9
- 10
Training with Ring Attention Failed
#120 opened by IvoryTower800 - 1
Docs site is broken https://erfanzar.github.io
#121 opened by nigh8w0lf - 8
Install from git not working
#118 opened by sr5434 - 5
Training in kaggle's TPU is failing
#117 opened by saidineshpola - 5
Error while finetuning Tinyllama on Kaggle TPU
#104 opened by jchauhan - 4
- 6
- 1
- 1
Easydel support on TPU v4.8 - getting exception
#111 opened by jchauhan - 3
Example shown on https://pypi.org/project/EasyDeL/ to finetune tinyllama raise exception on kaggle
#105 opened by jchauhan - 1
- 1
Exception while running any model - einops.EinopsError: Error while processing rearrange-reduction pattern "b (c n) d -> b c n d".
#106 opened by jchauhan - 1
None of the examples scripts works, that used to work earlier. Please test your examples again and update docs
#109 opened by jchauhan - 11
QLoRA Finetune Example
#103 opened by sr5434 - 3
Error while training GPT2 on the kaggle
#98 opened by jchauhan - 3
- 2
ValueError: `params` cannot be accessed from model when the model is created with `_do_init=False`.
#96 opened by jchauhan