Shivanandroy/simpleT5
simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.
PythonMIT
Issues
- 0
Settings for Multiple GPU Ports usage
#63 opened by Abubakkar13 - 2
AttributeError: 'LightningDataModule' object has no attribute '_has_setup_TrainerFn.FITTING'
#57 opened by tr4wzified - 0
import error
#62 opened by PranshuKala - 1
AttributeError: 'LightningDataModule' object has no attribute '_has_setup_TrainerFn.FITTING'
#61 opened by lucbuijs - 3
How to resume training?
#43 opened by RK-BAKU - 2
how to train in multi-gpus
#31 opened by jiangliqin - 0
How can we evaluate model using some metric?
#60 opened by MuFazil - 0
about labels and decoder_attention_mask
#59 opened by gitfor20 - 4
'SimpleT5' object has no attribute 'device'
#20 opened by athack - 0
- 1
Training model on low RAM GPU
#48 opened by mbledkowski - 0
Data parallelism technique for Training simplet5 model – CUDA out of memory proplem
#56 opened by NashaatRJ - 1
Pytorch two devices error
#55 opened by mystsec - 2
how to generate attention mask seperately?
#52 opened by Liujingxiu23 - 3
How to use flan-t5?
#51 opened by Bachstelze - 4
TPU support
#27 opened by peregilk - 2
test and eval sets the same?
#50 opened by mgh1 - 5
ValueError: text input must of type `str` (single example), `List[str]` (batch or single pretokenized example) or `List[List[str]]` (batch of pretokenized examples).
#10 opened by j0st - 0
`do_sample` does nothing
#49 opened by mgh1 - 5
- 3
Possible to feed target as array
#47 opened by cclegend90 - 2
Push finished model
#44 opened by peregilk - 1
- 1
Model.predict on vector of strings
#29 opened by Nagakiran1 - 0
Does this rep support DeepSpeed?
#41 opened by benjpau - 5
Unicode Charecter training issue
#40 opened by rahat10120141 - 1
ValueError: text input must of type `str` (single example), `List[str]` (batch or single pretokenized example) or `List[List[str]]` (batch of pretokenized examples).
#39 opened by Ushanjay - 0
Saved model name not customizable
#37 opened by ke-lara - 0
- 0
No Model.Save method?
#33 opened by simonhughes22 - 0
Train on specific cuda device
#32 opened by iknoorjobs - 6
Suppress the Output Models
#21 opened by bayismet - 0
- 4
Metrics and logging.
#11 opened by Sripaad - 0
Is Byt5 Supported?
#28 opened by tbetth01 - 9
ValueError: transformers.__spec__ is None
#12 opened by seregadgl20-oss - 0
Azure Deployment
#26 opened by mandulasandeep - 1
model.predict
#25 opened by saloyiana - 0
- 5
simpleT5 for Grammatical Error Correction
#15 opened by pradeepdev-1995 - 4
- 6
TypeError: forward() got an unexpected keyword argument 'cross_attn_head_mask In onnx_predict function
#8 opened by farshadfiruzi - 0
CUDA error or IndexError (on CPU)
#14 opened by djstrong - 2
byT5 with version 0.1.2
#9 opened by kimgerdes - 6
colab error
#7 opened by yzhang-github-pub - 2
Is there any option for fine-tuning mt5 models instead of training from scratch?
#6 opened by farshadfiruzi - 1
Add License
#3 opened by derdanielb - 2
Support ByT5
#1 opened by seregadgl - 1
Is the task string necessary?
#2 opened by nikogamulin