Issues
- 5
Support for "Deterministic Pipelines"
#243 opened by ruomingp - 0
What is a number?
#734 opened by justzh - 3
unable to train mt5 from t5x using mixtures ValueError: Dataset is missing an expected feature during input_validation validation: 'inputs'
#310 opened by StephennFernandes - 3
ValueError: mutable default <class 'seqio.vocabularies.PassThroughVocabulary'> for field vocabulary is not allowed: use default_factory
#562 opened by jli262 - 0
Unimax sampler implementation?
#565 opened by StephennFernandes - 0
unimax sampling ?
#556 opened by StephennFernandes - 0
- 0
Dataset performance
#547 opened by KeremTurgutlu - 2
- 2
Concatenating Tasks?
#513 opened by gahdritz - 2
- 2
Please include installation instructions
#261 opened by leiterenato - 1
- 2
seqio 0.0.13 cannot be installed on Apple Silicon due to transitive tensorflow dependency of clu
#396 opened by tuzhucheng - 2
Different preprocessors for each dataset split
#407 opened by marcospiau - 1
how to decide ideal mixture rates ?
#299 opened by StephennFernandes - 2
FunctionDataSource does not allow function with 3 positional arguments thus shuffling does not work
#307 opened by marton-avrios - 1
Tokenizer is not behaving as expected on special tokens (doesn't recognize `pad` and `eos` tokens)
#311 opened by armancohan - 1
Using a registered task to add another
#333 opened by AkshitaB - 0
import seqio
#291 opened by Arij-Aladel - 9
- 3
HuggingFace Tokenizers compatibility
#188 opened by gabeorlanski - 8
Dataset seeking for restarting from a T5X crashed run using HuggingFace datasets
#224 opened by versae - 0
Add Evaluator example
#223 opened by abisee - 1
Add method to directly add tasks/mixtures.
#215 opened by salayatana66 - 7
Using seqio for T5X Dataset Generation
#182 opened by stefan-it - 3
seqio_cache_tasks fails on DataflowRunner
#109 opened by bzz - 2
Possible ByteVocabulary Bug
#158 opened by xwd - 1