princeton-nlp/DinkyTrain
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃
Python · MIT
Issues
While install dependencies ERROR: Could not find a version that satisfies the requirement hydra-core<1.1,>=1.0.7 (from fairseq) (from versions: none) ERROR: No matching distribution found for hydra-core<1.1,>=1.0.7
#9 opened by henrywang0314 - 2
fairseq-train: error: argument --arch/-a: invalid choice: 'deepspeed_roberta_large'
#7 opened by leoozy - 0
Converting fairseq models to huggingface fails for models trained using DeepSpeed
#5 opened by carlosejimenez - 4