Issues
Use the Adam8Bit Optimiser
#23 opened
Use Meet-in-the-Middle Objective
#22 opened
Optimize GPT-2 Model using MosaicML
#21 opened
Look into FlashAttention
#20 opened
Enable CUDA Mixed Precision Training
#12 opened
Upload The Stack Dedup v1.2 to S3
#4 opened