facebookresearch/MobileLLM
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
Language: Python; license: NOASSERTION
Issues
Evaluation pipeline
#14 opened by kalyani7195 - 0
Pretraining dataset
#13 opened by kalyani7195 - 0
Function calling
#12 opened by darkzbaron - 0
Explain data preparation strategies
#10 opened by Atharva-Phatak - 2
Layer sharing model issues
#9 opened by pdh930105 - 0
Optimal Learning Rate
#8 opened by XinDongol - 2
eval issues?
#2 opened by appvoid - 1
Training data?
#3 opened by jacqueline-he