YuchuanTian/RethinkTinyLM
[ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”
Python
Issues
- 1
heads and embedding layers请教
#5 opened by aaronlyt - 0
Training cost
#4 opened by zhuyiche - 2
如何实现难样本挖掘?
#3 opened by LinB203 - 1
Vision based Tiny VLM
#2 opened by abhigoku10