AndromedaPerseus/PoSE
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
PythonMIT
Watchers
No one’s watching this repository yet.
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
PythonMIT
No one’s watching this repository yet.