/BAdam

[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models

Primary language: Python. License: Apache License 2.0 (Apache-2.0).
