[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
Primary LanguagePythonApache License 2.0Apache-2.0