/llm-parameter-stats

How do parameter statistics change over training in LLMs?

Primary LanguagePythonApache License 2.0Apache-2.0

llm-parameter-stats

How do parameter statistics change over training in LLMs?