run deepseek v3 on a single node. Drops unused experts from memory.
Primary LanguagePythonMIT LicenseMIT
No issues in this repository yet.