/dsv3-lowmem

Run DeepSeek-V3 on a single node by dropping unused experts from memory.
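The idea of dropping unused experts can be illustrated with a minimal sketch. This is not the repository's code: the function names `select_experts_to_keep` and `drop_unused_experts`, the `min_hits` threshold, and the routing-trace format are all hypothetical, and expert weights are stood in for by plain strings. It assumes you have recorded which experts the router picked for each token on a sample workload.

```python
from collections import Counter


def select_experts_to_keep(routing_trace, num_experts, min_hits=1):
    """Return sorted ids of experts that were routed to at least min_hits times.

    routing_trace: list of per-token lists of expert ids (hypothetical format).
    """
    hits = Counter(e for token_experts in routing_trace for e in token_experts)
    return [e for e in range(num_experts) if hits[e] >= min_hits]


def drop_unused_experts(expert_weights, keep_ids):
    """Keep only the selected experts.

    Returns the pruned weight list and a map from old expert id to new index,
    which a router would need in order to address the compacted list.
    """
    remap = {old: new for new, old in enumerate(keep_ids)}
    pruned = [expert_weights[old] for old in keep_ids]
    return pruned, remap


# Toy example: 6 experts, each token routed to 2 of them.
trace = [[0, 3], [3, 5], [0, 5], [3, 0]]
weights = [f"expert_{i}_weights" for i in range(6)]  # placeholders for tensors

keep = select_experts_to_keep(trace, num_experts=6)
pruned, remap = drop_unused_experts(weights, keep)

print(keep)   # [0, 3, 5]
print(remap)  # {0: 0, 3: 1, 5: 2}
```

In a real model, a token routed to a dropped expert at inference time would need some fallback (for example, rerouting to the next-best kept expert); that step is not shown here.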

Primary language: Python. License: MIT.
