Integrate ZeroDDP and ShardedModelV2 from Colossal-AI
dongsungkim opened this issue · 1 comment
dongsungkim commented
Describe a TODO feature
Colossal-AI has two versions of ZeRO support: ZeroDDP and ShardedModelV2.
- Check whether the two can be merged into one.
- Otherwise, add a function that chooses one of them based on a flag (not exposed to users directly).
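A minimal sketch of the second option, assuming a module-level internal flag. The names `ZeroBackend`, `_USE_SHARDED_MODEL_V2`, and `select_zero_backend` are hypothetical and only illustrate the idea; nothing here calls the Colossal-AI API.

```python
from enum import Enum
from typing import Optional


class ZeroBackend(Enum):
    """The two ZeRO implementations named in this issue (labels only)."""
    ZERO_DDP = "ZeroDDP"
    SHARDED_MODEL_V2 = "ShardedModelV2"


# Hypothetical internal flag: set by the library, not by end users.
_USE_SHARDED_MODEL_V2 = False


def select_zero_backend(use_sharded_model_v2: Optional[bool] = None) -> ZeroBackend:
    """Pick one backend; fall back to the internal flag when no override is given."""
    if use_sharded_model_v2 is None:
        use_sharded_model_v2 = _USE_SHARDED_MODEL_V2
    if use_sharded_model_v2:
        return ZeroBackend.SHARDED_MODEL_V2
    return ZeroBackend.ZERO_DDP
```

Keeping the choice behind one function means the wrapper that builds the model only needs a single call site, and the flag can be removed later if the two versions are merged.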
Assignees
- Dongsung and Hyen
hyeinhyun commented
There are updates to ZeRO-DP in Colossal-AI:
Zero 1 -> Low_level_optimizer
Zero 2 -> Low_level_optimizer (partitioned gradient =True)
Zero 3 -> Sharded optimizer v2, ShardedModelV2
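The mapping above could be kept in one lookup table, roughly like this. This is only a sketch: the component names are plain-text labels copied from the list, not verified import paths, and `ZeroStageConfig`, `ZERO_STAGE_TABLE`, and `zero_stage_config` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ZeroStageConfig:
    """What each ZeRO stage uses, as listed in this comment (labels only)."""
    optimizer: str
    model_wrapper: Optional[str] = None
    partitioned_gradient: bool = False


# Stage number -> components, following the list in this comment.
ZERO_STAGE_TABLE = {
    1: ZeroStageConfig(optimizer="Low_level_optimizer"),
    2: ZeroStageConfig(optimizer="Low_level_optimizer", partitioned_gradient=True),
    3: ZeroStageConfig(optimizer="Sharded optimizer v2", model_wrapper="ShardedModelV2"),
}


def zero_stage_config(stage: int) -> ZeroStageConfig:
    """Return the components for a ZeRO stage, or raise for an unknown stage."""
    try:
        return ZERO_STAGE_TABLE[stage]
    except KeyError:
        raise ValueError(f"Unsupported ZeRO stage: {stage}") from None
```

For example, `zero_stage_config(2)` returns the same optimizer label as stage 1 but with `partitioned_gradient` set to `True`.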