bigscience-workshop/Megatron-DeepSpeed

Are there any other layer norm functions, such as RMSNorm or DeepNorm

lvcc2018 opened this issue · 0 comments

Are there any other layer norm functions, such as RMSNorm or DeepNorm