/Tri-RMSNorm

Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch.

Primary LanguagePythonApache License 2.0Apache-2.0

Watchers

No one’s watching this repository yet.