dtunai/Tri-RMSNorm
Efficient kernel for RMS normalization with fused operations. Includes both forward and backward passes and is compatible with PyTorch.
Python · Apache-2.0
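
For orientation, RMS normalization scales a vector by the reciprocal of its root mean square, optionally followed by a per-element gain. The block below is a minimal pure-Python reference sketch of that forward computation only. It is not the repository's fused kernel, and the names `rms_norm`, `weight`, and `eps` are illustrative assumptions rather than this project's API.

```python
import math


def rms_norm(x, weight=None, eps=1e-6):
    """Reference RMS normalization over a 1-D list of floats.

    Hypothetical helper for illustration; not taken from the repository.
    """
    # Mean of squared elements across the vector.
    mean_sq = sum(v * v for v in x) / len(x)
    # Reciprocal root mean square; eps guards against division by zero.
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    # Default gain of 1.0 per element when no weight is supplied.
    if weight is None:
        weight = [1.0] * len(x)
    return [v * inv_rms * w for v, w in zip(x, weight)]


if __name__ == "__main__":
    print(rms_norm([3.0, 4.0]))
```

A fused kernel would typically compute the same quantities in a single pass over the data on the accelerator; this sketch only pins down the arithmetic being fused.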