rmsnorm

There are 6 repositories under rmsnorm topic.

  • DefTruth/CUDA-Learn-Notes

    🎉CUDA/C++ 笔记 / 技术博客: fp32、fp16/bf16、fp8/int8、flash_attn、sgemm、sgemv、warp/block reduce、dot prod、elementwise、softmax、layernorm、rmsnorm、hist etc.

    Language:Cuda1.1k115109
  • bzhangGo/rmsnorm

    Root Mean Square Layer Normalization

    Language:Python2024111
  • knotgrass/Griffin

    Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

    Language:Python8211
  • dtunai/Tri-RMSNorm

    Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch.

    Language:Python5102
  • rmgogogo/nano-aigc

    Generative models nano version for fun. No STOA here, nano first.

    Language:Jupyter Notebook0200
  • sushantkumar23/nano-gpt

    Simple character level Transformer

    Language:Jupyter Notebook0100