flashinfer.norm
Kernels for normalization layers.
Root mean square normalization.
Root mean square normalization + fp8 quantization.
Fused add root mean square normalization.
Fused add root mean square normalization + fp8 quantization.
Gemma-style root mean square normalization.
Gemma-style fused add root mean square normalization.
Layer normalization.
Fused RMSNorm + SiLU activation.
Fused QK RMSNorm + 3D RoPE + V copy for video generation DIT self-attention.
Fused residual + LayerNorm + scale/shift for DIT self-attention.
Fused gate + residual + LayerNorm + scale/shift for DIT self-attention.
Fused gate + residual + LayerNorm(gamma, beta) for DIT self-attention.
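
For orientation, the arithmetic behind the basic entries above can be sketched in plain Python on a single row of values. This is a reference sketch only, not the library's kernels or API: the function names (`rmsnorm_ref`, `fused_add_rmsnorm_ref`, `gemma_rmsnorm_ref`, `layernorm_ref`), the argument order, and the `eps` default are all hypothetical, and the fused variant is assumed to add the input into the residual before normalizing. The fp8, SiLU, and DIT variants are not covered.

```python
import math


def rmsnorm_ref(x, weight, eps=1e-6):
    """RMSNorm over one row: x / sqrt(mean(x^2) + eps) * weight."""
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms * w for v, w in zip(x, weight)]


def fused_add_rmsnorm_ref(x, residual, weight, eps=1e-6):
    """Assumed fused-add semantics: residual + x, then RMSNorm of the sum.

    Returns (normalized_row, updated_residual). A fused kernel would
    typically do this in one pass; here the two steps are kept separate.
    """
    updated = [a + b for a, b in zip(x, residual)]
    return rmsnorm_ref(updated, weight, eps), updated


def gemma_rmsnorm_ref(x, weight, eps=1e-6):
    """Gemma-style RMSNorm: the scale applied is (1 + weight)."""
    return rmsnorm_ref(x, [1.0 + w for w in weight], eps)


def layernorm_ref(x, gamma, beta, eps=1e-6):
    """LayerNorm over one row: (x - mean) / sqrt(var + eps) * gamma + beta."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    inv_std = 1.0 / math.sqrt(var + eps)
    return [(v - mean) * inv_std * g + b for v, g, b in zip(x, gamma, beta)]


if __name__ == "__main__":
    row = [3.0, 4.0]
    print(rmsnorm_ref(row, [1.0, 1.0]))
    print(fused_add_rmsnorm_ref([1.0, 2.0], [2.0, 2.0], [1.0, 1.0]))
    print(gemma_rmsnorm_ref(row, [0.0, 0.0]))
    print(layernorm_ref(row, [1.0, 1.0], [0.0, 0.0]))
```

With a zero weight vector, the Gemma-style sketch gives the same result as plain RMSNorm with unit weights, which is the practical difference between the two parameterizations.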