flashinfer.mamba¶
Mamba / Mamba-2 state-space-model kernels. These wrap the selective scan and state-update primitives used in SSM blocks.
|
Selective state update operation for Mamba layers (the generation phase). |
|
Checkpointing SSU with MTP replay using matmul-based parallel token processing. |