flashinfer.gdn_decode

Gated Delta-Rule decode-side kernels used in Mamba-2 / GDN-style sequence models. These functions consume a pre-built KV / state cache and run the recurrent gated delta-rule update for the current decode step.

gated_delta_rule_decode(q, k, v, state, ...)

Gated Delta Rule Decode kernel (K-major layout, no transpose needed).

gated_delta_rule_decode_pretranspose(q, k, ...)

Gated Delta Rule Decode kernel for single-token generation.

gated_delta_rule_mtp(q, k, v, initial_state, ...)

Gated Delta Rule MTP kernel (Multiple Token Processing).