flashinfer.gdn_prefill

Gated Delta-Rule prefill-side kernels. chunk_gated_delta_rule is the chunked GDN scan that initializes the recurrent state from a full prompt before the decode loop takes over.

chunk_gated_delta_rule(q, k, v[, g, beta, ...])

Chunked Gated Delta Rule (GDN) attention for prefill.