.. _apiprefill: flashinfer.prefill ================== Attention kernels for prefill & append attention in both single request and batch serving setting. .. currentmodule:: flashinfer.prefill Single Request Prefill/Append Attention --------------------------------------- .. autosummary:: :toctree: ../generated single_prefill_with_kv_cache single_prefill_with_kv_cache_return_lse Batch Prefill/Append Attention ------------------------------ .. autoclass:: BatchPrefillWithPagedKVCacheWrapper :members: :exclude-members: begin_forward, end_forward, forward, forward_return_lse .. automethod:: __init__ .. autoclass:: BatchPrefillWithRaggedKVCacheWrapper :members: :exclude-members: begin_forward, end_forward, forward, forward_return_lse .. automethod:: __init__