flashinfer.comm.vllm_register_graph_buffers¶
- flashinfer.comm.vllm_register_graph_buffers(fa: int, handles: List[List[int]], offsets: List[List[int]]) None¶
Register graph-capture buffers across the all-reduce world.
- Parameters:
fa (int) – Handle returned by
init_custom_ar().handles (list[list[int]]) – Per-rank IPC handles published via
get_graph_buffer_ipc_meta().offsets (list[list[int]]) – Per-rank offsets matching
handles.