flashinfer.comm.vllm_register_buffer

flashinfer.comm.vllm_register_buffer(fa: int, fake_ipc_ptrs: List[int]) None

Register a peer’s IPC-shared buffer with the local all-reduce handle.

Parameters:
  • fa (int) – Handle returned by init_custom_ar().

  • fake_ipc_ptrs (list[int]) – Per-rank IPC pointers obtained from each peer.