flashinfer.green_ctx

This module provides utilities for creating CUDA green contexts.

Green context creation

split_device_green_ctx(dev, num_groups, ...)

Split the device into multiple green contexts, return the corresponding streams and CUdevResource for each group and the remaining SMs.

split_device_green_ctx_by_sm_count(dev, ...)

Split the device into multiple green contexts, each with a fixed number of SMs, return the corresponding streams and CUdevResource for each group and the remaining SMs.