flashinfer.activation.gelu_tanh_and_mul

flashinfer.activation.gelu_tanh_and_mul(input: torch.Tensor, out: Optional[torch.Tensor] = None) → torch.Tensor

Fused GeLU (tanh approximation) and Mul operation. The "tanh" in the name refers to the tanh-based approximation of GeLU applied to the first half of the input, which gates the second half:

gelu_tanh(input[..., :hidden_size]) * input[..., hidden_size:]

Parameters:
  • input (torch.Tensor) – Input tensor, shape (…, 2 * hidden_size).

  • out (Optional[torch.Tensor]) – The output tensor. If specified, the kernel updates this tensor in place.

Returns:

output – Output tensor, shape (…, hidden_size).

Return type:

torch.Tensor
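
To make the semantics concrete, here is a hedged, dependency-free reference sketch of what the fused kernel computes per row. It assumes the tanh approximation of GeLU (0.5·x·(1 + tanh(√(2/π)·(x + 0.044715·x³)))); the names `gelu_tanh` and `gelu_tanh_and_mul_ref` are illustrative, not part of the flashinfer API, and the real kernel operates on CUDA tensors rather than Python lists.

```python
import math

def gelu_tanh(x: float) -> float:
    # Tanh approximation of GeLU:
    # 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3)))
    return 0.5 * x * (1.0 + math.tanh(math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)))

def gelu_tanh_and_mul_ref(row):
    # row has length 2 * hidden_size: the first half is activated
    # with gelu_tanh, then multiplied elementwise by the second half.
    h = len(row) // 2
    return [gelu_tanh(row[i]) * row[h + i] for i in range(h)]
```

For example, a row of length 4 (hidden_size = 2) produces a length-2 output, matching the documented shape change from (…, 2 * hidden_size) to (…, hidden_size).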