Skip to content

[Kernels]Updated Triton kernels into 2.1.0 and adding flash-decoding for llama token attention #4368

[Kernels]Updated Triton kernels into 2.1.0 and adding flash-decoding for llama token attention

[Kernels]Updated Triton kernels into 2.1.0 and adding flash-decoding for llama token attention #4368

Annotations

1 warning

The logs for this run have expired and are no longer available.