Skip to content

Set FP16 KV-cache for non-quantized text models #4036

Set FP16 KV-cache for non-quantized text models

Set FP16 KV-cache for non-quantized text models #4036

Triggered via pull request November 29, 2024 12:22
Status Cancelled
Total duration 2m 41s
Artifacts

test_generation.yml

on: pull_request
Fit to window
Zoom out
Zoom in

Annotations

2 errors
build
Canceling since a higher priority waiting request for 'Generation Utils - Test (deprecated)-ak/fp16_cache_for_fp16_models' exists
build
The operation was canceled.