Skip to content

Set FP16 KV-cache for non-quantized text models #4505

Set FP16 KV-cache for non-quantized text models

Set FP16 KV-cache for non-quantized text models #4505

Triggered via pull request November 29, 2024 12:22
Status Cancelled
Total duration 2m 44s
Artifacts

build_pr_documentation.yml

on: pull_request
build_documentation
2m 29s
build_documentation
Fit to window
Zoom out
Zoom in

Annotations

2 errors
build_documentation
Canceling since a higher priority waiting request for 'Build PR documentation-ak/fp16_cache_for_fp16_models' exists
build_documentation
The operation was canceled.