Set FP16 KV-cache for non-quantized text models #4505
Triggered via pull request: November 29, 2024 12:22
Status: Cancelled
Total duration: 2m 44s
Artifacts: –
build_pr_documentation.yml
on: pull_request
build_documentation
2m 29s
Annotations
2 errors
build_documentation
Canceling since a higher priority waiting request for 'Build PR documentation-ak/fp16_cache_for_fp16_models' exists
build_documentation
The operation was canceled.