Skip to content

Accept token_ids in Shortfin LLM Server #1204

Accept token_ids in Shortfin LLM Server

Accept token_ids in Shortfin LLM Server #1204

Triggered via pull request January 24, 2025 14:11
Status Cancelled
Total duration 2m 6s
Artifacts

ci_eval_short.yaml

on: pull_request
Matrix: IREE Perplexity
Fit to window
Zoom out
Zoom in

Annotations

1 error
IREE Perplexity (3.11, llama-mi300x-3)
Canceling since a higher priority waiting request for 'CI - sharktank perplexity short-862' exists