Actions: microsoft/onnxruntime-genai

Windows CUDA x64 Build

2,320 workflow runs

Update onnxruntime-extension
Windows CUDA x64 Build #2377: Pull request #1160 synchronize by skyline75489
December 25, 2024 06:42 28m 18s jialli/ortx
Update onnxruntime-extension
Windows CUDA x64 Build #2376: Pull request #1160 synchronize by skyline75489
December 23, 2024 06:45 22m 16s jialli/ortx
Update onnxruntime-extension
Windows CUDA x64 Build #2375: Pull request #1160 synchronize by skyline75489
December 23, 2024 06:43 2m 7s jialli/ortx
[Model builder] Add option to exclude cache in inputs and outputs
Windows CUDA x64 Build #2374: Pull request #1162 opened by xenova
December 22, 2024 12:09 26m 12s xenova:patch-1
Add Granite to model builder
Windows CUDA x64 Build #2373: Pull request #1153 synchronize by kunal-vaishnavi
December 21, 2024 00:25 33m 6s kvaishnavi/granite
Recompute KV cache for Phi3 when switching from short to long factor
Windows CUDA x64 Build #2372: Pull request #1161 synchronize by ajindal1
December 20, 2024 23:04 27m 26s abjindal/phi3_reset_compute_cache
Update ORT GenAI examples (#1150)
Windows CUDA x64 Build #2371: Commit daefc4f pushed by kunal-vaishnavi
December 20, 2024 21:03 22m 39s main
Address a DML regression caused by the continuous decoding changes
Windows CUDA x64 Build #2370: Pull request #1159 synchronize by baijumeswani
December 20, 2024 20:29 23m 25s baijumeswani/fix-dml
Recompute KV cache for Phi3 when switching from short to long factor
Windows CUDA x64 Build #2369: Pull request #1161 synchronize by ajindal1
December 20, 2024 19:37 19m 39s abjindal/phi3_reset_compute_cache
Add pre-generated prompts option for benchmark
Windows CUDA x64 Build #2367: Pull request #1091 synchronize by omer-demir
December 20, 2024 19:00 24m 35s omer-demir:omerdemir/pre_generated_prompts
Address a DML regression caused by the continuous decoding changes
Windows CUDA x64 Build #2366: Pull request #1159 synchronize by baijumeswani
December 20, 2024 18:42 23m 57s baijumeswani/fix-dml
Add Granite to model builder
Windows CUDA x64 Build #2365: Pull request #1153 synchronize by kunal-vaishnavi
December 20, 2024 09:02 32m 6s kvaishnavi/granite
Update ORT GenAI examples
Windows CUDA x64 Build #2364: Pull request #1150 synchronize by kunal-vaishnavi
December 20, 2024 08:44 21m 42s kvaishnavi/update-examples
Update ORT GenAI examples
Windows CUDA x64 Build #2363: Pull request #1150 synchronize by skyline75489
December 20, 2024 06:34 21m 50s kvaishnavi/update-examples
Update onnxruntime-extension
Windows CUDA x64 Build #2362: Pull request #1160 synchronize by skyline75489
December 20, 2024 06:28 9m 1s jialli/ortx
Update onnxruntime-extension
Windows CUDA x64 Build #2361: Pull request #1160 synchronize by skyline75489
December 20, 2024 05:35 9m 12s jialli/ortx
Update ORT GenAI examples
Windows CUDA x64 Build #2360: Pull request #1150 synchronize by kunal-vaishnavi
December 20, 2024 04:02 22m 49s kvaishnavi/update-examples
Update onnxruntime-extension
Windows CUDA x64 Build #2359: Pull request #1160 opened by skyline75489
December 20, 2024 04:02 8m 55s jialli/ortx
Address a DML regression caused by the continuous decoding changes
Windows CUDA x64 Build #2358: Pull request #1159 opened by baijumeswani
December 19, 2024 22:20 17m 35s baijumeswani/fix-dml
Update ORT GenAI examples
Windows CUDA x64 Build #2357: Pull request #1150 synchronize by kunal-vaishnavi
December 19, 2024 22:19 24m 39s kvaishnavi/update-examples
Update ORT GenAI examples
Windows CUDA x64 Build #2356: Pull request #1150 synchronize by kunal-vaishnavi
December 19, 2024 18:25 24m 42s kvaishnavi/update-examples
Update ORT GenAI examples
Windows CUDA x64 Build #2355: Pull request #1150 synchronize by kunal-vaishnavi
December 19, 2024 18:05 20m 4s kvaishnavi/update-examples
Update ORT GenAI examples
Windows CUDA x64 Build #2354: Pull request #1150 synchronize by kunal-vaishnavi
December 19, 2024 07:20 28m 20s kvaishnavi/update-examples
Support constrained decoding
Windows CUDA x64 Build #2353: Pull request #1038 synchronize by Taka152
December 19, 2024 07:13 35m 48s yingxiong/constrained_decoding