Actions: NVIDIA/TensorRT-LLM

Showing runs from all workflows
662 workflow runs

qserve is slower than awq int4 for llama2-7b on H100
auto-assign #20: Issue #2509 labeled by hello-11
December 10, 2024 06:48 2s
Failed to install by poetry add
Blossom-CI #98: Issue comment #2515 (comment) created by hello-11
December 10, 2024 06:41 4s
int8 slower than bf16 on A100
auto-assign #19: Issue #2553 labeled by nv-guomingz
December 10, 2024 06:32 45s
int8 slower than bf16 on A100
auto-assign #18: Issue #2553 labeled by nv-guomingz
December 10, 2024 06:21 55s
How to extract the hidden_states before output_ids during the inference process.
Blossom-CI #97: Issue comment #2499 (comment) created by hello-11
December 10, 2024 06:12 4s
Close inactive issues
Close inactive issues #353: Scheduled
December 10, 2024 06:08 16s main
lora doesn't work when kv_cache is disabled
auto-assign #16: Issue #2543 labeled by nv-guomingz
December 10, 2024 06:01 41s
[bug] Medusa example fails with vicuna 33B
auto-assign #15: Issue #2478 labeled by nv-guomingz
December 10, 2024 05:52 42s
[QST] How to get the prefill latency and TPOT respectively when using C++ runtime
auto-assign #14: Issue #2500 labeled by hello-11
December 10, 2024 05:51 1s
[QST] How to get the prefill latency and TPOT respectively when using C++ runtime
Blossom-CI #96: Issue comment #2500 (comment) created by hello-11
December 10, 2024 05:51 4s
Close inactive issues
Close inactive issues #352: Scheduled
December 10, 2024 05:06 20s main
[Question] Running custom Encoder Decoder model
Blossom-CI #95: Issue comment #2491 (comment) created by hello-11
December 10, 2024 04:56 4s
Close inactive issues
Close inactive issues #351: Scheduled
December 10, 2024 04:07 15s main
int4 not faster than fp16 and fp8
Blossom-CI #94: Issue comment #2487 (comment) created by Tracin
December 10, 2024 03:25 5s
Issues with installing on Windows
Blossom-CI #93: Issue comment #2489 (comment) created by hello-11
December 10, 2024 03:25 6s
Close inactive issues
Close inactive issues #350: Scheduled
December 10, 2024 03:21 15s main
int4 not faster than fp16 and fp8
auto-assign #12: Issue #2487 labeled by hello-11
December 10, 2024 03:20 3s
How to install tensorrt-llm in python3.11?
Blossom-CI #92: Issue comment #2481 (comment) created by hello-11
December 10, 2024 02:57 4s
Close inactive issues
Close inactive issues #349: Scheduled
December 10, 2024 02:33 14s main
Close inactive issues
Close inactive issues #348: Scheduled
December 10, 2024 01:34 16s main
Close inactive issues
Close inactive issues #347: Scheduled
December 10, 2024 00:27 19s main
Close inactive issues
Close inactive issues #346: Scheduled
December 9, 2024 23:06 15s main
int8 slower than bf16 on A100
auto-assign #11: Issue #2553 labeled by ShuaiShao93
December 9, 2024 23:01 3s