Add Continuous Decoding support in GQA #32112

Triggered via pull request July 26, 2024 17:02
Status Failure
Total duration 13m 55s

lint.yml

on: pull_request

Annotations

2 errors and 16 warnings
Python format
Process completed with exit code 1.
Optional Lint C++
reviewdog: Too many results (annotations) in diff. You may miss some annotations due to GitHub limitation for annotation created by logging command. Please check GitHub Actions log console to see all results.
Limitation:
- 10 warning annotations and 10 error annotations per step
- 50 annotations per job (sum of annotations from all the steps)
- 50 annotations per run (separate from the job annotations, these annotations aren't created by users)
Source: https://github.com/orgs/community/discussions/26680#discussioncomment-3252835
Python format
The following actions uses node12 which is deprecated and will be forced to run on node16: actions-rs/toolchain@v1. For more info: https://github.blog/changelog/2023-06-13-github-actions-all-actions-will-run-on-node16-instead-of-node12-by-default/
Python format
The following actions uses Node.js version which is deprecated and will be forced to run on node20: actions-rs/toolchain@v1. For more info: https://github.blog/changelog/2024-03-07-github-actions-all-actions-will-run-on-node20-instead-of-node16-by-default/
Python format
The `set-output` command is deprecated and will be disabled soon. Please upgrade to using Environment Files. For more information see: https://github.blog/changelog/2022-10-11-github-actions-deprecating-save-state-and-set-output-commands/
Python format
The `set-output` command is deprecated and will be disabled soon. Please upgrade to using Environment Files. For more information see: https://github.blog/changelog/2022-10-11-github-actions-deprecating-save-state-and-set-output-commands/
Python format
The `set-output` command is deprecated and will be disabled soon. Please upgrade to using Environment Files. For more information see: https://github.blog/changelog/2022-10-11-github-actions-deprecating-save-state-and-set-output-commands/
Python format
The `set-output` command is deprecated and will be disabled soon. Please upgrade to using Environment Files. For more information see: https://github.blog/changelog/2022-10-11-github-actions-deprecating-save-state-and-set-output-commands/
Optional Lint C++: onnxruntime/contrib_ops/cpu/bert/gqa_attention_base.h#L14
[cpplint] reported by reviewdog 🐶 Found C++ system header after other header. Should be: gqa_attention_base.h, c system, c++ system, other. [build/include_order] [4] Raw Output: onnxruntime/contrib_ops/cpu/bert/gqa_attention_base.h:14: Found C++ system header after other header. Should be: gqa_attention_base.h, c system, c++ system, other. [build/include_order] [4]
Optional Lint C++: onnxruntime/contrib_ops/cpu/bert/gqa_attention_base.h#L117
[cpplint] reported by reviewdog 🐶 At least two spaces is best between code and comments [whitespace/comments] [2] Raw Output: onnxruntime/contrib_ops/cpu/bert/gqa_attention_base.h:117: At least two spaces is best between code and comments [whitespace/comments] [2]
Optional Lint C++: onnxruntime/contrib_ops/cpu/bert/gqa_attention_base.h#L239
[cpplint] reported by reviewdog 🐶 Lines should be <= 120 characters long [whitespace/line_length] [2] Raw Output: onnxruntime/contrib_ops/cpu/bert/gqa_attention_base.h:239: Lines should be <= 120 characters long [whitespace/line_length] [2]
Optional Lint C++: onnxruntime/contrib_ops/cpu/bert/gqa_attention_base.h#L239
[cpplint] reported by reviewdog 🐶 At least two spaces is best between code and comments [whitespace/comments] [2] Raw Output: onnxruntime/contrib_ops/cpu/bert/gqa_attention_base.h:239: At least two spaces is best between code and comments [whitespace/comments] [2]
Optional Lint C++: onnxruntime/contrib_ops/cpu/bert/group_query_attention.cc#L19
[cpplint] reported by reviewdog 🐶 Found C++ system header after other header. Should be: group_query_attention.h, c system, c++ system, other. [build/include_order] [4] Raw Output: onnxruntime/contrib_ops/cpu/bert/group_query_attention.cc:19: Found C++ system header after other header. Should be: group_query_attention.h, c system, c++ system, other. [build/include_order] [4]
Optional Lint C++: onnxruntime/contrib_ops/cpu/bert/group_query_attention.cc#L132
[cpplint] reported by reviewdog 🐶 Lines should be <= 120 characters long [whitespace/line_length] [2] Raw Output: onnxruntime/contrib_ops/cpu/bert/group_query_attention.cc:132: Lines should be <= 120 characters long [whitespace/line_length] [2]
Optional Lint C++: onnxruntime/contrib_ops/cpu/bert/group_query_attention_helper.h#L227
[cpplint] reported by reviewdog 🐶 At least two spaces is best between code and comments [whitespace/comments] [2] Raw Output: onnxruntime/contrib_ops/cpu/bert/group_query_attention_helper.h:227: At least two spaces is best between code and comments [whitespace/comments] [2]
Optional Lint C++: onnxruntime/contrib_ops/cuda/bert/cutlass_fmha/fmha_launch_template.h#L45
[cpplint] reported by reviewdog 🐶 Lines should be <= 120 characters long [whitespace/line_length] [2] Raw Output: onnxruntime/contrib_ops/cuda/bert/cutlass_fmha/fmha_launch_template.h:45: Lines should be <= 120 characters long [whitespace/line_length] [2]
Optional Lint C++: onnxruntime/contrib_ops/cuda/bert/group_query_attention_helper.h#L249
[cpplint] reported by reviewdog 🐶 At least two spaces is best between code and comments [whitespace/comments] [2] Raw Output: onnxruntime/contrib_ops/cuda/bert/group_query_attention_helper.h:249: At least two spaces is best between code and comments [whitespace/comments] [2]
Optional Lint C++: onnxruntime/contrib_ops/cuda/bert/group_query_attention_impl.cu#L168
[cpplint] reported by reviewdog 🐶 Lines should be <= 120 characters long [whitespace/line_length] [2] Raw Output: onnxruntime/contrib_ops/cuda/bert/group_query_attention_impl.cu:168: Lines should be <= 120 characters long [whitespace/line_length] [2]
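
The three cpplint categories reported above (build/include_order, whitespace/comments, whitespace/line_length) can be illustrated with a minimal sketch. The file name `example_helper.cc` and the function `TotalSequenceLength` are hypothetical and are not taken from the PR. Only the rule names, the expected include order, and the 120-character limit come from the annotations.

```cpp
// Hypothetical file: example_helper.cc (illustration only, not part of the PR).
//
// build/include_order: cpplint expects the file's own header first, then
// C system headers, then C++ system headers, then other headers.
// The own header ("example_helper.h") is omitted so this sketch compiles alone.

#include <stddef.h>  // C system header

#include <vector>  // C++ system header

// Other project headers would follow here.

// whitespace/line_length: wrap long signatures so no line exceeds 120 characters.
inline size_t TotalSequenceLength(const std::vector<size_t>& past_lengths,
                                  size_t new_tokens) {
  size_t total = new_tokens;  // whitespace/comments: two spaces before "//"
  for (size_t len : past_lengths) {
    total += len;  // add the length of each previously cached segment
  }
  return total;
}
```

Running cpplint locally on the touched files before pushing should surface the same categories without hitting the annotation limit described in the reviewdog warning.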