fix(veritech): When errors occur that don't cause us to retry the function, inform subscribers #5263

Merged: 1 commit into main from brit/bubble-up-veritech-errors, Jan 16, 2025

Conversation

britmyerss
Contributor

This change ensures that, for all function types, when an error occurs (such as a function being killed or timing out), we inform subscribers of the failure so they can handle it appropriately.

The main issue this fixes: during DVU jobs, if a function times out or is killed, the job continues to run until it hits its own timeout, exacerbating hung change sets.
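To illustrate the idea, here is a minimal sketch using only the Rust standard library. It is not the veritech implementation: `FunctionOutcome`, `RunError`, `publish_outcome`, and `wait_for_outcome` are hypothetical names, and a plain `mpsc` channel stands in for whatever transport the real subscribers use. The point is only that the publisher sends a failure message on every error path, so the subscriber does not sit idle until its own timeout.

```rust
use std::sync::mpsc::{channel, Receiver, RecvTimeoutError, Sender};
use std::time::Duration;

/// Hypothetical message delivered to subscribers (not the real veritech type).
#[derive(Debug, PartialEq)]
enum FunctionOutcome {
    Success(String),
    Failure(String),
}

/// Hypothetical reasons a run can end without a normal result.
#[derive(Debug)]
enum RunError {
    Killed,
    TimedOut,
}

/// Publisher side: always send something, even when the run was killed
/// or timed out, so the subscriber learns about the failure right away.
fn publish_outcome(tx: &Sender<FunctionOutcome>, run: Result<String, RunError>) {
    let outcome = match run {
        Ok(value) => FunctionOutcome::Success(value),
        Err(RunError::Killed) => FunctionOutcome::Failure("function killed".to_string()),
        Err(RunError::TimedOut) => FunctionOutcome::Failure("function timed out".to_string()),
    };
    // A send error only means the subscriber has gone away; nothing to do.
    let _ = tx.send(outcome);
}

/// Subscriber side: wait for an outcome, giving up after `job_timeout`.
/// If the publisher stays silent on errors, this is where a job hangs.
fn wait_for_outcome(
    rx: &Receiver<FunctionOutcome>,
    job_timeout: Duration,
) -> Result<FunctionOutcome, RecvTimeoutError> {
    rx.recv_timeout(job_timeout)
}
```

With this shape, a killed function surfaces to the waiting job as a `Failure` almost immediately, while a publisher that sends nothing forces the job to wait out the full `job_timeout`.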



@github-actions github-actions bot added the A-veritech Area: Task execution backend service [Rust] label Jan 15, 2025

github-actions bot commented Jan 15, 2025

Dependency Review

✅ No vulnerabilities or OpenSSF Scorecard issues found.

Ok(function_result) => {
    if let Err(err) = publisher.publish_result(&function_result).await {
        metric!(counter.function_run.action = -1);
Contributor

This is scooped up by the dec call below, yeah?

Contributor

yeah were we double decrementing here?

Contributor Author

yeah I think just leftover from pre-refactor
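For readers following this thread, one way to make a double decrement structurally impossible is a drop guard. This is a swapped-in technique for illustration, not what the PR does (the PR simply relies on the later decrement call). `InFlightGuard` and `run_and_publish` are hypothetical names, and an `AtomicI64` stands in for the `counter.function_run.action` metric.

```rust
use std::sync::atomic::{AtomicI64, Ordering};

/// Hypothetical guard: increments the gauge when a run starts and
/// decrements it exactly once when the guard goes out of scope.
struct InFlightGuard<'a> {
    gauge: &'a AtomicI64,
}

impl<'a> InFlightGuard<'a> {
    fn start(gauge: &'a AtomicI64) -> Self {
        gauge.fetch_add(1, Ordering::SeqCst);
        Self { gauge }
    }
}

impl Drop for InFlightGuard<'_> {
    fn drop(&mut self) {
        self.gauge.fetch_sub(1, Ordering::SeqCst);
    }
}

/// Hypothetical handler: `publish_ok` simulates whether publishing the
/// result succeeded. Neither branch touches the gauge directly.
fn run_and_publish(gauge: &AtomicI64, publish_ok: bool) -> Result<(), String> {
    let _guard = InFlightGuard::start(gauge);
    if !publish_ok {
        // No manual decrement here: dropping `_guard` handles it once.
        return Err("publish failed".to_string());
    }
    Ok(())
}
```

Because the decrement lives only in `Drop`, an early return on a publish error cannot decrement the gauge a second time.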

zacharyhamm previously approved these changes Jan 15, 2025
Contributor

@zacharyhamm zacharyhamm left a comment

This keeps the DVU from getting stuck! 🚀

@britmyerss britmyerss added this pull request to the merge queue Jan 16, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jan 16, 2025
@britmyerss britmyerss force-pushed the brit/bubble-up-veritech-errors branch from e04f617 to c34536d Compare January 16, 2025 15:27
@github-actions github-actions bot added the A-dal label Jan 16, 2025
…ction, inform subscribers

This change ensures that, for all function types, when an error occurs (such as a function being killed or timing out), we inform subscribers of the failure so they can handle it appropriately.

The main issue this fixes: during Pinga jobs, if a function times out or is killed, the job continues to run until it hits its own timeout, exacerbating hung change sets.
@britmyerss britmyerss force-pushed the brit/bubble-up-veritech-errors branch from c34536d to 746f098 Compare January 16, 2025 15:32
@britmyerss
Contributor Author

/try


github-actions bot commented Jan 16, 2025

Okay, starting a try! I'll update this comment once it's running...
🚀 Try running here! 🚀

@britmyerss britmyerss added this pull request to the merge queue Jan 16, 2025
Merged via the queue into main with commit 2ca502d Jan 16, 2025
19 checks passed
@britmyerss britmyerss deleted the brit/bubble-up-veritech-errors branch January 16, 2025 16:05
Comment on lines +120 to +123
// This should be [`FuncRunState::Killed`]
// but because this is getting bubbled back to the Pinga Job running the
// action, and the error is now embedded in a successful response,
// (because we did get a response from Veritech)
Contributor

UGH! So sorry about this... that explains a lot.
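The code comment under review describes an error that arrives embedded in a successful response. A rough sketch of what that means for the caller, with hypothetical types (`FunctionResult`, `JobStatus`, and `job_status` are not names from the codebase): an `Ok` at the transport level only says a response came back, so the job still has to inspect the payload.

```rust
/// Hypothetical payload returned by the execution service.
#[derive(Debug, PartialEq)]
enum FunctionResult {
    Success(String),
    Failure(String),
}

/// Hypothetical status recorded by the job that requested the run.
#[derive(Debug, PartialEq)]
enum JobStatus {
    Succeeded,
    FunctionFailed(String),
    TransportFailed(String),
}

/// `Ok(..)` means a response was received, not that the function succeeded:
/// a kill or timeout shows up as `Ok(FunctionResult::Failure(..))`.
fn job_status(response: Result<FunctionResult, String>) -> JobStatus {
    match response {
        Ok(FunctionResult::Success(_)) => JobStatus::Succeeded,
        Ok(FunctionResult::Failure(message)) => JobStatus::FunctionFailed(message),
        Err(transport_error) => JobStatus::TransportFailed(transport_error),
    }
}
```

Treating every `Ok` as success would silently mark a killed function as having succeeded, which is why the caller matches on the inner result as well.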

Labels
A-dal A-veritech Area: Task execution backend service [Rust]

4 participants