Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate 'adaptive' mode for retries and improve concurrency of message sends from multiple services #1452

Open
terrazoon opened this issue Dec 3, 2024 · 0 comments

Comments

@terrazoon
Copy link
Contributor

terrazoon commented Dec 3, 2024

Right now when we do large sends we are hitting some ThrottlingExceptions with a "max retries exceeded: 4" message. This is not our celery retries, this is built into AWS. This occurs both for publishing messages and for calling FilterLogEvents when we are getting the delivery receipts.

Apparently there is a new, and currently experimental, 'adaptive' mode for AWS retries which takes into account service limits and hopefully reduces the number of AWS retries:

https://boto3.amazonaws.com/v1/documentation/api/latest/guide/retries.html

Investigate switching over to this adaptive mode when it becomes less experimental.

Some additional guidance is also available here:

Ultimately, we also want to make sure that a large batch from one partner doesn't become a blocker for all other work to go through the system from other partners.

@terrazoon terrazoon converted this from a draft issue Dec 3, 2024
@ccostino ccostino changed the title Investigate 'adaptive' mode for retries Investigate 'adaptive' mode for retries and improve concurrency of message sends from multiple services Dec 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: 🌱 New
Development

No branches or pull requests

1 participant