Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SRE-3945: Add service impacts to Datadog alerts in Slack #38

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

XCiber
Copy link
Collaborator

@XCiber XCiber commented Oct 11, 2023

This commit enriches the Datadog alert messages forwarded to Slack. Alert titles are now bolded, the date is included, as well as the alert priority and scope. There's also now a link to the specific event, and the affected services are listed. This change allows for quicker and more efficient responses to alerts. In the Datadog processor, the alert structure now includes 'Services', indicating which services are affected by the alert. The processor also now queries a Prometheus endpoint to get a list of services affected by the product specified in the alert scope.

In the Slack message template, it now extracts the channel and thread IDs from the same GitLab pipeline variables as before, but under updated identifiers. Moreover, it tries to fetch chart images by checking more snapshot URLs than before. These enrichments provide much-needed context right in the Slack message, removing the need to jump back to Datadog or GitLab for additional investigation.

This commit enriches the Datadog alert messages forwarded to Slack. Alert titles are now bolded, the date is included, as well as the alert priority and scope. There's also now a link to the specific event, and the affected services are listed. This change allows for quicker and more efficient responses to alerts. In the Datadog processor, the alert structure now includes 'Services', indicating which services are affected by the alert. The processor also now queries a Prometheus endpoint to get a list of services affected by the product specified in the alert scope.

In the Slack message template, it now extracts the channel and thread IDs from the same GitLab pipeline variables as before, but under updated identifiers. Moreover, it tries to fetch chart images by checking more snapshot URLs than before. These enrichments provide much-needed context right in the Slack message, removing the need to jump back to Datadog or GitLab for additional investigation.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant