[Synthetics] Alert Triggering at Point of Test Failure #149227

Open
drewpost opened this issue Jan 19, 2023 · 3 comments
Labels
Project:Alerting Synthetics Team:obs-ux-management Observability Management User Experience Team

Comments

@drewpost

For the Synthetics use case, customers need to be notified of issues with their monitors as close to instantly as possible. The alerting framework currently available to the app only supports alerting via a look-back window. This delays alert delivery and degrades our end users' experience.

We want alerts to be triggered by an event rather than by a time-based look-back window. This would enable the following user flow: a Synthetic monitor executes a scheduled or ad hoc test run -> the test run fails -> the monitor moves into an error state and a new error event is opened -> an alert is triggered. The time between each step of that flow should be kept to the absolute minimum.
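
To make the requested flow concrete, here is a minimal sketch of event-driven triggering: each completed test run is handed to a tracker, and an alert fires the moment a monitor moves from up to down. All names here (`TestRunResult`, `createTracker`, `onAlert`) are hypothetical and do not come from Kibana, Fleet, or the Synthetics service. It only illustrates the state transition, not how such a hook would be wired into the existing stack.

```typescript
// Hypothetical types for illustration only; not part of any Elastic API.
type RunStatus = "up" | "down";

interface TestRunResult {
  monitorId: string;
  status: RunStatus;
  finishedAt: number; // epoch milliseconds
}

interface Alert {
  monitorId: string;
  triggeredAt: number;
}

// Returns a handler to call once per completed test run (scheduled or ad hoc).
function createTracker(onAlert: (alert: Alert) => void) {
  const lastStatus = new Map<string, RunStatus>();

  return function handleResult(result: TestRunResult): void {
    // Assumption: a monitor we have not seen before is treated as "up".
    const previous = lastStatus.get(result.monitorId) ?? "up";
    lastStatus.set(result.monitorId, result.status);

    // Alert only on the up -> down transition, i.e. when a new error event opens.
    if (previous === "up" && result.status === "down") {
      onAlert({ monitorId: result.monitorId, triggeredAt: result.finishedAt });
    }
  };
}

const alerts: Alert[] = [];
const handleResult = createTracker((alert) => {
  alerts.push(alert);
});

handleResult({ monitorId: "checkout", status: "up", finishedAt: 1000 });
handleResult({ monitorId: "checkout", status: "down", finishedAt: 2000 });
handleResult({ monitorId: "checkout", status: "down", finishedAt: 3000 });

console.log(alerts.length); // 1: only the first failure opens a new error event
```

In this sketch the alert timestamp equals the failing run's finish time, so no look-back window sits between the failure and the alert.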

@drewpost drewpost added Team:Uptime - DEPRECATED Synthetics & RUM sub-team of Application Observability Project:Alerting Synthetics labels Jan 19, 2023
@elasticmachine
Contributor

Pinging @elastic/uptime (Team:uptime)

@andrewvc
Contributor

After going through this with @kobelb, we have two options:

  1. Decrease the poll interval from every 60s to every 30s in the current alert
  2. Find a way to immediately trigger an alert when an error occurs

Practically speaking, option 1 is far more feasible and would take less time. Option 2 would require rethinking good chunks of our Fleet integration, the Synthetics service, and Kibana alerting.

One barrier to option 1 is that serverless currently only supports a 60s minimum schedule, but they could make an exception for this alert.
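
As a rough illustration of what option 1 buys, the sketch below computes how long a failure waits for the next poll. It assumes polls land at exact multiples of the interval starting from zero and ignores ingestion and execution time, so the numbers are a best case rather than measured behavior. The function name `delayUntilNextPoll` is hypothetical.

```typescript
// Seconds between a failure and the next poll that can observe it.
// Assumption: polls run at t = 0, interval, 2 * interval, ... with no jitter.
function delayUntilNextPoll(failureAtSeconds: number, pollIntervalSeconds: number): number {
  const nextPoll = Math.ceil(failureAtSeconds / pollIntervalSeconds) * pollIntervalSeconds;
  return nextPoll - failureAtSeconds;
}

// A failure that lands just after a poll shows close to the worst case.
const failureAt = 61;
console.log(delayUntilNextPoll(failureAt, 60)); // 59
console.log(delayUntilNextPoll(failureAt, 30)); // 29
```

Under these assumptions the worst-case wait approaches the poll interval itself, so halving the interval from 60s to 30s roughly halves the worst-case delay but does not remove it.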

@smith smith added Team:obs-ux-management Observability Management User Experience Team and removed Team:Uptime - DEPRECATED Synthetics & RUM sub-team of Application Observability labels Jul 8, 2024
@elasticmachine
Contributor

Pinging @elastic/obs-ux-management-team (Team:obs-ux-management)
