Implemented Automated Healthchecks in the Grafana Dashboard #184
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This adds automated healthchecks to the to summary bar in the Dashboard.
Before:
After:
In addition to cluster Operator Healthchecks it features operator healthchecks - in particular - a ready check that verifies that at least one pod is ready and an operator reconcile errors check, that also displays which component has reconcile errors.
Here is an example of how that looks like for an unhealthy cluster:
There is an extra operator section with more detailed error information:
Bug Fixes:
Closes #183