Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix DELL PowerEdge iDRAC #170

Open
damyan opened this issue Nov 8, 2024 · 4 comments
Open

Fix DELL PowerEdge iDRAC #170

damyan opened this issue Nov 8, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@damyan
Copy link
Member

damyan commented Nov 8, 2024

The DELL iDRAC needs some time before it can deliver source information. Asking "too soon" leads to HTTP 503:
stderr F 2024-11-08T09:24:55Z ERROR Reconciler error {"controller": "server", "controllerGroup": "metal.ironcore.dev", "controllerKind": "Server", "Server": {"name":"mgmt-1-system-0"}, "namespace": "", "name": "mgmt-1-system-0", "reconcileID": "6b4398aa-9256-4ce5-8e29-d23003ed8565", "error": "failed to update server status: failed to get system info for Server: failed to get systems: failed to retrieve some items: [{\"link\":\"/redfish/v1/Systems/System.Embedded.1\",\"error\":\"503: {\\\"error\\\":{\\\"@Message.ExtendedInfo\\\":[{\\\"Message\\\":\\\"iDRAC is currently unable to display any information because data sources are unavailable.\\\",\\\"MessageArgs\\\":[],\\\"[email protected]\\\":0,\\\"MessageId\\\":\\\"IDRAC.2.8.SYS518\\\",\\\"RelatedProperties\\\":[],\\\"[email protected]\\\":0,\\\"Resolution\\\":\\\"Wait for the data to be available and retry the operation. If the issue persists, contact your service provider.\\\",\\\"Severity\\\":\\\"Informational\\\"},{\\\"Message\\\":\\\"The service is temporarily unavailable. Retry in 30 seconds.\\\",\\\"MessageArgs\\\":[\\\"30\\\"],\\\"[email protected]\\\":1,\\\"MessageId\\\":\\\"Base.1.12.ServiceTemporarilyUnavailable\\\",\\\"RelatedProperties\\\":[],\\\"[email protected]\\\":0,\\\"Resolution\\\":\\\"Wait for the indicated retry duration and retry the operation.\\\",\\\"Severity\\\":\\\"Critical\\\"}],\\\"code\\\":\\\"Base.1.12.GeneralError\\\",\\\"message\\\":\\\"A general error has occurred. See ExtendedInfo for more information\\\"}}\"}]"}

A retry mechanism should be Implemented to work around the issue.

@damyan damyan added the bug Something isn't working label Nov 8, 2024
@afritzler
Copy link
Member

@defo89 @stefanhipfel have you seen this on Dell servers before, that the BIOS info after a reboot is not immediately available through Redfish?

@defo89
Copy link
Contributor

defo89 commented Nov 8, 2024

@defo89 @stefanhipfel have you seen this on Dell servers before, that the BIOS info after a reboot is not immediately available through Redfish?

No, I haven't seen this on the Dell servers we have (PowerEdge R660, iDRAC Firmware Version 7.10.30.05)

@afritzler
Copy link
Member

We tested it with a PowerEdge R440.

@stefanhipfel
Copy link
Contributor

stefanhipfel commented Nov 11, 2024

I added polling to wait for redfish system resources

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
Status: Ready
Development

No branches or pull requests

4 participants