IPNI outages should not affect Spark RSR #168

Open
bajtos opened this issue Sep 19, 2024 · 1 comment


bajtos commented Sep 19, 2024

Over the past several days, there were periods when IPNI returned 404 for most requests. As a result, Spark reported a very low RSR score, which in turn negatively affected SPs' reputation.


We should find a strategy for handling degradation of the IPNI service in a way that does not affect the reputation of storage providers, clients, and allocators.

bajtos changed the title from "Handle IPNI outages" to "IPNI outages should not affect Spark RSR" on Oct 3, 2024

bajtos commented Oct 18, 2024

Also:

AFAICT, ATM, we simply assume IPNI works: there is no timeout and no retry logic. If we cannot get the response or cannot parse the response body as JSON, we return the generic code `IPNI_ERROR_FETCH`.

We should add timeout & retry, potentially rotating between different IPNI endpoints/providers.
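
To make the proposal concrete, here is a minimal sketch of timeout + retry with endpoint rotation. It is not the Spark implementation. All names (`queryIpniWithRetry`, `pickEndpoint`, `backoffDelayMs`, `RetryOptions`), the endpoint list, and the `/cid/{cid}` lookup path are assumptions made for illustration; only the `IPNI_ERROR_FETCH` code comes from the comment above. The fetch function is injected so the logic can be exercised without network access.

```typescript
// Hypothetical sketch: names and defaults below are illustrative assumptions,
// not taken from the Spark codebase.

type JsonResponse = { ok: boolean; status: number; json(): Promise<unknown> }
type FetchLike = (url: string, init: { signal: AbortSignal }) => Promise<JsonResponse>

interface RetryOptions {
  endpoints: string[] // assumed list of IPNI base URLs to rotate through
  maxAttempts: number // total attempts across all endpoints
  timeoutMs: number // per-request timeout
  baseDelayMs: number // delay before the first retry
  fetchFn: FetchLike // e.g. the global `fetch`
}

type IpniResult = { ok: true; body: unknown } | { ok: false; code: string }

// Round-robin over the configured endpoints, one endpoint per attempt.
function pickEndpoint(endpoints: string[], attempt: number): string {
  return endpoints[attempt % endpoints.length]
}

// Exponential backoff: base, 2x base, 4x base, ... capped at maxDelayMs.
function backoffDelayMs(attempt: number, baseDelayMs: number, maxDelayMs = 10000): number {
  return Math.min(baseDelayMs * 2 ** attempt, maxDelayMs)
}

function sleep(ms: number): Promise<void> {
  return new Promise((resolve) => setTimeout(resolve, ms))
}

// Fetch and parse JSON, aborting the request if it exceeds timeoutMs.
async function fetchJsonWithTimeout(url: string, timeoutMs: number, fetchFn: FetchLike): Promise<unknown> {
  const controller = new AbortController()
  const timer = setTimeout(() => controller.abort(), timeoutMs)
  try {
    const res = await fetchFn(url, { signal: controller.signal })
    if (!res.ok) throw new Error(`HTTP ${res.status}`)
    return await res.json()
  } finally {
    clearTimeout(timer)
  }
}

// Try each attempt against the next endpoint; report the generic error code
// only after every attempt has failed.
async function queryIpniWithRetry(cid: string, opts: RetryOptions): Promise<IpniResult> {
  for (let attempt = 0; attempt < opts.maxAttempts; attempt++) {
    const base = pickEndpoint(opts.endpoints, attempt)
    try {
      // Assumption: the lookup path is `${base}/cid/${cid}`.
      const body = await fetchJsonWithTimeout(`${base}/cid/${cid}`, opts.timeoutMs, opts.fetchFn)
      return { ok: true, body }
    } catch {
      const isLastAttempt = attempt === opts.maxAttempts - 1
      if (!isLastAttempt) await sleep(backoffDelayMs(attempt, opts.baseDelayMs))
    }
  }
  return { ok: false, code: 'IPNI_ERROR_FETCH' }
}
```

One open question this sketch does not answer: it retries on every non-2xx response, including 404. A 404 from a healthy IPNI instance presumably means the content is not advertised, so a real implementation would need some way to tell that apart from the outage behaviour described above.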
