-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Downloading data from Zooniverse; classification_export.status_code == 403 error #38
Comments
had a discussion with someone who is keen to use our pipeline down the road and they raised the concern about the delay for getting results. They think the ~24 hour wait to get classifications will impact their ability to do science... worth discussing in this context perhaps? |
@clareh What specific concerns did they have about the delay? Why does 24-hour delay affect their science capacity? |
Maybe the above two comments should be attached to a separate discussion? They seem not related to this issue/bug but seem related to the general discussion topic of how to fetch data. |
@clareh Could you start an issue or a new discussion on this? |
Hi! I've added myself to this as the Zooniverse contact. My first thought is that perhaps the failed requests are using expired Authorization headers but I will investigate. |
Thank you @eatyourgreens ! |
Hi again, Do you know if the classification export is being requested after its signed URL has expired? Here's an example of an expired link: The URLs expire 3 minutes after they're generated, so maybe that's the cause of the problem? If the signed URL has expired, I think that you need to retry and generate a new URL. |
zooniverse/panoptes#4209 might fix this, once it’s deployed to Panoptes production. Credit to @yuenmichelle1 for figuring out the caching problem: those classification links are good for 3 minutes but Panoptes caches for 5 minutes, so there's a 2 minute overlap where Panoptes can give you an expired link. |
Thank you for the update @eatyourgreens, we'll retest soon |
Describe the bug
The last cell of the citizen science notebook (the one that grabs the classifications from Zooniverse using panoptes client) fails every 10th time it runs.
To Reproduce
Steps to reproduce the behavior, written in imperative mood:
Expected behavior
That there be no error with downloading the classifications. In other words, classification_export.status_code == 200 and classification_export.ok == True.
Actual behavior
Sometimes (again only ~10th time this is run), classification_export.status_code == 403.
Screenshots
EDC Output
Additional context
Here is the code we wrote that bypasses this issue. We are not including this in the alpha version of the code release, but we'd like to include it down the road. Currently, we just have one comment that recommends re-running the cell if it fails.
The text was updated successfully, but these errors were encountered: