Cannot download using wget or gcp #145

mahela97 · 2022-09-23T18:04:31Z

When i tried to download the dataset using wget command it just created sub directories with index.html and when i tried using GCP it gives me this error "BadRequestException: 400 Bucket is a requester pays bucket but no user project provided."

tompollard · 2022-09-23T18:12:20Z

To download from Google Cloud, you'll need to specify a project ID for covering any download costs. See: https://stackoverflow.com/questions/47739741/bucket-is-requester-pays-bucket-but-no-user-project-provided

If you have a project ID then you can specify it in the download command (see: https://cloud.google.com/storage/docs/using-requester-pays#using). e.g. for gsutil:

gsutil -u PROJECT_IDENTIFIER cp gs://BUCKET_NAME/OBJECT_NAME OBJECT_DESTINATION

If you want to avoid download fees, you can download the data from the PhysioNet servers using the suggested wget command. This will be slower!

mahela97 · 2022-09-23T18:18:32Z

@tompollard thank you for the reply. I am trying using wget but inside the subdirectories, there is only an index.html file instead of images. could you please help me to fix that?

tompollard · 2022-09-23T18:26:14Z

@mahela97 I'm not clear what dataset you are trying to download, but essentially I think you'll need to be patient! wget loads the directory structure before files, I believe, so you may not immediately see data within directories.

mahela97 · 2022-09-23T18:29:48Z

@tompollard i am trying with the mimic-cxr. Thank you for the help. Will check after few hours.

tompollard · 2022-09-23T18:33:49Z

@mahela97 makes sense, it's a large dataset! I think this is the same issue described at:

I'd be interested in hearing how long the download takes to complete with wget.

edeiana23 · 2023-04-26T09:12:15Z

Was anybody able to download the dataset with wget command? I left my computer on the whole weekend but nothing happened. I still only see the folders and the index.html file but no images in dicom format. Asking specifically to @mahela97 @ayhyap who raised the issue previously.

ayhyap · 2023-04-26T09:22:54Z

Was anybody able to download the dataset with wget command? I left my computer on the whole weekend but nothing happened. I still only see the folders and the index.html file but no images in dicom format. Asking specifically to @mahela97 @ayhyap who raised the issue previously.

Yes the files eventually transferred, it just took a while.
It is possible that some error occurred that caused your wget command to be interrupted.
The command provided on the mimic-cxr page includes the relevant options to resume the transfer if it stopped prematurely.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cannot download using wget or gcp #145

Cannot download using wget or gcp #145

mahela97 commented Sep 23, 2022

tompollard commented Sep 23, 2022

mahela97 commented Sep 23, 2022

tompollard commented Sep 23, 2022

mahela97 commented Sep 23, 2022

tompollard commented Sep 23, 2022

edeiana23 commented Apr 26, 2023

ayhyap commented Apr 26, 2023

Cannot download using wget or gcp #145

Cannot download using wget or gcp #145

Comments

mahela97 commented Sep 23, 2022

tompollard commented Sep 23, 2022

mahela97 commented Sep 23, 2022

tompollard commented Sep 23, 2022

mahela97 commented Sep 23, 2022

tompollard commented Sep 23, 2022

edeiana23 commented Apr 26, 2023

ayhyap commented Apr 26, 2023