diff --git a/+tests/+unit/TutorialTest.m b/+tests/+unit/TutorialTest.m index 15ea7e33..2d542129 100644 --- a/+tests/+unit/TutorialTest.m +++ b/+tests/+unit/TutorialTest.m @@ -25,7 +25,8 @@ 'basicUsage.mlx', ... % depends on external data 'convertTrials.m', ... % depends on basicUsage output 'formatStruct.m', ... % Actually a utility script, not a tutorial - 'read_demo.mlx'}; % depends on external data + 'read_demo.mlx', ... % depends on external data + 'remote_read.mlx'}; % Uses nwbRead on s3 url, potentially very slow % SkippedFiles - Name of exported nwb files to skip reading with pynwb SkippedFiles = {'testFileWithDataPipes.nwb'} % does not produce a valid nwb file @@ -78,7 +79,7 @@ function setupMethod(testCase) methods (Test) function testTutorial(testCase, tutorialFile) %#ok - run(tutorialFile) + C = evalc( 'run(tutorialFile)' ); %#ok testCase.testReadTutorialNwbFileWithPynwb() end end diff --git a/tutorials/html/remote_read.html b/tutorials/html/remote_read.html index bd167e04..9746120e 100644 --- a/tutorials/html/remote_read.html +++ b/tutorials/html/remote_read.html @@ -1,9 +1,12 @@ -Remote read of NWB files

Remote read of NWB files

It is possible to read an NWB file (or any HDF5 file) in MATLAB directly from several different kinds of remote locations, including AWS, Azure Blob Storage and HDFS. This tutorial will walk you through specifically loading a MATLAB file from AWS S3, which is the storage used by the DANDI archive. See MATLAB documentation for more general information.
To read an NWB file file from an s3 store, first you need to figure out the s3 path of that resource. The easiest way to do this is to use the DAND web client.
  1. (skip if on DANDI Hub) Make sure you do not have a file ~/.aws/credentials. If you do, rename it to something else. On Windows this file would be somewhere like C:/Users/username/.aws/credentials.
  2. Find and select a dandiset you want on the DANDI Archive, then click
  3. Navigate to the NWB file of interest and click
  4. Find the second entry of "contentURL"In your MATLAB session, take the end of that url and stick it onto s3 = 's3://dandiarchive/blobs/. In this case, you would have
s3 = 's3://dandiarchive/blobs/7ee/415/7ee41580-9b0b-44ca-8675-6959ddd8dc33'
  1. Read from that s3 path directly with
nwbfile = nwbRead(s3);
That's it! MATLAB will automatically detect that this is an S3 path instead of a local filepath and will set up a remote read interface for that NWB file. This appoach works on any computer with a fairly recent version of MATLAB and an internet connection. It works particularly well on the DANDI Hub, which has a very fast connection to the DANDI S3 store and which provides a MATLAB environment for free provided you have a license.

MATLAB vs. Python remote read

Python also allows you to remotely read a file, and has several advantages over MATLAB. Reading in Python is faster. On DANDI Hub, for MATLAB, reading the file takes about 51 seconds, while the analogous operation takes less than a second in Python. Python also allows you to create a local cache so you are not repeatedly requesting the same data, which can further speed up data access. Overall, we recommend remote reading using Python instead of MATLAB.
+.CodeBlock { background-color: #F5F5F5; margin: 10px 0 10px 0; } +.S4 { border-left: 0.998264px solid rgb(217, 217, 217); border-right: 0.998264px solid rgb(217, 217, 217); border-top: 0.998264px solid rgb(217, 217, 217); border-bottom: 0.998264px solid rgb(217, 217, 217); border-radius: 4px; padding: 6px 45px 4px 13px; line-height: 18.004px; min-height: 0px; white-space: nowrap; color: rgb(33, 33, 33); font-family: Menlo, Monaco, Consolas, "Courier New", monospace; font-size: 14px; } +.S5 { margin: 10px 10px 9px 4px; padding: 0px; line-height: 21px; min-height: 0px; white-space: pre-wrap; color: rgb(33, 33, 33); font-family: Helvetica, Arial, sans-serif; font-style: normal; font-size: 14px; font-weight: 400; text-align: left; } +.S6 { margin: 20px 10px 5px 4px; padding: 0px; line-height: 20px; min-height: 0px; white-space: pre-wrap; color: rgb(33, 33, 33); font-family: Helvetica, Arial, sans-serif; font-style: normal; font-size: 20px; font-weight: 700; text-align: left; }

Remote read of NWB files

It is possible to read an NWB file (or any HDF5 file) in MATLAB directly from several different kinds of remote locations, including AWS, Azure Blob Storage and HDFS. This tutorial will walk you through specifically loading a MATLAB file from AWS S3, which is the storage used by the DANDI archive. See MATLAB documentation for more general information.
To read an NWB file file from an s3 store, first you need to figure out the s3 path of that resource. The easiest way to do this is to use the DANDI web client.
  • (skip if on DANDI Hub) Make sure you do not have a file ~/.aws/credentials. If you do, rename it to something else. On Windows this file would be somewhere like C:/Users/username/.aws/credentials.
  • Find and select a dandiset you want on the DANDI Archive, then click
  • Navigate to the NWB file of interest and click
  • Find the second entry of "contentURL"
  • In your MATLAB session, take the end of that url (the blob id) and add it to this expression: s3 = 's3://dandiarchive/blobs/<blob_id>'. In this case, you would have:
s3 = 's3://dandiarchive/blobs/7ee/415/7ee41580-9b0b-44ca-8675-6959ddd8dc33';
  • Read from that s3 path directly with:
nwbfile = nwbRead(s3);
That's it! MATLAB will automatically detect that this is an S3 path instead of a local filepath and will set up a remote read interface for that NWB file. This appoach works on any computer with a fairly recent version of MATLAB and an internet connection. It works particularly well on the DANDI Hub, which has a very fast connection to the DANDI S3 store and which provides a MATLAB environment for free provided you have a license.

Note: MATLAB vs. Python remote read

Python also allows you to remotely read a file, and has several advantages over MATLAB. Reading in Python is faster. On DANDI Hub, for MATLAB, reading the file takes about 51 seconds, while the analogous operation takes less than a second in Python. Python also allows you to create a local cache so you are not repeatedly requesting the same data, which can further speed up data access. Overall, we recommend remote reading using Python instead of MATLAB.