Datasets use atomic write when persisting to disk #307

Partially fixes: #1954

Prior to this change, dataset.download() used a normal file write to persist downloaded data to disk. This meant another process or thread could check for the file and attempt to read it before the full content had been written.

This change writes to a temporary file and then renames it, so the file is updated atomically. If a process is already reading a file that the new version replaces, the previous file node is unlinked rather than overwritten, so the in-progress read works as expected.

This will allow us to back out the optimistic pre-loading of dataset data before it is needed, which currently causes 404 errors when running tests on machines without the appropriate permissions to download UK data.
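For reviewers unfamiliar with the pattern, here is a minimal sketch of a temporary-file-plus-rename write. It is not the code in this PR: the helper name `atomic_write` and its signature are hypothetical, and it assumes the payload is already in memory as bytes.

```python
import os
import tempfile
from pathlib import Path


def atomic_write(path: Path, data: bytes) -> None:
    """Hypothetical helper: write data so readers never see a partial file."""
    path = Path(path)
    # Create the temp file next to the target so the final rename stays on
    # one filesystem (a cross-filesystem rename would not be atomic).
    fd, tmp_name = tempfile.mkstemp(
        dir=path.parent, prefix=path.name + ".", suffix=".tmp"
    )
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
            tmp.flush()
            # Push the bytes to disk before the new name becomes visible.
            os.fsync(tmp.fileno())
        # Swap the finished file into place, replacing any existing target.
        os.replace(tmp_name, path)
    except BaseException:
        # On failure, remove the temp file and leave the target untouched.
        try:
            os.unlink(tmp_name)
        except FileNotFoundError:
            pass
        raise
```

A reader that calls `path.exists()` therefore sees either no file, the complete old file, or the complete new file. The unlink-on-replace behaviour for files that are already open is a POSIX property; on Windows, `os.replace` can fail while another process holds the target open.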

Commits on Nov 11, 2024

Update changelog_entry.yaml

nikhilwoodruff authored Nov 11, 2024


e49633f


Commits on Nov 8, 2024
