Attempt to avoid some of the Enoent issues #1258

tunylund · 2023-12-01T08:39:18Z

Asset attempts to read metadata of the file every time the asset is saved. This seems to lead in error in production as the below stack trace shows. It looks like several processes attempt to change the state of the asset at the same time, some of the processes remove the actual local file which then causes the subjequent processes to fail reading the local file.

In any case, reading the local file should be necessary only rarely - not every time when Asset state is modified. This pr attempts to minize the reads.

It also looks like in production VirusScannerWorker gets executed several times at the same time on the same asset. I don't know why this happens, but this pr attemps to avoid unnecessary state transition from the VirusScanner, if another process has already taken over the state of the asset.

IDK if this fixes these issues, but maybe it might help reduce the noise in sentry 🤷‍♀️

Please see logit here

When debugging ERROR::ENOENT issues in production, i can see that file read attempts fail when Asset state is changed.

/app/app/models/asset.rb:251:in `stat'
/app/app/models/asset.rb:251:in `file_stat'
/app/app/models/asset.rb:164:in `last_modified_from_file'
/app/app/models/asset.rb:160:in `etag_from_file'
/app/app/models/asset.rb:230:in `store_metadata'
/usr/local/bundle/ruby/3.2.0/gems/activesupport-7.1.2/lib/active_support/callbacks.rb:403:in `block in make_lambda'

When debugging ERROR::ENOENT issues in production, i can see that Virus scanner triggered twice for the same asset.

30 Nov, 2023 @ 17:37:45.485 | 6568c869cc1ec5000d8eefe6 - VirusScanWorker#perform - Virus scan started
30 Nov, 2023 @ 17:37:45.303 | 6568c869cc1ec5000d8eefe6 - VirusScanWorker#perform - Virus scan started

Trello

In production we see errors that file reading is attempted by the `store_metadata` function when the file is not available. It looks like there might be a concurrency issue where one process has changed the file state and possibly removed the local file, whilst another process is attempting to save changes to the asset model and therefore attempts to read metadata from the file. This change attempts to minimize the attempts to read the file from the filesystem so that we can avoid these errors.

…is unscanned In production it looks like there are concurrency issues and the VirusScannerWorker executes several times for the same file. When several processes attempt to change the asset state at the same time, it can causes other issues. Specifically any attempt to read or remove the local file after the state transition can fail.

jkempster34

I think both of these changes should help reduce the number of errors, but I agree that we might want to add a more robust solution later.

Has this been tested in integration?

jkempster34 · 2023-12-01T09:50:58Z

app/models/asset.rb

@@ -76,7 +76,7 @@ class Asset

  mount_uploader :file, AssetUploader

-  before_save :store_metadata, unless: :uploaded?
+  before_save :store_metadata, unless: :uploaded?, if: -> { changes.include?(:file) }


I think this might help reduce the number of raised errors.

It wouldn't if the second process is already inside the store_metadata method

tunylund · 2023-12-01T10:24:23Z

I think both of these changes should help reduce the number of errors, but I agree that we might want to add a more robust solution later.

Has this been tested in integration?

Yeah i tested a regular file upload from whitehall in integration, seems to work.

Currently, when an asset is updated with a file which has the same name as the previous file, the metadata (specifically the MD5 hexdigest) doesn't get updated. There is a guard statement in the S3 uploader that prevents assets with and unchanged MD5 hexdigest) from uploading. This is due to a condition added in #1258 that was intended to quieten some errors we were seeing at the time. This same issue was addressed in #1263.

Asset metadata is not being updated when an asset is uploaded with a file which has the same name as the previous file. This is because of a condition added in #1258 that was intended to quieten some errors we were seeing at the time. This same issue was addressed in #1263. This is problematic as there is a guard statement in the S3 uploader that prevents assets with an unchanged MD5 hexdigest from uploading.

Tuomas Nylund added 2 commits December 1, 2023 07:58

tunylund force-pushed the enoent-issues branch from a8f1a45 to 4705054 Compare December 1, 2023 08:48

jkempster34 approved these changes Dec 1, 2023

View reviewed changes

tunylund merged commit 7bf405e into main Dec 1, 2023
8 checks passed

tunylund deleted the enoent-issues branch December 1, 2023 10:27

jkempster34 mentioned this pull request Mar 18, 2024

Ensure asset metadata is updated when file name isn't changed #1335

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Attempt to avoid some of the Enoent issues #1258

Attempt to avoid some of the Enoent issues #1258

tunylund commented Dec 1, 2023 •

edited

Loading

jkempster34 left a comment

jkempster34 Dec 1, 2023

tunylund commented Dec 1, 2023

Attempt to avoid some of the Enoent issues #1258

Attempt to avoid some of the Enoent issues #1258

Conversation

tunylund commented Dec 1, 2023 • edited Loading

jkempster34 left a comment

Choose a reason for hiding this comment

jkempster34 Dec 1, 2023

Choose a reason for hiding this comment

tunylund commented Dec 1, 2023

tunylund commented Dec 1, 2023 •

edited

Loading