bug-1921849: support elasticsearch 8 #6741

relud · 2024-10-03T16:58:04Z

use ELASTICSEARCH_MODE=PREFER_NEW to make the webapp use es8 and the processor write to both es 1.4 and es8

socorro/tests/external/es/test_supersearch.py

socorro/external/es/crashstorage.py

Co-authored-by: krzepka <[email protected]>

willkg

We're going to do this PR in two passes. This is a code read pass.

While you're fixing things I brought up, I'll spend some time going through some manual testing for things I'm wondering about.

Then after you make changes, I'll read through those and add anything that came up in manual testing.

socorro/external/es/crashstorage.py

socorro/external/es/super_search_fields.py

socorro/tests/external/es/test_supersearch.py

socorro/external/es/super_search_fields.py

socorro/external/es/supersearch.py

socorro/tests/external/es/test_crashstorage.py

relud · 2024-11-07T18:24:10Z

socorro/external/es/crashstorage.py

-            except elasticsearch.exceptions.TransportError as e:
-                # If this is a TransportError, we try to figure out what the error
+            except elasticsearch.BadRequestError as e:
+                # If this is a BadRequestError, we try to figure out what the error
                # is and fix the document and try again


this section removes fields that cause document_parsing_exception and retries the document. This seems like an odd choice given that it happens after value fixing occurs, which should already be preventing the three types of failure we catch here. The only way i can think of to reach this code block in production is if we are writing to a field not in our mapping.

It used to be the case that it wrote all the data into Elasticsearch even if it wasn't in the mapping. That way when they add new fields to the crash report, they'd get indexed even if Socorro didn't explicitly have support for it. While the intentions were good, that was terrible so I changed it such that it only indexes what's defined in super search fields and in the mapping.

webapp/crashstats/topcrashers/tests/test_views.py

willkg · 2024-11-22T13:25:30Z

@relud Can you pull in the fix in PR #6813? Then I can go through this PR again.

relud · 2024-11-22T18:07:31Z

@relud Can you pull in the fix in PR #6813? Then I can go through this PR again.

done

willkg

I went through and tested the issues I raised in the previous review.

bin/process_crashes.sh works now.
Things correctly wait for both es and legacy_es containers to start up.
TopCrashers works now.

I wrote up bug 1933824 about a curiosity I hit when uploading some crash report dump files to gcs emulator. That's not related to these changes.

My only issue is that I'm not sure why you added four new metrics to statsd_metrics.yml rather than just the one I was hitting issues with.

Everything else looks fine as far as I can tell.

r+wc

socorro/statsd_metrics.yaml

relud force-pushed the relud-es-8-crash-storage branch 11 times, most recently from d96b425 to 9ac223f Compare October 8, 2024 18:09

relud requested a review from willkg October 8, 2024 18:19

This comment was marked as resolved.

Sign in to view

relud force-pushed the relud-es-8-crash-storage branch 3 times, most recently from 32912f9 to b360970 Compare October 9, 2024 23:58

relud commented Oct 9, 2024

View reviewed changes

socorro/tests/external/es/test_supersearch.py Show resolved Hide resolved

relud marked this pull request as ready for review October 9, 2024 23:59

relud requested a review from a team as a code owner October 9, 2024 23:59

relud commented Oct 11, 2024

View reviewed changes

socorro/external/es/crashstorage.py Outdated Show resolved Hide resolved

willkg mentioned this pull request Oct 23, 2024

Rework how supersearch fields permissions are handled #6764

Merged

This comment was marked as resolved.

Sign in to view

relud force-pushed the relud-es-8-crash-storage branch 4 times, most recently from 19b3ea4 to c0e2cfd Compare October 30, 2024 17:55

support elasticsearch 8

2704c10

Co-authored-by: krzepka <[email protected]>

relud force-pushed the relud-es-8-crash-storage branch from c0e2cfd to 2704c10 Compare October 30, 2024 18:05

This comment was marked as resolved.

Sign in to view

willkg requested changes Nov 5, 2024

View reviewed changes

relud requested a review from willkg November 7, 2024 00:25

address review

b3a3c73

relud force-pushed the relud-es-8-crash-storage branch from b460f21 to b3a3c73 Compare November 7, 2024 00:36

update error handling for dropping fields

a830cbf

relud commented Nov 7, 2024

View reviewed changes

willkg self-assigned this Nov 18, 2024

This comment was marked as resolved.

Sign in to view

relud added 2 commits November 20, 2024 14:02

Merge branch 'main' into relud-es-8-crash-storage

c7457c8

address review

fe18969

relud force-pushed the relud-es-8-crash-storage branch 4 times, most recently from 9988801 to 0e51573 Compare November 20, 2024 23:29

relud commented Nov 20, 2024

View reviewed changes

webapp/crashstats/topcrashers/tests/test_views.py Outdated Show resolved Hide resolved

relud force-pushed the relud-es-8-crash-storage branch from 0e51573 to 0b51672 Compare November 20, 2024 23:34

fix topcrashers

8cedc0c

relud force-pushed the relud-es-8-crash-storage branch from 0b51672 to 8cedc0c Compare November 20, 2024 23:36

relud requested a review from willkg November 20, 2024 23:39

Merge branch 'main' into relud-es-8-crash-storage

6dcc7e2

willkg approved these changes Nov 27, 2024

View reviewed changes

socorro/statsd_metrics.yaml Show resolved Hide resolved

relud added 2 commits December 3, 2024 08:25

address review

43584d9

Merge branch 'main' into relud-es-8-crash-storage

34ebe47

relud enabled auto-merge December 3, 2024 16:35

relud added this pull request to the merge queue Dec 3, 2024

Merged via the queue into main with commit d533845 Dec 3, 2024
1 check passed

relud deleted the relud-es-8-crash-storage branch December 3, 2024 16:59

relud mentioned this pull request Dec 3, 2024

Update ElasticSearch #6727

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bug-1921849: support elasticsearch 8 #6741

bug-1921849: support elasticsearch 8 #6741

relud commented Oct 3, 2024 •

edited

Loading

This comment was marked as resolved.

This comment was marked as resolved.

This comment was marked as resolved.

willkg left a comment

relud Nov 7, 2024

willkg Nov 20, 2024

This comment was marked as resolved.

willkg commented Nov 22, 2024

relud commented Nov 22, 2024

willkg left a comment

bug-1921849: support elasticsearch 8 #6741

bug-1921849: support elasticsearch 8 #6741

Conversation

relud commented Oct 3, 2024 • edited Loading

This comment was marked as resolved.

This comment was marked as resolved.

This comment was marked as resolved.

willkg left a comment

Choose a reason for hiding this comment

relud Nov 7, 2024

Choose a reason for hiding this comment

willkg Nov 20, 2024

Choose a reason for hiding this comment

This comment was marked as resolved.

willkg commented Nov 22, 2024

relud commented Nov 22, 2024

willkg left a comment

Choose a reason for hiding this comment

relud commented Oct 3, 2024 •

edited

Loading