EarthWorks Aardvark staging: updating synonyms to remove duplicates and handle lobsters #334
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Relates to sul-dlss/earthworks#1091 and sul-dlss/earthworks#1083
A lot of the multi-expansion lines (i.e. any word on a line - if it's in a query - will have all the other words on the line also added to the query), had lots of duplicates. E.g. Elevation, altitudes, elevation. I removed the duplicate words from many of these lines. Since our configuration ignores case, I also removed duplicates that
I also saw "buildings" in the multi-word expansion in a line about religion. That didn't make much sense so I removed the word "buildings".
And yes, for the lobsters line, I replaced "Invertebrates, lobsters,mosquitoes,shellfish" with "Invertebrates => Invertebrates,lobsters,mosquitoes,shellfish". This expands a query for invertebrates to also include lobsters, mosquitoes, and shellfish, but not the other way around.