Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EarthWorks Aardvark staging: updating synonyms to remove duplicates and handle lobsters #334

Merged
merged 1 commit into from
Jul 29, 2024

Conversation

hudajkhan
Copy link
Contributor

Relates to sul-dlss/earthworks#1091 and sul-dlss/earthworks#1083

A lot of the multi-expansion lines (i.e. any word on a line - if it's in a query - will have all the other words on the line also added to the query), had lots of duplicates. E.g. Elevation, altitudes, elevation. I removed the duplicate words from many of these lines. Since our configuration ignores case, I also removed duplicates that

I also saw "buildings" in the multi-word expansion in a line about religion. That didn't make much sense so I removed the word "buildings".

And yes, for the lobsters line, I replaced "Invertebrates, lobsters,mosquitoes,shellfish" with "Invertebrates => Invertebrates,lobsters,mosquitoes,shellfish". This expands a query for invertebrates to also include lobsters, mosquitoes, and shellfish, but not the other way around.

@hudajkhan hudajkhan merged commit d0e88a7 into master Jul 29, 2024
1 check passed
@hudajkhan hudajkhan deleted the updateEarthworksSynonyms branch July 29, 2024 23:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants