Skip to content

Accuracy improvement for indian location names extraction using NER #9832

Discussion options

You must be logged in to vote

Sorry you're having trouble with this, you're correct that there probably aren't enough Indian location names in our training data.

I would recommend you make a list of Indian location names and use an EntityRuler to label the data and see how much coverage that gets. If the coverage is reasonable, you can use that data as training data for an NER component. You can put that component in the pipeline with the existing NER component and see how that works. I suspect that putting it after with overwrite is the best thing to do, but you should try different combinations of before/after the default NER and using overwrite or not. See the double NER example project for notes on how that works.

Y…

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by svlandeg
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
lang / en English language data and models feat / ner Feature: Named Entity Recognizer perf / accuracy Performance: accuracy
3 participants