Nepali NER dataset and code:
Short tutorial on building NER datasets and models for new languages
Task 1: Annotate NER dataset (~510 sentences) --- NAAMII NER
Task 2: Split annotated sentences into 100/100/410 train/dev/test split
Task 3: Train NER baseline model on the new dataset (NAAMII NER)
Task 4: Train NER models on CoNLL03, WikiANN (NE), WikiANN (HI), and Singh NER and evaluate on the new dataset (NAAMI NER)
David Adelani for Third Nepal Winter School in AI