GitHub - Lizhmq/DeftEval: Repo for SemEval Task6 - DeftEval subtasks1-3

SemEval Task6 - DeftEval

SemEval-2020 is an International Workshop on Semantic Evaluation hold by ACL.

In this repository, we focus on the SemEval Task6 -- DeftEval: Extracting term-definition pairs in free text.

Final Report

Please refer to the pdf file DeftEval - 李拙.pdf. (Updated on May 26, 2021)

Overview

DeftEval is split into three subtasks:

Subtask 1: Sentence Classification

Given a sentence, classify whether or not it contains a definition. This is the traditional definition extraction task.

Subtask 2: Sequence Labeling

Label each token with BIO tags according to the corpus' tag specification.

Subtask 3: Relation Classification

Given the tag sequence labels, label the relations between each tag according to the corpus' relation specification.

Please refer to the offered link for more information.

Subtask-1: Sentence Classification

As mentioned above, this subtask is a sentence binary classification problem.

We implement the following traditional ML methods and DL methods for demonstration:

Traditional ML: Naive Bayesian, KNN, Decision Tree, Logistic Regression, SVM
Deep Learning: LSTM, BERT + fine-tuning

The results are listed in res.txt:

	NB	KNN	CART	LR	SVM	LSTM	BERT
Accuracy	71%	68%	71%	73%	77%	81%	86%
F1	0.30	0.03	0.52	0.44	0.53	0.69	0.79
Macro F1	0.56	0.42	0.66	0.63	0.69	0.78	0.84

(Note that the hyper-parameters are not searched/selected well. I just do a simple experiment. :)

Run Command

Warning: dataset and trained model are not uploaded.

Prepare Data

Execute "dataset.ipynb" and you will get "pickle" dataset saved.

Task 1

cd task1
sh train.sh # train
sh eval.sh  # evaluate on test set, change the pretraineddir to saved model path

Task2

cd task2
sh train.sh # train
sh eval.sh # eval, change the "model_list" and "out_list" in eval.py
sh run.sh # convert to standart format, execute output.py and official evaluate script

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
input		input
output		output
program		program
task1		task1
task2		task2
task3		task3
.gitignore		.gitignore
DeftEval - 李拙.pdf		DeftEval - 李拙.pdf
README.md		README.md
data_converter.py		data_converter.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SemEval Task6 - DeftEval

Final Report

Overview

Subtask-1: Sentence Classification

Run Command

About

Releases

Packages

Languages

Lizhmq/DeftEval

Folders and files

Latest commit

History

Repository files navigation

SemEval Task6 - DeftEval

Final Report

Overview

Subtask-1: Sentence Classification

Run Command

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages