Train data #2

qjh-nj · 2023-12-27T03:20:21Z

Hello!
It seems that there is no train dataset. Should I just run "run_dataset_preprocessing.sh" to get train data by myself?

and where is the test data--"a paired bootstrap test" described in the paper? Do I need to sample data by myself?

and what's the function of the code below, why "add oracle sentences to training data". Does the Table 5 in the paper use the code below to get more train data?

##add oracle sentences to training data
if "train" in args.split:
chunks_output_list.append(chunks_output_dict)

Thanks!

ryokamoi · 2023-12-31T13:20:54Z

Hi,

It seems that there is no train dataset. Should I just run "run_dataset_preprocessing.sh" to get train data by myself?

To get chunk-level train data, you need to run the code.

and where is the test data--"a paired bootstrap test" described in the paper? Do I need to sample data by myself?

Yes.

and what's the function of the code below, why "add oracle sentences to training data". Does the Table 5 in the paper use the code below to get more train data?

Yes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train data #2

Train data #2

qjh-nj commented Dec 27, 2023 •

edited

Loading

ryokamoi commented Dec 31, 2023 •

edited

Loading

Train data #2

Train data #2

Comments

qjh-nj commented Dec 27, 2023 • edited Loading

ryokamoi commented Dec 31, 2023 • edited Loading

qjh-nj commented Dec 27, 2023 •

edited

Loading

ryokamoi commented Dec 31, 2023 •

edited

Loading