This is a public testing set using for the Vietnamese spelling correction task.
There are 6000 sentences injected with spelling correction.
The testset_6000.txt consists of 6000 sentences. The groundtruth.txt consists of 6000 all-correct sentences. The 6000.csv consists of pairs of correct and incorrect sentences.
This testset can be used for future Vietnamese spelling correction progress tracking.