You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We are trying to train a EN-FR sentence level QE model by using a predictor estimator model with parallel data.
We are using OpenKiwi 0.1.3 to train it.
The procedure was as follows:
Train the Predictor using parallel data (EN-FR)
Train the Estimator using the Predictor (from step 1) using the following data (as commented in thread Is it possible to train with just src, mt, ter? #46):
a.the English source sentences
b.the FR translated sentences using a pretrained MT model
c.the TER scores for each FR sentence translated
The results obtained were of a Pearson correlation of 0.32 and a Spearman correlation of 0.36, which are below the 0.5018 and 0.5566 obtained on the OpenKiwi paper (https://www.aclweb.org/anthology/P19-3020.pdf).
My question is: is it possible to obtain a similar result using only parallel data? If affirmative there is something wrong on our procedure?
The configuration files used to train are the following:
Hello @lluisg,
sorry for the (very) late response!
Everything seems alright in your settings and proposed setup. You could play a bit more with the hyper-params but nothing jumps out to me as obviously wrong.
As for your original question "is it possible to obtain a similar result using only parallel data? " I do not know! It is definitely an interesting research question!
Traditionally the community as believed that the multi-task nature of the normal QE setup helps it with both tasks, as HTER and the tag creation are inherently correlated, but who knows, maybe it is possible to get as good results with just parallel data? I would be interested in knowing about your results!
P.S. Is there any specific reason as for why you are using Openkiwi 0.1.3 instead of Openkiwi >2.0 ?
Hi!
We are trying to train a EN-FR sentence level QE model by using a predictor estimator model with parallel data.
We are using OpenKiwi 0.1.3 to train it.
The procedure was as follows:
a.the English source sentences
b.the FR translated sentences using a pretrained MT model
c.the TER scores for each FR sentence translated
The results obtained were of a Pearson correlation of 0.32 and a Spearman correlation of 0.36, which are below the 0.5018 and 0.5566 obtained on the OpenKiwi paper (https://www.aclweb.org/anthology/P19-3020.pdf).
My question is: is it possible to obtain a similar result using only parallel data? If affirmative there is something wrong on our procedure?
The configuration files used to train are the following:
The text was updated successfully, but these errors were encountered: