This repo attempts to reproduce the results presented in Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT, regarding a zero-shot text classification on MLdoc dataset.
More specifically, the scores for zero-shot cross-lingual transfer included in the original work are the following:
Language | Score |
---|---|
en | 94.2 |
de | 80.2 |
es | 72.6 |
fr | 72.6 |
it | 68.9 |
ja | 56.6 |
ru | 73.7 |
Average | 74.5 |
In contrast, the scores that we managed to reproduce are the following:
Language | Score |
---|---|
en | 96.5 |
de | 79.1 |
es | 73.4 |
fr | 78.0 |
it | 65.7 |
ja | 71.4 |
ru | 62.8 |
Average | 75.2 |