xquad

Paper

Title: On the Cross-lingual Transferability of Monolingual Representations

Abstract: https://arxiv.org/abs/1910.11856

XQuAD (Cross-lingual Question Answering Dataset) is a benchmark dataset for evaluating cross-lingual question answering performance. The dataset consists of a subset of 240 paragraphs and 1190 question-answer pairs from the development set of SQuAD v1.1 (Rajpurkar et al., 2016) together with their professional translations into ten languages: Spanish, German, Greek, Russian, Turkish, Arabic, Vietnamese, Thai, Chinese, and Hindi. Consequently, the dataset is entirely parallel across 11 languages.
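The structure described above can be sketched in a few lines of Python. This is a minimal illustration, not the benchmark's official loader: it assumes each language split follows the SQuAD v1.1 JSON layout (`data` → `paragraphs` → `qas`), and the helper name `count_squad_v1`, the ISO 639-1 language codes, and the tiny `sample` record are all made up for the example.

```python
from typing import Any, Dict, Tuple

# ISO 639-1 codes for the ten translation languages listed above (assumed
# naming), plus English for the original SQuAD v1.1 subset.
TRANSLATED_LANGUAGES = ["es", "de", "el", "ru", "tr", "ar", "vi", "th", "zh", "hi"]
ALL_LANGUAGES = ["en"] + TRANSLATED_LANGUAGES


def count_squad_v1(dataset: Dict[str, Any]) -> Tuple[int, int]:
    """Return (paragraph count, question-answer pair count) for a
    dictionary in the SQuAD v1.1 layout."""
    n_paragraphs = 0
    n_qas = 0
    for article in dataset["data"]:
        for paragraph in article["paragraphs"]:
            n_paragraphs += 1
            n_qas += len(paragraph["qas"])
    return n_paragraphs, n_qas


# Hand-made sample in the SQuAD v1.1 layout (NOT real XQuAD data).
sample = {
    "version": "1.1",
    "data": [
        {
            "title": "Example",
            "paragraphs": [
                {
                    "context": "Paris is the capital of France.",
                    "qas": [
                        {
                            "id": "q1",
                            "question": "What is the capital of France?",
                            "answers": [{"text": "Paris", "answer_start": 0}],
                        },
                        {
                            "id": "q2",
                            "question": "Paris is the capital of which country?",
                            "answers": [{"text": "France", "answer_start": 24}],
                        },
                    ],
                },
                {
                    "context": "Berlin is in Germany.",
                    "qas": [
                        {
                            "id": "q3",
                            "question": "Which city is in Germany?",
                            "answers": [{"text": "Berlin", "answer_start": 0}],
                        }
                    ],
                },
            ],
        }
    ],
}

print(len(ALL_LANGUAGES))       # ten translations plus English
print(count_squad_v1(sample))   # (paragraphs, question-answer pairs)
```

If the layout assumption holds, running the same counter on any single language split loaded with `json.load` should report 240 paragraphs and 1190 question-answer pairs, since the splits are parallel.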

Homepage: https://github.com/google-deepmind/xquad

Citation

@article{Artetxe:etal:2019,
      author    = {Mikel Artetxe and Sebastian Ruder and Dani Yogatama},
      title     = {On the cross-lingual transferability of monolingual representations},
      journal   = {CoRR},
      volume    = {abs/1910.11856},
      year      = {2019},
      archivePrefix = {arXiv},
      eprint    = {1910.11856}
}
