CEDR for MARCO document ranking #44

caiyinqiong · 2022-04-20T06:52:19Z

Hello, have you ever run CEDR_KNRM on the MS MARCO document ranking task?
I ran into problems when training CEDR_KNRM initialized from the fine-tuned BERT: performance barely improves, and sometimes even decreases. I wonder whether this is because the training settings used on Robust are not suitable for MARCO?

I look forward to any empirical guidance. Thank you.

seanmacavaney · 2022-04-20T08:04:16Z

I don't recall trying it, but in PARADE we identified some weirdness about the document ranking task that may explain what you're seeing. The dataset has a strong bias towards a "maximum passage", which means that more sophisticated aggregation techniques (perhaps like the KNRM aggregator employed by CEDR-KNRM) are less effective than simply taking the maximum passage score over the document. See Section 4.6 and Table 4 of the PARADE paper.
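For reference, a minimal sketch of what "taking the maximum passage score" could look like. This is not code from CEDR or PARADE: the function names, the window size and stride, and the `score_passage` callable (standing in for whatever passage-level scorer you have, e.g. a fine-tuned BERT) are all hypothetical.

```python
from typing import Callable, List, Sequence


def split_into_passages(tokens: Sequence[str], size: int = 150, stride: int = 75) -> List[List[str]]:
    """Split a token sequence into overlapping windows (size/stride are illustrative defaults)."""
    if not tokens:
        return [[]]
    passages = []
    for start in range(0, len(tokens), stride):
        passages.append(list(tokens[start:start + size]))
        # Stop once a window has reached the end of the document.
        if start + size >= len(tokens):
            break
    return passages


def max_passage_score(
    query: Sequence[str],
    doc_tokens: Sequence[str],
    score_passage: Callable[[Sequence[str], Sequence[str]], float],
    size: int = 150,
    stride: int = 75,
) -> float:
    """Score every passage independently and keep the best one as the document score."""
    passages = split_into_passages(doc_tokens, size=size, stride=stride)
    return max(score_passage(query, passage) for passage in passages)
```

Comparing this baseline against the CEDR_KNRM run on the same candidate lists might be a quick way to check whether the aggregator is what is holding performance back.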

Hope this helps!
