A curated list of awesome papers about information retrieval (IR) in the age of large language models (LLMs), covering retrieval-augmented large language models, large language models for information retrieval, and related topics. If we missed any papers, feel free to open a PR to add them! Any feedback and contributions are welcome!
This list is currently maintained by Yinqiong Cai, Yu-An Liu, Shiyu Nee, and Hengran Zhang at CAS Key Lab of Network Data Science and Technology, ICT, CAS.
We thank all the great contributors very much.
- Retrieval-based Language Models and Applications. Akari Asai et al. ACL 2023. (Tutorial)
- Augmented Language Models: a Survey. Grégoire Mialon et al. arXiv 2023.
- Information Retrieval Meets Large Language Models: A Strategic Report from Chinese IR Community. Qingyao Ai et al. arXiv 2023.
- Retrieval-Augmented Generation for Large Language Models: A Survey. Yunfan Gao et al. arXiv 2023.
- A Survey on RAG Meets LLMs: Towards Retrieval-Augmented Large Language Models. Yujuan Ding et al. arXiv 2024.
- Large Language Models for Information Retrieval: A Survey. Yutao Zhu et al. arXiv 2023.
- Perspectives on Large Language Models for Relevance Judgment. Guglielmo Faggioli et al. ICTIR 2023. (Best Paper)
- Information Retrieval Meets Large Language Models. Zheng Liu et al. WWW 2024.
- REALM: Retrieval augmented language model pre-training. Kelvin Guu et al. ICML 2020.
- Improving language models by retrieving from trillions of tokens. Sebastian Borgeaud et al. ICML 2022. (RETRO)
- Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study. Boxin Wang et al. arXiv 2023.
- Dense Passage Retrieval for open-domain question answering. Vladimir Karpukhin et al. EMNLP 2020. (DPR)
- Retrieval-augmented generation for knowledge-intensive NLP tasks. Patrick Lewis et al. NeurIPS 2020. (RAG)
- Leveraging passage retrieval with generative models for open domain question answering. Gautier Izacard and Edouard Grave. EACL 2021. (FiD)
- Copy Is All You Need. Tian Lan et al. ICLR 2023. (COG)
- Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models. Wenhao Yu et al. arXiv 2023.
- Generalization through memorization: Nearest neighbor language models. Urvashi Khandelwal et al. arXiv 2019.
- Interleaving retrieval with chain-of-thought reasoning for knowledge-intensive multi-step questions. Harsh Trivedi et al. arXiv 2022.
- Rethinking with retrieval: Faithful large language model inference. Hangfeng He et al. arXiv 2023.
- Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation. Ruiyang Ren et al. arXiv 2023.
- When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories. Alex Mallen et al. ACL 2023.
- When Do LLMs Need Retrieval Augmentation? Mitigating LLMs' Overconfidence Helps Retrieval Augmentation. Shiyu Ni et al. arXiv 2024.
- Atlas: Few-shot Learning with Retrieval Augmented Language Models. Gautier Izacard et al. arXiv 2022.
- REPLUG: Retrieval-Augmented Black-Box Language Models. Weijia Shi et al. arXiv 2023.
- Learning to Retrieve In-Context Examples for Large Language Models. Liang Wang et al. arXiv 2023.
- InPars: Data augmentation for information retrieval using large language models. Luiz Bonifacio et al. SIGIR 2022.
- Improving passage retrieval with zero-shot question generation. Devendra Singh Sachan et al. EMNLP 2022. (UPR)
- Promptagator: Few-shot dense retrieval from 8 examples. Zhuyun Dai et al. ICLR 2023.
- Precise Zero-Shot Dense Retrieval without Relevance Labels. Luyu Gao et al. arXiv 2022.
- Generating Synthetic Documents for Cross-Encoder Re-Rankers: A Comparative Study of ChatGPT and Human Experts. Arian Askari et al. arXiv 2023.
- Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents. Weiwei Sun et al. arXiv 2023.
- Zero-Shot Listwise Document Reranking with a Large Language Model. Xueguang Ma et al. arXiv 2023.
- Are Large Language Models Good at Utility Judgments? Hengran Zhang et al. SIGIR 2024.
- RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs. Yue Yu et al. arXiv 2024.
- Iterative Utility Judgment Framework via LLMs Inspired by Relevance in Philosophy. Hengran Zhang et al. arXiv 2024.
- Query Understanding in the Age of Large Language Models. Avishek Anand et al. Gen-IR 2023.
- Generative relevance feedback with large language models. Iain Mackie et al. arXiv 2023.
- Query2doc: Query expansion with large language models. Liang Wang et al. arXiv 2023.
- Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy. Zhihong Shao et al. Findings of EMNLP 2023.
- Generate rather than retrieve: Large language models are strong context generators. Wenhao Yu et al. ICLR 2023.
- LLMs may Dominate Information Access: Neural Retrievers are Biased Towards LLM-Generated Texts. Sunhao Dai et al. arXiv 2023.
- KILT: a benchmark for knowledge intensive language tasks. Fabio Petroni et al. NAACL 2021.
- RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit. Jiongnan Liu et al. arXiv 2023.
- Shiyu Ni is currently focusing on when to trigger RAG (i.e., when to retrieve) based on the LLM's own responses. For related papers, see https://github.com/ShiyuNee/Awesome-Calibration-Papers