Awesome papers about generative information extraction using LLMs
The organization of papers is discussed in our survey: Large Language Models for Generative Information Extraction: A Survey.
If you have any suggestions or come across any mistakes or missing information, please feel free to let us know via email at [email protected]. We appreciate your feedback and help in improving our work.
If you find our survey useful for your research, please cite the following paper:
@misc{xu2023large,
  title={Large Language Models for Generative Information Extraction: A Survey},
  author={Derong Xu and Wei Chen and Wenjun Peng and Chao Zhang and Tong Xu and Xiangyu Zhao and Xian Wu and Yefeng Zheng and Enhong Chen},
  year={2023},
  eprint={2312.17617},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}
A taxonomy by various tasks.
Models targeting only NER tasks.
Paper | Venue | Date | Code |
---|---|---|---|
Calibrated Seq2seq Models for Efficient and Generalizable Ultra-fine Entity Typing | EMNLP Findings | 2023-12 | Github |
Generative Entity Typing with Curriculum Learning | EMNLP | 2022-12 | Github |
Models targeting only RE tasks.
Paper | Venue | Date | Code |
---|---|---|---|
Document-level Entity-based Extraction as Template Generation | EMNLP | 2021-11 | Github |
Paper | Venue | Date | Code |
---|---|---|---|
REBEL: Relation Extraction By End-to-end Language generation | EMNLP Findings | 2021-11 | Github |
Models targeting only EE tasks.
Paper | Venue | Date | Code |
---|---|---|---|
Unleash GPT-2 Power for Event Detection | ACL | 2021-08 | |
Paper | Venue | Date | Code |
---|---|---|---|
Context-Aware Prompt for Generation-based Event Argument Extraction with Diffusion Models | CIKM | 2023-10 | |
Code4Struct: Code Generation for Few-Shot Event Structure Prediction | ACL | 2023-07 | Github |
Event Extraction as Question Generation and Answering | ACL Short | 2023-07 | Github |
Prompt for extraction? PAIE: prompting argument interaction for event argument extraction | ACL | 2022-05 | Github |
Unified models targeting multiple IE tasks.
Paper | Venue | Date | Code |
---|---|---|---|
Set Learning for Generative Information Extraction | EMNLP | 2023-12 | |
GIELLM: Japanese General Information Extraction Large Language Model Utilizing Mutual Reinforcement Effect | Arxiv | 2023-11 | |
InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction | Arxiv | 2023-04 | Github |
Zero-Shot Information Extraction via Chatting with ChatGPT | Arxiv | 2023-02 | Github |
GenIE: Generative Information Extraction | NAACL | 2022-07 | Github |
DEEPSTRUCT: Pretraining of Language Models for Structure Prediction | ACL Findings | 2022-05 | Github |
Unified Structure Generation for Universal Information Extraction | ACL | 2022-05 | Github |
Structured prediction as translation between augmented natural languages | ICLR | 2021-01 | Github |
Paper | Venue | Date | Code |
---|---|---|---|
GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction | Arxiv | 2023-12 | Github |
Retrieval-Augmented Code Generation for Universal Information Extraction | Arxiv | 2023-11 | |
CODEIE: Large Code Generation Models are Better Few-Shot Information Extractors | ACL | 2023-07 | Github |
CodeKGC: Code Language Model for Generative Knowledge Graph Construction | Arxiv | 2023-04 | Github |
A taxonomy by learning paradigms.
Paper | Venue | Date | Code |
---|---|---|---|
DemoSG: Demonstration-enhanced Schema-guided Generation for Low-resource Event Extraction | EMNLP Findings | 2023-12 | Github |
One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER | IJCAI | 2023-09 | Github |
LightNER: A Lightweight Tuning Paradigm for Low-resource NER via Pluggable Prompting | COLING | 2022-10 | Github |
Unified Structure Generation for Universal Information Extraction | ACL | 2022-05 | Github |
InstructionNER: A Multi-Task Instruction-Based Generative Framework for Few-shot NER | Arxiv | 2022-03 | |
Template-Based Named Entity Recognition Using BART | ACL Findings | 2021-08 | Github |
Structured prediction as translation between augmented natural languages | ICLR | 2021-01 | Github |
Paper | Venue | Date | Code |
---|---|---|---|
Document-level event argument extraction by conditional generation | NAACL | 2021-06 | Github |
Paper | Venue | Date | Code |
---|---|---|---|
LLMaAA: Making Large Language Models as Active Annotators | EMNLP Findings | 2023-12 | Github |
Improving Unsupervised Relation Extraction by Augmenting Diverse Sentence Pairs | EMNLP | 2023-12 | Github |
Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models | Arxiv | 2023-11 | Github |
How to Unleash the Power of Large Language Models for Few-shot Relation Extraction? | ACL Workshop | 2023-07 | Github |
Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction | bioNLP Workshop | 2023-07 | Github |
Does Synthetic Data Generation of LLMs Help Clinical Text Mining? | Arxiv | 2023-04 | |
Unleash GPT-2 Power for Event Detection | ACL | 2021-08 | |
Paper | Venue | Date | Code |
---|---|---|---|
Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset | EMNLP | 2023-12 | Github |
Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge | EMNLP Findings | 2023-12 | Github |
Chain-of-Thought Prompt Distillation for Multimodal Named Entity Recognition and Multimodal Relation Extraction | Arxiv | 2023-08 | |
Paper | Venue | Date | Code |
---|---|---|---|
Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction | EMNLP | 2023-12 | Github |
Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks | ACL Findings | 2023-07 | |
Event Extraction as Question Generation and Answering | ACL Short | 2023-07 | Github |
STAR: Improving Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models | Arxiv | 2023-05 | |
* denotes a multimodal dataset. # denotes a count: #Class is the number of categories, and #Train, #Val, and #Test are the numbers of sentences.
The link will be updated soon~
Task | Dataset | Domain | #Class | #Train | #Val | #Test | Link |
---|---|---|---|---|---|---|---|
NER | ACE04 | News | 7 | 6202 | 745 | 812 | Link |
NER | ACE05 | News | 7 | 7299 | 971 | 1060 | Link |
NER | BC5CDR | Biomedical | 2 | 4560 | 4581 | 4797 | Link |
NER | Broad Twitter Corpus | Social Media | 3 | 6338 | 1001 | 2000 | Link |
NER | CADEC | Biomedical | 1 | 5340 | 1097 | 1160 | Link |
NER | CoNLL03 | News | 4 | 14041 | 3250 | 3453 | Link |
NER | CoNLLpp | News | 4 | 14041 | 3250 | 3453 | Link |
NER | CrossNER-AI | Artificial Intelligence | 14 | 100 | 350 | 431 | Link |
NER | CrossNER-Literature | Literary | 12 | 100 | 400 | 416 | |
NER | CrossNER-Music | Musical | 13 | 100 | 380 | 465 | |
NER | CrossNER-Politics | Political | 9 | 199 | 540 | 650 | |
NER | CrossNER-Science | Scientific | 17 | 200 | 450 | 543 | |
NER | FabNER | Scientific | 12 | 9435 | 2182 | 2064 | Link |
NER | Few-NERD | General | 66 | 131767 | 18824 | 37468 | Link |
NER | FindVehicle | Traffic | 21 | 21565 | 20777 | 20777 | Link |
NER | GENIA | Biomedical | 5 | 15023 | 1669 | 1854 | Link |
NER | HarveyNER | Social Media | 4 | 3967 | 1301 | 1303 | Link |
NER | MIT-Movie | Social Media | 12 | 9774 | 2442 | 2442 | Link |
NER | MIT-Restaurant | Social Media | 8 | 7659 | 1520 | 1520 | Link |
NER | MultiNERD | Wikipedia | 16 | 134144 | 10000 | 10000 | Link |
NER | NCBI | Biomedical | 4 | 5432 | 923 | 940 | Link |
NER | OntoNotes 5.0 | General | 18 | 59924 | 8528 | 8262 | Link |
NER | ShARe13 | Biomedical | 1 | 8508 | 12050 | 9009 | Link |
NER | ShARe14 | Biomedical | 1 | 17404 | 1360 | 15850 | Link |
NER | SNAP* | Social Media | 4 | 4290 | 1432 | 1459 | Link |
NER | Temporal Twitter Corpus (TTC) | Social Media | 3 | 10000 | 500 | 1500 | Link |
NER | Tweebank-NER | Social Media | 4 | 1639 | 710 | 1201 | Link |
NER | Twitter2015* | Social Media | 4 | 4000 | 1000 | 3357 | Link |
NER | Twitter2017* | Social Media | 4 | 3373 | 723 | 723 | Link |
NER | TwitterNER7 | Social Media | 7 | 7111 | 886 | 576 | Link |
NER | WikiDiverse* | News | 13 | 6312 | 755 | 757 | Link |
NER | WNUT2017 | Social Media | 6 | 3394 | 1009 | 1287 | Link |
RE | ACE05 | News | 7 | 10051 | 2420 | 2050 | Link |
RE | ADE | Biomedical | 1 | 3417 | 427 | 428 | Link |
RE | CoNLL04 | News | 5 | 922 | 231 | 288 | Link |
RE | DocRED | Wikipedia | 96 | 3008 | 300 | 700 | Link |
RE | MNRE* | Social Media | 23 | 12247 | 1624 | 1614 | Link |
RE | NYT | News | 24 | 56196 | 5000 | 5000 | Link |
RE | Re-TACRED | News | 40 | 58465 | 19584 | 13418 | Link |
RE | SciERC | Scientific | 7 | 1366 | 187 | 397 | Link |
RE | SemEval2010 | General | 19 | 6507 | 1493 | 2717 | Link |
RE | TACRED | News | 42 | 68124 | 22631 | 15509 | Link |
RE | TACREV | News | 42 | 68124 | 22631 | 15509 | Link |
EE | ACE05 | News | 33/22 | 17172 | 923 | 832 | Link |
EE | CASIE | Cybersecurity | 5/26 | 11189 | 1778 | 3208 | Link |
EE | GENIA11 | Biomedical | 9/11 | 8730 | 1091 | 1092 | Link |
EE | GENIA13 | Biomedical | 13/7 | 4000 | 500 | 500 | Link |
EE | PHEE | Biomedical | 2/16 | 2898 | 961 | 968 | Link |
EE | RAMS | News | 139/65 | 7329 | 924 | 871 | Link |
EE | WikiEvents | Wikipedia | 50/59 | 5262 | 378 | 492 | Link |