
# Generative AI Literature

## Large Language Model

- LLaMA: Open and Efficient Foundation Language Models (Feb 2023)  Link  PDF
- Stanford Alpaca: An Instruction-following LLaMA Model (Mar 2023)  Link  Github
- Llama 2: Open Foundation and Fine-Tuned Chat Models (Jul 2023)  Link  PDF
- The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only (Jun 2023)  Link  PDF
- Mistral 7B (Oct 2023)  Link  PDF

## RAG

- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (Apr 2021)  Link  PDF
- Fine-tune the Entire RAG Architecture (including DPR retriever) for Question-Answering (Jun 2021)  Link  PDF
- Benchmarking Large Language Models in Retrieval-Augmented Generation (Sep 2023)  Link  PDF
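
The papers above share a common pipeline: retrieve relevant passages, then condition generation on them. A minimal sketch of that retrieve-then-generate loop, using a toy bag-of-words retriever (the function names and corpus are illustrative, not taken from any of the listed papers):

```python
import math
from collections import Counter

def cosine_sim(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, corpus: list[str], k: int = 2) -> list[str]:
    """Return the k passages most similar to the query (toy retriever)."""
    q = Counter(query.lower().split())
    ranked = sorted(corpus,
                    key=lambda p: cosine_sim(q, Counter(p.lower().split())),
                    reverse=True)
    return ranked[:k]

def build_prompt(query: str, passages: list[str]) -> str:
    """Concatenate retrieved context with the question for the generator."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

corpus = [
    "LoRA freezes pretrained weights and learns low-rank update matrices.",
    "Retrieval-augmented generation grounds answers in retrieved passages.",
    "PPO optimizes a clipped surrogate objective in RLHF pipelines.",
]
prompt = build_prompt("How does retrieval augmented generation work?",
                      retrieve("retrieval augmented generation", corpus, k=1))
```

In a real system the bag-of-words scorer would be replaced by a dense retriever (e.g. DPR, as in the second paper) and the prompt passed to an LLM.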

## Fine-Tuning

- Fine-tuning language models to find agreement among humans with diverse preferences (Nov 2022)  Link  PDF
- Instruction Tuning for Large Language Models: A Survey (Oct 2023)  Link  PDF
- Self-Instruct: Aligning Language Models with Self-Generated Instructions (May 2023)  Link  PDF
- Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models (Feb 2023)  Link  PDF
- On the Exploitability of Instruction Tuning (Jun 2023)  Link  PDF
- LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention (Jun 2023)  Link  PDF
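
Most of the instruction-tuning work above trains on (instruction, response) pairs serialized into a fixed prompt template before being fed to the model. A minimal sketch of such a template (the exact wording varies by project; this one is only illustrative):

```python
def format_example(instruction: str, response: str, inp: str = "") -> str:
    """Serialize one (instruction, response) pair into a training prompt.
    Template wording is illustrative, not taken from any specific paper."""
    if inp:
        return (
            "Below is an instruction that describes a task, paired with an input.\n\n"
            f"### Instruction:\n{instruction}\n\n"
            f"### Input:\n{inp}\n\n"
            f"### Response:\n{response}"
        )
    return (
        "Below is an instruction that describes a task.\n\n"
        f"### Instruction:\n{instruction}\n\n"
        f"### Response:\n{response}"
    )

example = format_example("Summarize the paper in one sentence.",
                         "It surveys instruction tuning for LLMs.")
```

Self-Instruct-style pipelines generate the (instruction, response) pairs themselves with a seed model, then fine-tune on the serialized examples.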

## Parameter-Efficient Fine-Tuning

- Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning (Aug 2022)  Link  PDF
- The Power of Scale for Parameter-Efficient Prompt Tuning (Sep 2021)  Link  PDF
- LoRA: Low-Rank Adaptation of Large Language Models (Oct 2021)  Link  PDF
- QLoRA: Efficient Finetuning of Quantized LLMs (May 2023)  Link  PDF
- Parameter-Efficient Fine-Tuning without Introducing New Latency (May 2023)  Link  PDF
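
LoRA, listed above, freezes the pretrained weight W and learns a low-rank update ΔW = BA, so the adapted layer computes W + (α/r)·BA. A pure-Python sketch of merging that update (shapes and scaling follow the paper's formulation; the tiny matrices are illustrative):

```python
def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]

def lora_merge(W, A, B, alpha: float):
    """Return W + (alpha / r) * B @ A, where r is the LoRA rank (rows of A)."""
    r = len(A)
    scale = alpha / r
    BA = matmul(B, A)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, BA)]

# Toy example: d_out = d_in = 2, rank r = 1.
W = [[1.0, 0.0], [0.0, 1.0]]   # frozen pretrained weight (2x2)
A = [[1.0, 2.0]]               # A: r x d_in  (trainable)
B = [[0.5], [0.0]]             # B: d_out x r (initialized to zero in the paper)
W_adapted = lora_merge(W, A, B, alpha=1.0)
```

Because B starts at zero, the adapted model initially matches the frozen base model; after training, B @ A can be merged into W with no extra inference latency, which is the property the last paper in this list generalizes.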

## Reinforcement Learning

- Training language models to follow instructions with human feedback (Mar 2022)  Link  PDF
- Secrets of RLHF in Large Language Models Part I: PPO (Jul 2023)  Link  PDF
- RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback (Sep 2023)  Link  PDF
- Constitutional AI: Harmlessness from AI Feedback (Dec 2022)  Link  PDF
- Fine-tuning Language Models with Generative Adversarial Feedback (Oct 2023)  Link  PDF
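
The PPO-based RLHF methods above optimize a clipped surrogate objective over the policy probability ratio. A minimal sketch of the per-sample clipped term (symbols follow the standard PPO formulation; the epsilon default is illustrative):

```python
def ppo_clip_term(ratio: float, advantage: float, eps: float = 0.2) -> float:
    """Clipped surrogate objective for one sample:
    min(ratio * A, clip(ratio, 1 - eps, 1 + eps) * A).
    `ratio` is pi_new(a|s) / pi_old(a|s); `advantage` is the estimated A."""
    clipped_ratio = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * advantage, clipped_ratio * advantage)
```

Clipping removes the incentive to push the ratio far from 1 in a single update: for a positive advantage the gain saturates at (1 + eps) * A, and for a negative advantage the penalty is bounded below by (1 - eps) * A.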