This repo is constructed for collecting and categorizing papers about RAG according to our survey paper: Retrieval-Augmented Generation for AI-Generated Content: A Survey. Considering the rapid growth of this field, we will continue to update both paper and this repo.
-
Query-based RAG
REALM: Retrieval-Augmented Language Model Pre-Training
KILT: a Benchmark for Knowledge Intensive Language Tasks
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
REPLUG: Retrieval-Augmented Black-Box Language Models
In-Context Retrieval-Augmented Language Models
When Language Model Meets Private Library
Retrieval-Augmented Score Distillation for Text-to-3D Generation
-
Latent Representation-based RAG
EditSum: A Retrieve-and-Edit Framework for Source Code Summarization
A Retrieve-and-Edit Framework for Predicting Structured Outputs
DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases
-
Logit-based RAG
Syntax-Aware Retrieval Augmented Code Generation
-
Speculative RAG
-
Input Enhancement
-
Retriever Enhancement
-
Recursive Retrieve
-
Chunk Optimization
-
Finetune Retriever
C-Pack: Packaged Resources To Advance General Chinese Embedding
LM-Cocktail: Resilient Tuning of Language Models via Model Merging
Retrieve Anything To Augment Large Language Models
Replug: Retrieval-augmented black-box language models
When Language Model Meets Private Library
EditSum: {A} Retrieve-and-Edit Framework for Source Code Summarization
Synchromesh: Reliable Code Generation from Pre-trained Language Models
Retrieval Augmented Convolutional Encoder-decoder Networks for Video Captioning
-
Hybrid Retrieve
RAP-Gen: Retrieval-Augmented Patch Generation with CodeT5 for Automatic Program Repair
ReACC: A Retrieval-Augmented Code Completion Framework
Retrieval-based neural source code summarization
BashExplainer: Retrieval-Augmented Bash Code Comment Generation based on Fine-tuned CodeBERT
Retrieval-Augmented Score Distillation for Text-to-3D Generation
-
Re-ranking
Re2G: Retrieve, Rerank, Generate
AceCoder: Utilizing Existing Code to Enhance Code Generation
-
Meta-data Filtering
-
-
Generator Enhancement
-
Prompt Engineering
Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models
Active Prompting with Chain-of-Thought for Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
Lost in the Middle: How Language Models Use Long Contexts
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model
Automatic Semantic Augmentation of Language Model Prompts (for Code Summarization)
Retrieval-Based Prompt Selection for Code-Related Few-Shot Learning
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models
-
Decoding Tuning
InferFix: End-to-End Program Repair with LLMs
Synchromesh: Reliable Code Generation from Pre-trained Language Models
-
Finetune Generator
Improving Language Models by Retrieving from Trillions of Tokens
When Language Model Meets Private Library
Concept-Aware Video Captioning: Describing Videos With Effective Prior Information
Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Retrieval-Augmented Score Distillation for Text-to-3D Generation
-
-
Result Enhancement
-
RAG Pipeline Enhancement
-
Adaptive Retrieval
-
Rule-Baesd
Active retrieval augmented generation
-
Model-Based
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation
Self-Knowledge Guided Retrieval Augmentation for Large Language Models
-
-
Iterative RAG
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
-
-
Qusetion Answering
Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering
REALM: Retrieval-Augmented Language Model Pre-Training
Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training
Atlas: Few-shot Learning with Retrieval Augmented Language Models
Improving Language Models by Retrieving from Trillions of Tokens
Self-Knowledge Guided Retrieval Augmentation for Large Language Models
Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering
Think-on-Graph: Deep and Responsible Reasoning of Large Language Model with Knowledge Graph
Nonparametric Masked Language Modeling
CL-ReLKT: Cross-lingual Language Knowledge Transfer for Multilingual Retrieval Question Answering
One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval
Entities as Experts: Sparse Memory Access with Entity Supervision
When to Read Documents or QA History: On Unified and Selective Open-domain QA
-
Fact verification
CONCRETE: Improving Cross-lingual Fact-checking with Cross-lingual Retrieval
-
Commonsense Reasoning
KG-BART: Knowledge Graph-Augmented {BART} for Generative Commonsense Reasoning
-
Human-Machine Conversation
Grounded Conversation Generation as Guided Traverses in Commonsense Knowledge Graphs
Skeleton-to-Response: Dialogue Generation Guided by Retrieval Memory
Internet-Augmented Dialogue Generation
BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage
A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue Systems
-
Neural Machine Translation
Neural Machine Translation with Monolingual Translation Memory
-
Event Extraction
Retrieval-Augmented Generative Question Answering for Event Argument Extraction
-
Summarization
Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training
Unlimiformer: Long-Range Transformers with Unlimited Length Input
-
Code Generation
Retrieval Augmented Code Generation and Summarization
When Language Model Meets Private Library
DocPrompting: Generating Code by Retrieving the Docs
CodeT5+: Open Code Large Language Models for Code Understanding and Generation
AceCoder: Utilizing Existing Code to Enhance Code Generation
The impact of lexical and grammatical processing on generating code from natural language
-
Code Summary
Retrieval-based neural source code summarization
Retrieve and Refine: Exemplar-based Neural Comment Generation
RACE: Retrieval-augmented Commit Message Generation
BashExplainer: Retrieval-Augmented Bash Code Comment Generation based on Fine-tuned CodeBERT
-
Code Completion
ReACC: A Retrieval-Augmented Code Completion Framework
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation
CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context
-
Automatic Program Repair
Repair Is Nearly Generation: Multilingual Program Repair with LLMs
Retrieval-Based Prompt Selection for Code-Related Few-Shot Learning
-
Text-to-SQL and Code-based Semantic Parsing
Synchromesh: Reliable Code Generation from Pre-trained Language Models
Leveraging Code to Improve In-context Learning for Semantic Parsing
-
Others
Leveraging training data in few-shot prompting for numerical reasoning
-
Audio Generation
-
Audio Captioning
-
Image Generation
Retrievegan: Image synthesis via differentiable patch retrieval
Memory-driven text-to-image generation
RE-IMAGEN: RETRIEVAL-AUGMENTED TEXT-TO-IMAGE GENERATOR
KNN-Diffusion: Image Generation via Large-Scale Retrieval
Retrieval-Augmented Diffusion Models
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
X&Fuse: Fusing Visual Information in Text-to-Image Generation
-
Image Captioning
Memory-augmented image captioning
Retrieval-Augmented Transformer for Image Captioning
Retrieval-augmented image captioning
SmallCap: Lightweight Image Captioning Prompted With Retrieval Augmentation
Cross-Modal Retrieval and Semantic Refinement for Remote Sensing Image Captioning
-
Video Captioning
Retrieval Augmented Convolutional Encoder-decoder Networks for Video Captioning
Concept-Aware Video Captioning: Describing Videos With Effective Prior Information
-
Video Generation
Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Frozen in Time: {A} Joint Video and Image Encoder for End-to-End Retrieval
-
Text-to-3D
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model
AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion
Retrieval-Augmented Score Distillation for Text-to-3D Generation
-
Knowledge Base Question Answering
ReTraCk: A Flexible and Efficient Framework for Knowledge Base Question Answering
Case-based Reasoning for Natural Language Queries over Knowledge Bases
Logical Form Generation via Multi-task Learning for Complex Question Answering over Knowledge Bases
-
Knowledge Graph Completion
Retrieval-Enhanced Generative Model for Large-Scale Knowledge Graph Completion
-
Drug Discovery
Retrieval-based controllable molecule generation
Prompt-based 3d molecular diffusion models for structure-based drug design
-
Medical Applications
Benchmarking Large Language Models in Retrieval-Augmented Generation
ARES: An Automated Evaluation Framework for Retrieval-AugmentedGeneration Systems
RAGAS: Automated Evaluation of Retrieval Augmented Generation
if you find this work useful, please cite our paper:
@misc{zhao2024retrievalaugmented,
title={Retrieval-Augmented Generation for AI-Generated Content: A Survey},
author={Penghao Zhao and Hailin Zhang and Qinhan Yu and Zhengren Wang and Yunteng Geng and Fangcheng Fu and Ling Yang and Wentao Zhang and Bin Cui},
year={2024},
eprint={2402.19473},
archivePrefix={arXiv},
primaryClass={cs.CV}
}