Long Context vs. RAG for LLMs: An Evaluation and Revisits

Extending context windows (i.e., Long Context, LC) and using retrievers to selectively access relevant information (i.e., RetrievalAugmentedGeneration, RAG)arethetwomain strategies to enable LLMs to incorporate extremely long external contexts. This paper revisits recent studies on this topic, highlighting their key insights and discrepancies. We then provide a more comprehensive evaluation by filtering out questions answerable without external context, identifying the most effective retrieval methods, and expanding the datasets. We show that LC generally outperforms RAG in question-answering benchmarks, especially for Wikipedia-based questions. Summarization-based retrieval performs comparably to LC, while chunk-based retrieval lags behind. However, RAG has advantages in dialogue-based and general question queries. These insights underscore the trade-offs between RAG and LC strategies, offering guidance for future optimization of LLMs with external knowledge sources. We also provide an in-depth discussion on this topic, highlighting the overlooked importance of context relevance in existing studies.

Paper Link

Download Paper

Preparation

Environment

To run our code, please install all the dependency packages by using the following command:

pip install -r requirements.txt

Dataset

Download Dataset from the link： Download Dataset

Please make sure your data folder structure as below.

Datasets
  ├── full_set
  │   ├── 2wikimultihop.jsonl
  │   ├── coursera.jsonl
  │   └── (more datasets ... )
  │
  ├── full_set_filtered
  │   ├── 2wikimultihop.jsonl
  │   ├── coursera.jsonl
  │   └── (more datasets ... )
  │  
  ├── sample_set
  │   ├── 2wikimultihop.jsonl
  │   ├── coursera.jsonl
  │   └── (more datasets ... )
  │   
  ├── sample_set_filtered
  │   ├── 2wikimultihop.jsonl
  │   ├── coursera.jsonl
  │   └── (more datasets ... )

Citation

Please cite our paper if you find it helpful to your work:

@misc{li2024longcontextvsrag,
      title={Long Context vs. RAG for LLMs: An Evaluation and Revisits}, 
      author={Xinze Li and Yixin Cao and Yubo Ma and Aixin Sun},
      year={2024},
      eprint={2501.01880},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2501.01880}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
Data_Processing		Data_Processing
Eval		Eval
LC		LC
RAG		RAG
README.md		README.md
requirements.txt		requirements.txt
run_lc_scripts.sh		run_lc_scripts.sh
run_rag_scripts.sh		run_rag_scripts.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Long Context vs. RAG for LLMs: An Evaluation and Revisits

Paper Link

Preparation

Environment

Dataset

Citation

About

Uh oh!

Releases

Packages

Languages

lixinze777/LC_VS_RAG

Folders and files

Latest commit

History

Repository files navigation

Long Context vs. RAG for LLMs: An Evaluation and Revisits

Paper Link

Preparation

Environment

Dataset

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages