LegalBench-RAG is a benchmark recently introduced by ZeroEntropy to evaluate the retrieval component of Retrieval-Augmented Generation (RAG) systems in the legal domain.
However, the original dataset does not follow the structure of standard IR benchmarks, such as those used in BEIR. This repository provides a converter (`create_dataset.py`) that restructures it, making it easier to integrate with existing retrieval pipelines, run evaluations, and experiment with it.
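For context, BEIR-style datasets are typically distributed in a standard layout (a sketch; the exact filenames emitted by the converter are an assumption here):

```
<dataset>/
├── corpus.jsonl   # one document (here: chunk) per line: {"_id": ..., "text": ...}
├── queries.jsonl  # one query per line: {"_id": ..., "text": ...}
└── qrels/
    ├── train.tsv  # query-id <TAB> corpus-id <TAB> score
    ├── dev.tsv
    └── test.tsv
```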
- Python 3.10
- Install dependencies with:

  ```
  pip install -r requirements.txt
  ```
- Download the LegalBench-RAG dataset at this link, then unzip it inside `./data`. You should end up with the following structure:
  ```
  ./data/
  ├── benchmarks/
  │   ├── cuad.json
  │   └── ...
  ├── corpus/
  │   ├── maud/...
  │   └── ...
  ```
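Before running the converter, a quick sanity check like the following can catch an archive unzipped in the wrong place (a minimal sketch; the expected paths are taken from the tree above):

```python
from pathlib import Path

# Both subdirectories should sit directly under ./data (see the tree above).
for sub in ("benchmarks", "corpus"):
    path = Path("data") / sub
    if not path.is_dir():
        raise SystemExit(f"Missing {path}: unzip the LegalBench-RAG archive inside ./data first.")
print("Data layout looks correct.")
```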
When everything is set up, just run the script:

```
python create_dataset.py
```
The formatted dataset will be saved in `./data/legalbenchrag/`. By default, the script uses the best settings reported in the original paper (LangChain's `RecursiveCharacterTextSplitter` with 0 overlap between chunks).
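For reference, this default chunking strategy can be reproduced with a few lines of LangChain (a sketch; the chunk size and input file below are illustrative assumptions, not values taken from the script):

```python
from langchain_text_splitters import RecursiveCharacterTextSplitter

# Recursive character-based splitting with no overlap, matching the
# paper's best-performing setting. chunk_size=500 is an assumed example value.
splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=0)

with open("data/corpus/maud/contract.txt") as f:  # hypothetical input file
    chunks = splitter.split_text(f.read())
print(f"Produced {len(chunks)} chunks.")
```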
The script prints:
- Number of original documents and generated chunks
- Number of queries
- Min / max / average answer span lengths
- QREL counts per split
You should get the following:
```
----- Statistics for train qrels ------------------------------
Number of queries contained: 4133
Average number of positive documents per query: 2.764
Number of unique documents: 10128
----- Statistics for dev qrels ------------------------------
Number of queries contained: 1377
Average number of positive documents per query: 2.674
Number of unique documents: 3550
----- Statistics for test qrels ------------------------------
Number of queries contained: 1379
Average number of positive documents per query: 2.750
Number of unique documents: 3642
Total number of queries: 6889. The sum of relevant queries across all splits is 6889, which is 100.00% of the total queries.
```
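Assuming the converter emits the standard BEIR layout (corpus.jsonl, queries.jsonl, and per-split qrels TSVs, as sketched above), the output can then be loaded with the `beir` package:

```python
from beir.datasets.data_loader import GenericDataLoader

# Loads corpus.jsonl, queries.jsonl, and qrels/test.tsv from the output folder.
corpus, queries, qrels = GenericDataLoader("data/legalbenchrag").load(split="test")
print(f"{len(queries)} test queries over {len(corpus)} chunks.")
```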
If you use this dataset, please cite the original paper and this repository.