RTR: Route to Reason

This repository contains the codebase for the paper Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection.

An illustration of the RTR pipeline. We represent candidate LLMs and reasoning strategies with embeddings. For each query, RTR predicts the performance and the number of output tokens for every LLM-strategy pair, then routes the query to the pair that best balances performance and cost.
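
The routing step described above can be sketched as follows. This is a minimal illustration, not the repository's implementation: the `Candidate` class, the `route` function, the `cost_weight` parameter, and the linear accuracy-minus-cost score are all assumptions made for the example, and the predicted values are made up.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Candidate:
    """One LLM-strategy pair with its predictions for a single query (hypothetical)."""
    model: str
    strategy: str
    pred_accuracy: float  # predicted probability of answering correctly
    pred_tokens: float    # predicted number of output tokens (cost proxy)


def route(candidates: Sequence[Candidate], cost_weight: float = 0.5) -> Candidate:
    """Pick the pair with the best accuracy/cost trade-off.

    Assumed score: pred_accuracy - cost_weight * (pred_tokens / max_tokens).
    A larger cost_weight favors cheaper pairs.
    """
    if not candidates:
        raise ValueError("need at least one candidate")
    # Normalize token counts so the cost term lies in [0, 1].
    max_tokens = max(c.pred_tokens for c in candidates) or 1.0

    def score(c: Candidate) -> float:
        return c.pred_accuracy - cost_weight * (c.pred_tokens / max_tokens)

    return max(candidates, key=score)


# Illustrative predictions for one query (invented numbers).
candidates = [
    Candidate("small-llm", "direct", pred_accuracy=0.55, pred_tokens=100),
    Candidate("small-llm", "cot", pred_accuracy=0.70, pred_tokens=400),
    Candidate("large-llm", "cot", pred_accuracy=0.90, pred_tokens=2000),
]
best = route(candidates, cost_weight=0.5)
print(best.model, best.strategy)
```

With `cost_weight=0.0` the sketch always picks the pair with the highest predicted accuracy; raising the weight shifts the choice toward pairs with fewer predicted output tokens.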

🔧 Setup

Clone this repository:
```
git clone <repo_url>
cd rtr_test
```

Create and activate the conda environment:

```
conda create -n rtr python=3.10
conda activate rtr
pip install -r requirements.txt
```

📊 Data

We generate our dataset with the open-source Qwen2.5-Math evaluation framework, using vLLM for inference.

🚀 Run Instructions

1. Generate Embeddings

Generate embeddings for models, strategies, and questions:

```
python ./data/gen_model_strategy_emb.py
python ./data/gen_question_emb.py
```

`encoder_model_path`: Path to the sentence-transformer model used to encode questions, models, and strategies.
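
As a rough illustration of what this encoding step involves, the sketch below turns named text descriptions into embedding vectors. It assumes the `sentence-transformers` package is installed. The helper names `load_encoder` and `embed_named_texts`, and the idea of describing each model or strategy with a short text, are assumptions for the example and are not taken from the scripts in `data/`.

```python
from typing import Callable, Dict, List, Sequence

# An encoder maps a batch of strings to one vector per string.
Encoder = Callable[[Sequence[str]], List[List[float]]]


def load_encoder(encoder_model_path: str) -> Encoder:
    """Build an encoder from a local sentence-transformer model (hypothetical helper)."""
    # Imported lazily so the rest of the sketch works without the package.
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(encoder_model_path)
    # encode() returns a NumPy array by default; convert it to plain lists.
    return lambda texts: model.encode(list(texts)).tolist()


def embed_named_texts(descriptions: Dict[str, str], encode: Encoder) -> Dict[str, List[float]]:
    """Encode each description and key the resulting vector by its name."""
    names = list(descriptions)
    vectors = encode([descriptions[name] for name in names])
    return {name: list(vec) for name, vec in zip(names, vectors)}


# Example usage (requires a real model path, so it is left commented out):
# encode = load_encoder("/path/to/your/sentence-transformer")
# strategy_emb = embed_named_texts(
#     {"cot": "Solve the problem step by step.", "direct": "Answer directly."},
#     encode,
# )
```

Passing the encoder in as a function keeps the embedding logic independent of any particular model, which also makes it easy to try the code with a stand-in encoder.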

2. Training and Prediction

Train the model and run prediction:

```
python ./src/main.py
```

Arguments:

- `--epochs`: Number of training epochs (default: 100)
- `--batch_size`: Training batch size (default: 32)
- `--lr`: Learning rate (default: 1e-3)
- `--cls_hidden_dim`: Hidden layer dimension for the classifier (default: 768)
- `--reg_hidden_dim`: Hidden layer dimension for the regressor (default: 768)
- `--patience`: Patience for early stopping (default: 5)
- `--encoder_model_path`: Path to the sentence encoder model
- `--predict`: Whether to run prediction after training (default: False)
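
For reference, the arguments listed above could be declared with `argparse` roughly as follows. This is a sketch built only from the names and defaults in the list, not the actual parser in `src/main.py`. In particular, treating `--predict` as a bare on/off flag and leaving `--encoder_model_path` without a default are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented arguments and defaults."""
    parser = argparse.ArgumentParser(description="Train the RTR router and optionally predict.")
    parser.add_argument("--epochs", type=int, default=100, help="number of training epochs")
    parser.add_argument("--batch_size", type=int, default=32, help="training batch size")
    parser.add_argument("--lr", type=float, default=1e-3, help="learning rate")
    parser.add_argument("--cls_hidden_dim", type=int, default=768,
                        help="hidden layer dimension for the classifier")
    parser.add_argument("--reg_hidden_dim", type=int, default=768,
                        help="hidden layer dimension for the regressor")
    parser.add_argument("--patience", type=int, default=5, help="patience for early stopping")
    # Assumption: no default is documented, so None is used here.
    parser.add_argument("--encoder_model_path", type=str, default=None,
                        help="path to the sentence encoder model")
    # Assumption: modeled as a flag that is False unless given.
    parser.add_argument("--predict", action="store_true",
                        help="run prediction after training")
    return parser


# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(["--epochs", "50", "--lr", "5e-4", "--predict"])
print(vars(args))
```

Check the real parser in `src/main.py` before relying on the flag form of `--predict`, since the script may instead expect an explicit value.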
