EPFLLaMA is a project that enhances the TinyLlama model through Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO), utilizing datasets from student annotations and Stack Exchange. The project aims to create an AI tutor specialized in EPFL course content, with a particular focus on Multiple-Choice Question Answering (MCQA) tasks.
- Specialized for EPFL curriculum content
- Enhanced performance on MCQA tasks
- Utilizes advanced techniques: SFT, DPO, and Chain-of-Thought prompting
- Incorporates quantization for reduced memory footprint
EPFLLAMA
│
├── README.md
│
├── data
│   ├── all_datasets
│   │   ├── MCQA_DPO.jsonl
│   │   ├── MCQA_unique_data.jsonl
│   │   ├── merged_DPO_test.jsonl
│   │   ├── merged_DPO_train.jsonl
│   │   └── sft_2000.jsonl
│   │
│   ├── annotated
│   │   ├── 383057.json
│   │   └── 384928.json
│   │
│   └── annotation_scripts
│       ├── DPO_Annotation.py
│       └── MCQA_Annotation.py
│
├── model
│   ├── dataset_example
│   │   ├── dpo_preference_example.jsonl
│   │   ├── mcqa_example.jsonl
│   │   └── MCQA_sft_test.jsonl
│   │
│   ├── models
│   │   ├── model_base.py
│   │   └── model_dpo.py
│   │
│   ├── Create_Loss_Plots.ipynb
│   ├── data_processing.ipynb
│   ├── Evaluate.ipynb
│   ├── evaluator.py
│   ├── main_config.yaml
│   ├── requirements.txt
│   ├── Training.ipynb
│   └── utils.py
│
└── pdfs
    ├── litterature_reviews
    │   ├── 383057.pdf
    │   └── 384928.pdf
    │
    ├── progress_report
    │   └── ab-eh-me.pdf
    │
    ├── project_proposal
    │   └── ab_eh_me.pdf
    │
    └── project_report
        └── ab-eh-me.pdf
The project utilizes various data sources:
- Student-annotated data from EPFL curricula
- Stack Exchange datasets (Data Science, Computer Science, Physics, Mathematics)
- GPT-generated preference pairs
Data collection and processing code can be found in model/data_processing.ipynb, and annotation scripts in data/annotation_scripts.
EPFLLaMA is based on the TinyLlama architecture, a compact and efficient language model with 1.1 billion parameters. It incorporates:
- 22 layers with 32 attention heads each
- Grouped-query attention mechanism
- RoPE (Rotary Positional Embedding)
- SwiGLU activation function
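To make the last two components concrete, here is a minimal, illustrative sketch of SwiGLU gating and a RoPE rotation on single features (toy scalar math, not the model code; the real model applies these to full tensors):

```python
import math

def silu(x):
    # SiLU / Swish: x * sigmoid(x), the "Swish" half of SwiGLU
    return x / (1.0 + math.exp(-x))

def swiglu(x, w_gate, w_up):
    # SwiGLU gating for a single feature:
    # SiLU of the gate projection multiplies the linear "up" projection
    return silu(w_gate * x) * (w_up * x)

def rope_rotate(x0, x1, pos, theta=10000.0, dim_pair=0, head_dim=64):
    # RoPE: rotate one (even, odd) feature pair by a position-dependent angle,
    # encoding position directly in the query/key vectors
    angle = pos * theta ** (-2.0 * dim_pair / head_dim)
    return (x0 * math.cos(angle) - x1 * math.sin(angle),
            x0 * math.sin(angle) + x1 * math.cos(angle))
```

Note that the rotation is norm-preserving, which is why RoPE can encode position without distorting attention scores.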
The training process involves two main phases:
- Supervised Fine-Tuning (SFT): Using the SFTTrainer from the trl library.
- Direct Preference Optimization (DPO): Implementing the DPO loss function to align the model with human preferences.
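The DPO objective can be sketched in a few lines of plain Python (a minimal scalar version for illustration; the repository's implementation operates on batched, summed token log-probabilities). The inputs are the log-probabilities of the chosen and rejected responses under the policy and under a frozen reference model:

```python
import math

def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    # DPO loss: -log(sigmoid(beta * (policy margin - reference margin))),
    # where each margin is the log-prob gap between chosen and rejected responses.
    margin = (pi_chosen - pi_rejected) - (ref_chosen - ref_rejected)
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))
```

When the policy prefers the chosen response more strongly than the reference does, the margin is positive and the loss drops below log 2; no explicit reward model is needed.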
Additionally, the project explores:
- Parameter-Efficient Fine-Tuning (PEFT)
- Low-Rank Adaptation (LoRA)
- Quantization techniques
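As a rough sketch of what LoRA does to a frozen weight matrix (a toy pure-Python version for intuition; in practice this is handled by the peft library), the forward pass adds a scaled low-rank update B·A alongside the frozen base weights W, and only A and B receive gradients:

```python
def matvec(M, v):
    # Multiply a matrix (list of rows) by a vector
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in M]

def lora_forward(x, W, A, B, alpha, r):
    # Frozen base path: W x
    base = matvec(W, x)
    # Trainable low-rank path: B (A x), scaled by alpha / r
    update = matvec(B, matvec(A, x))
    scale = alpha / r
    return [b + scale * u for b, u in zip(base, update)]
```

With rank r much smaller than the weight dimensions, the number of trainable parameters shrinks by orders of magnitude, which is what makes fine-tuning a 1.1B-parameter model tractable on modest hardware.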
The project implements one main improvement:
Quantization: Reduces model size and memory usage while largely maintaining performance, using techniques such as LLM.int8().
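The core idea behind int8 quantization can be illustrated with absmax scaling (a simplified sketch; LLM.int8() additionally keeps outlier feature dimensions in 16-bit precision, which this toy version omits):

```python
def quantize_int8(xs):
    # Absmax quantization: scale so the largest magnitude maps to 127,
    # then round every value to a signed 8-bit integer
    scale = max(abs(x) for x in xs) / 127.0
    q = [round(x / scale) for x in xs]
    return q, scale

def dequantize_int8(q, scale):
    # Recover approximate float values from the int8 codes
    return [qi * scale for qi in q]
```

Storing weights as int8 plus one scale per block cuts memory roughly 4x versus fp32, at the cost of a small, bounded rounding error.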
The EPFLLaMA model demonstrates:
- Improved performance on MCQA tasks compared to baselines
- Robust performance across various technical subjects
- Effective adaptation for educational purposes
Detailed results and analysis can be found in the project report (pdfs/project_report/ab-eh-me.pdf).
To use or contribute to this project:
- Clone the repository
- Install dependencies: pip install -r model/requirements.txt
- Explore the Jupyter notebooks in model/ for training and evaluation
To use our pre-trained models:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the base model
model = AutoModelForCausalLM.from_pretrained("Ali7538/EPFLLaMA")
tokenizer = AutoTokenizer.from_pretrained("Ali7538/EPFLLaMA")

# For MCQA tasks
mcqa_model = AutoModelForCausalLM.from_pretrained("Ali7538/EPFLLaMA_MCQA")
mcqa_tokenizer = AutoTokenizer.from_pretrained("Ali7538/EPFLLaMA_MCQA")

# For faster inference with the quantized model
quantized_model = AutoModelForCausalLM.from_pretrained("Ali7538/EPFLLaMA_MCQA_Quantized")
quantized_tokenizer = AutoTokenizer.from_pretrained("Ali7538/EPFLLaMA_MCQA_Quantized")
```

- Elias Hörnberg
- Ali Bakly
- Othmane Sqalli Houssaini
This project was developed as part of the CS-552 course at EPFL. Special thanks to the course staff and the NLP lab for providing resources and guidance.
