Skip to content
/ DeGF Public

[ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models

Notifications You must be signed in to change notification settings

zhangce01/DeGF

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

[ICLR 2025] DeGF

Website arXiv Conference License: MIT

👀Introduction

This repository contains the code for our ICLR 2025 paper Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models.

💡Environment

We test our codebase with PyTorch 2.0.1. Please install corresponding PyTorch and CUDA versions according to your computational resources.

conda create -n DeGF python=3.10
conda activate DeGF
git clone https://github.com/zhangce01/DeGF.git
cd DeGF
pip install -r requirements.txt

Please also download the model checkpoints:

As for the datasets and benchmarks:

📦Usage

We provide the code for evaluating our DeGF on POPE, CHAIR, and MME-Hallucination benchmark. You can simply run the following code to run the experiments:

  • POPE: bash eval_bench/scripts/pope_eval.sh
  • CHAIR:bash eval_bench/scripts/chair_eval.sh
  • MME:bash experiments/cd_scripts/mme_eval.sh

🙏Acknowledgements

Our codebase is adapted from RITUAL, VCD, OPERA, and LLaVA. We thank the authors for releasing their code!

📧Contact

If you have any questions, please contact at [email protected].

📌 BibTeX & Citation

If you find this code useful, please consider citing our work:

@inproceedings{zhang2025selfcorrecting,
  title={Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models},
  author={Ce Zhang and Zifu Wan and Zhehan Kan and Martin Q. Ma and Simon Stepputtis and Deva Ramanan and Russ Salakhutdinov and Louis-Philippe Morency and Katia P. Sycara and Yaqi Xie},
  booktitle={The Thirteenth International Conference on Learning Representations},
  year={2025},
  url={https://openreview.net/forum?id=tTBXePRKSx}
}

About

[ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models

Resources

Stars

Watchers

Forks

Languages