Skip to content

abzb1/f-clip

Repository files navigation

F-CLIPScore

Finegrained CLIPScore to detect object hallucination

This repository contains the source code for the following research paper: Vision-Encoders (Already) Know What They See: Mitigating Object Hallucination via Simple Fine-Grained CLIPScore

Environment setup

pip install -r requirements.txt
python -m spacy download en_core_web_sm

Evaluate

For vanila CLIPScore

python eval_ohd-caps.py --score_type clip

For F-CLIPScore

python eval_ohd-caps.py --score_type fclipam

cf. Further constrained version of F-CLIPScore is implemented in scorers.py

Result

image

Citation

@misc{oh2025visionencodersalreadyknowsee,
      title={Vision-Encoders (Already) Know What They See: Mitigating Object Hallucination via Simple Fine-Grained CLIPScore}, 
      author={Hongseok Oh and Wonseok Hwang},
      year={2025},
      eprint={2502.20034},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2502.20034}, 
}

About

finegrained clip score to detect hallucination

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages