Skip to content

Conversation

trent-sp
Copy link
Contributor

@trent-sp trent-sp commented Dec 9, 2024

NLTK's implementation of BLEU is limited. In particular, only 20% of my attempts to compute BLEU return a score because the number of candidate and reference sentences are not the same, a requirement of NLTK's implementation. The implementation of BLEU by sacrebleu is recommended because it is more robust. In my PR, I have modified _bleu_score.py to use sacrebleu's implementation.

@dosubot dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Dec 9, 2024
Added sacrebleu
Updated BLEU documentation.
@shahules786 shahules786 self-requested a review December 10, 2024 10:07
Copy link
Member

@shahules786 shahules786 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you. This is great replacement for BLEU.

@shahules786 shahules786 merged commit 27c8277 into explodinggradients:main Dec 12, 2024
15 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

size:S This PR changes 10-29 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants