Transform references for sacrebleu #520

jbragg · 2020-08-20T00:26:55Z

Currently it is impossible to use sacrebleu when len(predictions) != the number of references per prediction (very uncommon), due to a strange format expected by sacrebleu. If one passes in the data to nlp.metric.compute() in sacrebleu format, nlp throws an error due to mismatching lengths between predictions and references. If one uses a more standard format where predictions and references are lists of the same length, sacrebleu throws an error.

This PR transforms reference data in a more standard format into the unusual format expected by sacrebleu.

lhoestq

Thanks for reporting this !
Indeed sacrebleu expected this unusual format as input...

I think it would be better to check that all the references have the same length rather than cropping to the minimum length. If one length doesn't match the others, we should probably raise an error. What do you think ?

jbragg · 2020-08-20T09:02:09Z

I think I agree @lhoestq so I pushed a change.
Thanks for your work on the library!

thomwolf

Very cool, thanks @jbragg

lhoestq

Nice thank you !

Transform references for sacrebleu

2c5b584

lhoestq reviewed Aug 20, 2020

View reviewed changes

Raise error for varying number of sacrebleu references

5fe16a9

thomwolf approved these changes Aug 20, 2020

View reviewed changes

lhoestq approved these changes Aug 20, 2020

View reviewed changes

lhoestq merged commit f33d598 into huggingface:master Aug 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Transform references for sacrebleu #520

Transform references for sacrebleu #520

Uh oh!

jbragg commented Aug 20, 2020 •

edited

Loading

Uh oh!

lhoestq left a comment

Uh oh!

jbragg commented Aug 20, 2020

Uh oh!

thomwolf left a comment

Uh oh!

lhoestq left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Transform references for sacrebleu #520

Transform references for sacrebleu #520

Uh oh!

Conversation

jbragg commented Aug 20, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lhoestq left a comment

Choose a reason for hiding this comment

Uh oh!

jbragg commented Aug 20, 2020

Uh oh!

thomwolf left a comment

Choose a reason for hiding this comment

Uh oh!

lhoestq left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jbragg commented Aug 20, 2020 •

edited

Loading