Skip to content

Commit a02f806

Browse files
authored
Update link to "Learning to Summarize from Human Feedback" (#241)
1 parent dcbf7b0 commit a02f806

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

examples/summarize_rlhf/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
## Learning to summarize from Human Feedback using `trlx`
22

33
This example shows how to use `trlx` to train a summarization model using human feedback
4-
following the fine-tuning procedures described in Stiennon et al.'s, "[Learning to Summarize from human feedback](https://arxiv.org/abs/2106.00987)".
4+
following the fine-tuning procedures described in Stiennon et al.'s, "[Learning to Summarize from human feedback](https://arxiv.org/abs/2009.01325)".
55

66

77
Before running everything, we need some extra packages not included in the `trlx` dependency list. Specifically, we need HuggingFace's [`evaluate`](https://huggingface.co/docs/evaluate/index) package and Google's re-implementation of ROUGE, [`rouge-score`](https://github.com/google-research/google-research/tree/master/rouge). To install them, run `requirements.txt` in this example's root directory:

0 commit comments

Comments
 (0)