Commit a06fe9f

Update stale comment from results table (#222)
* Remove stale comment from results table
* Add details
1 parent 96fa6d9 commit a06fe9f

1 file changed (+1, -1 lines changed)

examples/summarize_rlhf/README.md

Lines changed: 1 addition & 1 deletion
@@ -40,7 +40,7 @@ For an in-depth description of the example, please refer to our [blog post](http
 
 ### Results
 
-On 1,000 samples from CNN/DailyMail test dataset:
+The following tables display ROUGE and reward scores on the test set of the TL;DR dataset between SFT and PPO models.
 
 1. SFT vs PPO
 
0 commit comments
