Skip to content

Commit fd04760

Browse files
authored
Paper Index: Change num_completions to num_generations (huggingface#4515)
1 parent b7918c0 commit fd04760

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

docs/source/paper_index.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -142,7 +142,7 @@ training_args = GRPOConfig(
142142
top_p=0.99,
143143
top_k=100,
144144
temperature=0.99,
145-
num_completions=8, # = num_return_sequences in the paper
145+
num_generations=8, # = num_return_sequences in the paper
146146
num_iterations=1, # = ppo_epochs in the paper
147147
per_device_train_batch_size=4,
148148
gradient_accumulation_steps=32,
@@ -246,7 +246,7 @@ from trl import GRPOConfig
246246
config = GRPOConfig(
247247
loss_type="cispo",
248248
epsilon_high=5.0,
249-
num_completions=16,
249+
num_generations=16,
250250
scale_rewards="batch",
251251
cast_lm_head_to_fp32=True
252252
)

0 commit comments

Comments
 (0)