-
Notifications
You must be signed in to change notification settings - Fork 31k
Open
Description
| losses.append(accelerator.gather_for_metrics(loss.repeat(args.per_device_eval_batch_size))) |
drop_last default to False for DataLoader construction. Thus, computing the mean over the losses is inaccurate.Same issue for the train loss:
| "train_loss": total_loss.item() / len(train_dataloader), |
Metadata
Metadata
Assignees
Labels
No labels