maxreciprocate
Collaborator

This PR

  • sets use_cache in gen_kwargs to true by default
  • replaces batch_size with chunk_size in the evaluation pipeline, mirroring its use in the make_experience pipeline
  • removes the requirement that the padding token be the same as endoftext for decoder models
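
For illustration only, the first two changes could be sketched roughly as below. This is not the code from the PR: the helper names `with_default_gen_kwargs` and `chunked` are hypothetical, and the sketch assumes that an explicitly passed `use_cache` value still overrides the default.

```python
from typing import Any, Dict, Iterator, List


def with_default_gen_kwargs(user_kwargs: Dict[str, Any]) -> Dict[str, Any]:
    """Return generation kwargs in which use_cache defaults to True.

    Hypothetical helper: assumes an explicit user value wins over the default.
    """
    merged: Dict[str, Any] = {"use_cache": True}
    merged.update(user_kwargs)  # explicit settings take precedence
    return merged


def chunked(items: List[Any], chunk_size: int) -> Iterator[List[Any]]:
    """Yield consecutive slices of at most chunk_size items.

    Hypothetical helper: stands in for splitting evaluation prompts by
    chunk_size instead of batch_size.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(items), chunk_size):
        yield items[start:start + chunk_size]


if __name__ == "__main__":
    print(with_default_gen_kwargs({"max_new_tokens": 8}))
    for chunk in chunked(["p1", "p2", "p3", "p4", "p5"], chunk_size=2):
        print(chunk)
```

Under these assumptions, generation during evaluation would run on chunks of `chunk_size` prompts, the same way it does when collecting experience.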

https://wandb.ai/sorry/trlx-references/reports/fix-default-gen_kwargs-v-main--Vmlldzo0NzIwNzc2
(Note that the Pythia run is absent because only 40 GB GPUs are currently available.)

Collaborator

@PhungVanDuy PhungVanDuy left a comment

sgtm!

@maxreciprocate maxreciprocate merged commit 171357b into main Jun 23, 2023
@maxreciprocate maxreciprocate deleted the fix-default-gen_kwargs branch June 23, 2023 12:42