akoumpa
Contributor

@akoumpa akoumpa commented Sep 10, 2025

  • Padding is handled inside the collator, via the pad_seq_len_divisible option.
  • The max_seq_length option enforces the maximum tokenized sequence length the model sees. If an example's context is longer than that, the context is truncated (to a space). If truncation removes all of the context, an exception is raised and the fetch is retried, up to 64 times. Whenever fetching an item fails, the next item is selected at random.
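
The behavior described above can be illustrated with a minimal sketch. This is not the code from this PR: the function names (`pad_to_multiple`, `truncate_context`, `fetch_with_retry`) and the `(context_ids, answer_ids)` example layout are hypothetical. The sketch also assumes that pad_seq_len_divisible means "pad to a length divisible by this value" and that the tail of the context is the part dropped. It truncates on token ids and does not model truncating to a space.

```python
import random

MAX_RETRIES = 64  # retry limit stated in the PR description


def pad_to_multiple(batch, pad_token_id, divisor):
    """Right-pad each sequence to the batch max length, rounded up to a multiple of `divisor`."""
    longest = max(len(seq) for seq in batch)
    target = -(-longest // divisor) * divisor  # ceiling to the next multiple
    return [seq + [pad_token_id] * (target - len(seq)) for seq in batch]


def truncate_context(context_ids, answer_ids, max_seq_length):
    """Trim the context so context + answer fits; raise if no context survives."""
    budget = max_seq_length - len(answer_ids)
    if budget <= 0:
        raise ValueError("truncation removed all of the context")
    # Assumption: the tail of the context is dropped.
    return context_ids[:budget] + answer_ids


def fetch_with_retry(examples, index, max_seq_length, rng):
    """Fetch one example; on failure, retry with a randomly chosen index."""
    for _ in range(MAX_RETRIES):
        context_ids, answer_ids = examples[index]
        try:
            return truncate_context(context_ids, answer_ids, max_seq_length)
        except ValueError:
            index = rng.randrange(len(examples))  # next item picked at random
    raise RuntimeError(f"no usable example after {MAX_RETRIES} attempts")
```

With these helpers, a dataset's item accessor would call `fetch_with_retry` and the collator would call `pad_to_multiple` on the resulting batch, so that padding stays out of the dataset itself.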

Signed-off-by: Alexandros Koumparoulis <[email protected]>

copy-pr-bot bot commented Sep 10, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.


akoumpa commented Sep 10, 2025

/ok to test db74d1c

Signed-off-by: Alexandros Koumparoulis <[email protected]>

akoumpa commented Sep 10, 2025

/ok to test bf62a61

Signed-off-by: Alexandros Koumparoulis <[email protected]>
Signed-off-by: Alexandros Koumparoulis <[email protected]>

akoumpa commented Sep 10, 2025

/ok to test a315726
