ModernBert: reuse GemmaRotaryEmbedding via modular + Integration tests #35459

tomaarsen · 2024-12-30T15:10:48Z

What does this PR do?

Add integration tests with the now-released checkpoint + 2 checkpoints under hf-internal-testing (thanks @xenova !)
Reuse GemmaRotaryEmbedding via modular functionality

Note that I based the expected values in the integration tests on the main implementation, not on the refactored RotaryEmbedding implementation. In short: the passing tests indicate that the refactor preserves the model's outputs, and therefore its performance.
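The golden-value idea described above can be sketched in a few lines. This is a minimal, library-free illustration and not the tests added in this PR: `reference_inv_freq`, `refactored_inv_freq`, `assert_all_close`, and `EXPECTED` are hypothetical names, and the hard-coded numbers stand in for outputs recorded once from the main implementation.

```python
import math
from typing import List

# Golden values, recorded once from the reference ("main") implementation
# for dim=8, base=10000.0. In a real integration test these would be a
# slice of model outputs; here they are rotary inverse frequencies.
EXPECTED: List[float] = [1.0, 0.1, 0.01, 0.001]


def reference_inv_freq(dim: int, base: float) -> List[float]:
    """Hypothetical 'main' implementation: 1 / base^(i / dim) for even i."""
    return [1.0 / (base ** (i / dim)) for i in range(0, dim, 2)]


def refactored_inv_freq(dim: int, base: float) -> List[float]:
    """Hypothetical refactor: same quantity, computed via exp/log."""
    log_base = math.log(base)
    return [math.exp(-log_base * i / dim) for i in range(0, dim, 2)]


def assert_all_close(actual: List[float], expected: List[float], rel_tol: float = 1e-9) -> None:
    """Element-wise comparison with a relative tolerance."""
    assert len(actual) == len(expected), "length mismatch"
    for a, e in zip(actual, expected):
        assert math.isclose(a, e, rel_tol=rel_tol), f"{a} != {e}"


# The refactor is checked against values that came from the old code path,
# so a pass means the refactor did not change the numbers.
assert_all_close(refactored_inv_freq(8, 10000.0), EXPECTED)
```

Because the expected values come from the pre-refactor code path, a failure here would point at the refactor rather than at the test data.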

This is the refactor that I promised earlier offline.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@ArthurZucker

cc @warner-benjamin

Tom Aarsen

HuggingFaceDocBuilderDev · 2024-12-30T15:38:27Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

warner-benjamin

LGTM

ArthurZucker

Super nice to add the tests

ArthurZucker · 2025-01-09T16:16:31Z

src/transformers/models/modernbert/modular_modernbert.py

+class ModernBertRotaryEmbedding(GemmaRotaryEmbedding):
+    def __init__(self, config: ModernBertConfig, dim: int, base: float, device: Optional[torch.device] = None):
+        super().__init__(self, config=config, device=device)
+        self.rope_kwargs = {"dim": dim, "base": base}


I think PR #35589 might have removed the kwargs! But I'm happy to add them back if needed, or to just not use the Gemma one!

I'll reintroduce the rope_kwargs in LlamaRotaryEmbedding.

ArthurZucker

Actually it makes more sense to just inherit and overwrite the call to inv_freq, self.attention_scaling = self.rope_init_fn(self.config, device, **self.rope_kwargs), rather than reverting the commit!
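The inherit-and-override suggestion can be illustrated with a small, library-free sketch. Everything here is a toy stand-in and an assumption for illustration: `toy_rope_init`, `ParentRotaryEmbedding`, and `PerLayerRotaryEmbedding` are hypothetical names, the config is a plain dict, and the real classes in transformers hold tensors and buffers rather than Python lists. The point is only the shape of the pattern: the subclass lets the parent initialise from the config, then repeats the `rope_init_fn` call with explicit `dim` and `base` instead of storing `rope_kwargs`.

```python
from typing import Dict, List, Optional, Tuple


def toy_rope_init(
    config: Optional[Dict[str, float]],
    device: Optional[str] = None,
    dim: Optional[int] = None,
    base: Optional[float] = None,
) -> Tuple[List[float], float]:
    """Toy RoPE init: returns (inv_freq, attention_scaling).

    If a config is given, dim/base are read from it; otherwise the
    explicit keyword arguments are used.
    """
    if config is not None:
        dim, base = int(config["head_dim"]), float(config["rope_theta"])
    inv_freq = [1.0 / (base ** (i / dim)) for i in range(0, dim, 2)]
    return inv_freq, 1.0


class ParentRotaryEmbedding:
    """Toy stand-in for the inherited class: initialises from the config only."""

    def __init__(self, config: Dict[str, float], device: Optional[str] = None):
        self.config = config
        self.rope_init_fn = toy_rope_init
        self.inv_freq, self.attention_scaling = self.rope_init_fn(self.config, device)


class PerLayerRotaryEmbedding(ParentRotaryEmbedding):
    """Toy subclass: overrides the rope_init_fn call with explicit dim/base."""

    def __init__(self, config: Dict[str, float], dim: int, base: float, device: Optional[str] = None):
        super().__init__(config, device=device)
        # Re-run the init with per-layer values instead of the config defaults.
        self.inv_freq, self.attention_scaling = self.rope_init_fn(None, device, dim=dim, base=base)


config = {"head_dim": 4, "rope_theta": 10000.0}
parent = ParentRotaryEmbedding(config)
layer = PerLayerRotaryEmbedding(config, dim=4, base=100.0)
```

With this shape the parent class needs no extra attribute such as `rope_kwargs`; only the subclass that wants non-config values pays for the second call.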

ArthurZucker

Good to go! Thanks

#35459)
* Introduce 5 integration tests for the 4 model classes + torch export
* ModernBert: reuse GemmaRotaryEmbedding via modular
* Revert #35589, keep rope_kwargs; rely on them in modular_modernbert
* Revert "Revert #35589, keep rope_kwargs; rely on them in modular_modernbert" This reverts commit 11b44b9.
* Don't set rope_kwargs; override 'self.rope_init_fn' call instead

tomaarsen added 2 commits December 30, 2024 16:06

Introduce 5 integration tests for the 4 model classes + torch export

b5b3fdc

ModernBert: reuse GemmaRotaryEmbedding via modular

0839a51

warner-benjamin approved these changes Dec 30, 2024

ArthurZucker approved these changes Jan 9, 2025

tomaarsen added 2 commits January 9, 2025 17:21

Merge branch 'main' of https://github.com/huggingface/transformers into modernbert/rotary_embeds

385853a

Revert huggingface#35589, keep rope_kwargs; rely on them in modular_modernbert

11b44b9

tomaarsen requested a review from Rocketknight1 as a code owner January 9, 2025 16:38

ArthurZucker reviewed Jan 9, 2025

tomaarsen added 2 commits January 9, 2025 20:11

Revert "Revert huggingface#35589, keep rope_kwargs; rely on them in modular_modernbert"

219631d

This reverts commit 11b44b9.

Don't set rope_kwargs; override 'self.rope_init_fn' call instead

3e97bff

ArthurZucker approved these changes Jan 10, 2025

tomaarsen merged commit 6b73ee8 into huggingface:main Jan 10, 2025
16 checks passed
