
Conversation

@Rocketknight1
Member

CLIP models were not being tested correctly with fit() because the test skipped any model without an hf_compute_loss method. That skip was intended to exclude base models like TFBertModel that have no specific output heads or losses, but it also excluded models like CLIP that compute a loss without using compute_loss / hf_compute_loss.

The new test instead checks whether the model's return-type dataclass has a loss key, which is a more reliable signal. Enabling it reveals the bug in fit() for TFCLIP, so this PR also includes fixes to train_step and test_step for CLIP and models like it that require return_loss=True to be passed but do not set it by default.
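For illustration, here is a minimal sketch of the kind of check described above (not the exact test code; the helper name is made up). It assumes the model's TF output class is a dataclass, as the library's output classes are, and treats a declared loss field as the signal that the model can be trained with fit():

import dataclasses

def declares_loss(output_class) -> bool:
    # True when the output dataclass exposes a `loss` field (as CLIP's output class does).
    if not dataclasses.is_dataclass(output_class):
        return False
    return any(field.name == "loss" for field in dataclasses.fields(output_class))

On the modelling side, the train_step / test_step fix amounts to making sure return_loss=True reaches the model call, so that a loss is actually produced during fit().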

Draft for now because this will likely flush out other bugs or cause other problems!

Fixes #18670.

@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Aug 18, 2022

The documentation is not available anymore as the PR was closed or merged.

@Rocketknight1
Member Author

Rocketknight1 commented Aug 18, 2022

The tests passed. I'm as surprised as everyone else. It's ready for review!

@Rocketknight1 Rocketknight1 requested a review from gante August 18, 2022 15:45
@Rocketknight1 Rocketknight1 marked this pull request as ready for review August 18, 2022 15:47
@ydshieh
Collaborator

ydshieh commented Aug 18, 2022

Thanks @Rocketknight1 for the quick fix. I am wondering if it makes sense to wrap the loss computation for TFCLIP into an hf_compute_loss method?

I also see that TFSegformerForSemanticSegmentation defines hf_compute_loss within its own model class.

If doable, maybe it would be good to add classes like TFSemanticSegmentationLoss, TFContrastiveLoss, etc.

Just want to hear some opinions. cc @gante @amyeroberts
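For the sake of discussion, here is a rough sketch of what a shared contrastive-loss class could look like (purely hypothetical: TFContrastiveLoss does not exist in the library, and the body below is just the standard symmetric cross-entropy over the text/image similarity matrix, not code from this PR):

import tensorflow as tf

class TFContrastiveLoss:
    # Hypothetical mixin: symmetric cross-entropy over a text/image similarity matrix,
    # where the matching pairs sit on the diagonal.
    def hf_compute_loss(self, logits_per_text):
        labels = tf.range(tf.shape(logits_per_text)[0])
        caption_loss = tf.reduce_mean(
            tf.keras.losses.sparse_categorical_crossentropy(labels, logits_per_text, from_logits=True)
        )
        image_loss = tf.reduce_mean(
            tf.keras.losses.sparse_categorical_crossentropy(labels, tf.transpose(logits_per_text), from_logits=True)
        )
        return (caption_loss + image_loss) / 2.0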

@Rocketknight1
Member Author

@ydshieh I don't think that's necessary - the new check we use means we don't need to rely on hf_compute_loss being present anymore!

@Rocketknight1
Member Author

Just realized that using return instead of continue in the tests was skipping a lot of tests, which might have been why it was green so easily. Rerunning everything!
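A minimal, self-contained illustration of the difference (not the actual test loop): return exits the whole loop at the first skipped item, so later model classes are never exercised, while continue only skips the current one.

def run_loss_tests(model_classes, has_loss, early_return=False):
    tested = []
    for model_class in model_classes:
        if model_class not in has_loss:
            if early_return:
                return tested  # old behaviour: silently stops testing here
            continue           # fixed behaviour: skip this class, keep going
        tested.append(model_class)
    return tested

classes = ["model_a", "model_b", "model_c"]
print(run_loss_tests(classes, has_loss={"model_b", "model_c"}, early_return=True))   # []
print(run_loss_tests(classes, has_loss={"model_b", "model_c"}, early_return=False))  # ['model_b', 'model_c']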

Member

@gante left a comment


Looks good 👍 And I agree, relying on a loss output is safer!

Will re-review after the errors are fixed (I see the latest commit, which replaces the return, broke some tests :D)

@Rocketknight1 Rocketknight1 requested a review from sgugger August 18, 2022 18:12
@Rocketknight1
Member Author

Further updates: Now that we're no longer incorrectly skipping tests, this turned up quite a few bugs! The main source of issues is that some more recent models return scalar losses, but both our test_loss_computation test and Keras fit() expect the loss to have a shape, even if that shape is (1,).
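To make the mismatch concrete, a small sketch with assumed values (not library code): a scalar loss has shape (), while the common test and Keras fit() expect at least shape (1,), so one fix is to give the scalar an explicit length-one dimension.

import tensorflow as tf

scalar_loss = tf.constant(0.3)                   # shape () -- what some newer models return
loss_with_shape = tf.reshape(scalar_loss, (1,))  # shape (1,) -- what the tests and fit() expect
print(scalar_loss.shape, loss_with_shape.shape)  # () (1,)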

Contributor

@amyeroberts left a comment


Overall structure looks good to me. Leaving some comments from my 👀 pass, but they're just nits.

added_label_names = sorted(list(prepared_for_class.keys() - inputs_dict.keys()), reverse=True)
if not added_label_names:
    continue  # This test is only for models with easily-separable labels
added_label = prepared_for_class[added_label_names[0]]

Can we assume there's only one added label? Not sure if it matters, but if we can and it does, can we add an assert above?
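For illustration, one way the suggested assertion could look inside the same loop as the snippet above (hypothetical, not code from the PR):

added_label_names = sorted(prepared_for_class.keys() - inputs_dict.keys(), reverse=True)
if not added_label_names:
    continue  # This test is only for models with easily-separable labels
assert len(added_label_names) == 1, f"Expected exactly one added label, got {added_label_names}"
added_label = prepared_for_class[added_label_names[0]]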

Collaborator

@sgugger left a comment


Left a nit, but LGTM! Thanks for working on this!

@Rocketknight1 Rocketknight1 merged commit 660e0b9 into main Sep 9, 2022
@Rocketknight1 Rocketknight1 deleted the return_loss_fix branch September 9, 2022 19:01


Successfully merging this pull request may close these issues.

TFClipModel fails to train because of None loss
