-
Notifications
You must be signed in to change notification settings - Fork 528
[Feature] allow custom dropout, number of layers/units for BERT #950
Conversation
Codecov Report
@@ Coverage Diff @@
## master #950 +/- ##
==========================================
+ Coverage 78.41% 88.27% +9.86%
==========================================
Files 67 67
Lines 6277 6287 +10
==========================================
+ Hits 4922 5550 +628
+ Misses 1355 737 -618
|
Job PR-950/1 is complete. |
Job PR-950/2 is complete. |
Job PR-950/3 is complete. |
Job PR-950/4 is complete. |
|
I saw that.. And i am not able to reproduce it locally. |
@leezu were you able to reproduce the err? |
…opoutnumberoflayers
Job PR-950/5 is complete. |
Job PR-950/6 is complete. |
Job PR-950/7 is complete. |
Job PR-950/8 is complete. |
'Cannot override predefined model settings.' | ||
predefined_args = bert_hparams[model_name].copy() | ||
if not hparam_allow_override: | ||
mutable_args = ['use_residual', 'dropout', 'word_embed'] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not changed in this PR, but why is embed_size
not part of this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I copied some from the transformer get model function. Actually, i'm not sure whether we want to have a whitelist of mutable args. It make the fucntion harder to maintain, since we have hparam_allow_override
flag.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do you want to remove the whitelist then? In this PR or a separate one?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd prefer a separate one
Description
reopen #851
Checklist
Essentials
Changes
Comments
cc @dmlc/gluon-nlp-team