[Feature] allow custom dropout, number of layers/units for BERT #950

eric-haibin-lin · 2019-09-30T21:06:41Z

Description

reopen #851

Checklist

Essentials

PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
Code is well-documented

Changes

Feature1, tests, (and when applicable, API doc)
Feature2, tests, (and when applicable, API doc)

Comments

If this change is a backward incompatible change, why must this change be made.
Interesting edge cases to note here

cc @dmlc/gluon-nlp-team

codecov · 2019-09-30T21:06:45Z

Codecov Report

Merging #950 into master will increase coverage by 9.86%.
The diff coverage is 86.34%.

@@            Coverage Diff             @@
##           master     #950      +/-   ##
==========================================
+ Coverage   78.41%   88.27%   +9.86%     
==========================================
  Files          67       67              
  Lines        6277     6287      +10     
==========================================
+ Hits         4922     5550     +628     
+ Misses       1355      737     -618

| Impacted Files | Coverage Δ | |
|---|---|---|
| src/gluonnlp/utils/parameter.py | `87.09% <100%> (+3.16%)` | ⬆️ |
| src/gluonnlp/optimizer/__init__.py | `100% <100%> (ø)` | ⬆️ |
| src/gluonnlp/metric/__init__.py | `100% <100%> (ø)` | ⬆️ |
| src/gluonnlp/metric/masked_accuracy.py | `100% <100%> (+39.53%)` | ⬆️ |
| src/gluonnlp/utils/version.py | `100% <100%> (ø)` | ⬆️ |
| src/gluonnlp/utils/files.py | `42.62% <18.18%> (-6.4%)` | ⬇️ |
| src/gluonnlp/model/train/language_model.py | `88.51% <25%> (+48.62%)` | ⬆️ |
| src/gluonnlp/optimizer/bert_adam.py | `87.32% <81.57%> (-5.86%)` | ⬇️ |
| src/gluonnlp/metric/length_normalized_loss.py | `89.28% <89.28%> (ø)` | |
| src/gluonnlp/data/utils.py | `86.39% <95.34%> (+12.34%)` | ⬆️ |
... and 22 more

mli · 2019-09-30T21:44:01Z

Job PR-950/1 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-950/1/index.html

mli · 2019-11-06T07:21:19Z

Job PR-950/2 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-950/2/index.html

mli · 2019-12-16T01:32:35Z

Job PR-950/3 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-950/3/index.html

mli · 2020-01-15T16:50:42Z

Job PR-950/4 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-950/4/index.html

leezu · 2020-01-15T18:04:45Z

@eric-haibin-lin

[2020-01-15T17:07:50.259Z] tests/unittest/test_models.py::test_bert_models[False] FAILED            [ 60%]

[2020-01-15T17:08:37.013Z] tests/unittest/test_models.py::test_bert_models[True] FAILED             [ 61%]

eric-haibin-lin · 2020-01-15T20:56:18Z

I saw that, and I am not able to reproduce it locally.

eric-haibin-lin · 2020-01-24T21:02:09Z

@leezu were you able to reproduce the error?

mli · 2020-01-27T19:51:54Z

Job PR-950/5 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-950/5/index.html

mli · 2020-01-28T23:18:56Z

Job PR-950/6 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-950/6/index.html

mli · 2020-01-28T23:49:14Z

Job PR-950/7 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-950/7/index.html

mli · 2020-01-29T00:18:51Z

Job PR-950/8 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-950/8/index.html

leezu · 2020-01-29T17:55:50Z

src/gluonnlp/model/bert.py

-        'Cannot override predefined model settings.'
+    predefined_args = bert_hparams[model_name].copy()
+    if not hparam_allow_override:
+        mutable_args = ['use_residual', 'dropout', 'word_embed']


Not changed in this PR, but why is embed_size not part of this?

I copied some from the transformer get-model function. Actually, I'm not sure whether we want to have a whitelist of mutable args. It makes the function harder to maintain, since we have the hparam_allow_override flag.

Do you want to remove the whitelist then? In this PR or a separate one?

I'd prefer a separate one
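The interaction between the whitelist and the `hparam_allow_override` flag discussed in this thread can be sketched roughly as follows. This is a minimal, self-contained illustration, not the merged code: the `resolve_hparams` helper and the contents of the `bert_hparams` table are hypothetical stand-ins, and only the `.copy()` call, the `mutable_args` list, and the error message are taken from the diff quoted above. The exact check in the merged function is not shown in this conversation.

```python
# Hypothetical stand-in for the library's table of predefined BERT settings.
bert_hparams = {
    'bert_12_768_12': {
        'num_layers': 12,
        'units': 768,
        'dropout': 0.1,
        'use_residual': True,
        'word_embed': None,
    },
}


def resolve_hparams(model_name, hparam_allow_override=False, **kwargs):
    """Merge user overrides into a copy of the predefined settings.

    Hypothetical helper; the name and signature are assumptions made
    for illustration only.
    """
    # Copy first, so updating the result never mutates the shared table.
    predefined_args = bert_hparams[model_name].copy()
    if not hparam_allow_override:
        # Whitelist quoted in the diff: only these keys may change
        # when the override flag is off.
        mutable_args = ['use_residual', 'dropout', 'word_embed']
        blocked = [key for key in kwargs if key not in mutable_args]
        assert not blocked, 'Cannot override predefined model settings.'
    predefined_args.update(kwargs)
    return predefined_args


# Whitelisted key: accepted without the flag.
custom_dropout = resolve_hparams('bert_12_768_12', dropout=0.2)

# Non-whitelisted key: accepted only when the flag is set.
smaller_model = resolve_hparams(
    'bert_12_768_12', hparam_allow_override=True, num_layers=6, units=512)
```

Under this sketch, dropping the whitelist (as proposed for a follow-up PR) would leave `hparam_allow_override` as the single switch that decides whether any predefined setting may be changed.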

Ubuntu and others added 9 commits July 26, 2019 21:49

add hparam_allow_override

c9401ce

add unit test

cde0230

fix conflict

1fef43c

support roberta

dd4f4d6

fix bert test

461ce81

Merge remote-tracking branch 'upstream/master' into allow

99a501b

Merge remote-tracking branch 'upstream/master' into allow

c6c97c9

Merge remote-tracking branch 'upstream/master' into allow

385b2c0

merge

8fded81

eric-haibin-lin requested a review from a team as a code owner September 30, 2019 21:06

leezu approved these changes Oct 1, 2019

szha added the "release focus" label Oct 8, 2019

Merge branch 'master' into allow

14a6bc9

resovle confglict

66f236f

Merge branch 'master' into allow

f100cae

leezu self-assigned this Jan 22, 2020

Merge remote-tracking branch 'origin/master' into haibinallowcustomdropoutnumberoflayers

a094d36

leezu added 2 commits January 28, 2020 22:45

Debug: Use NaiveEngine on CI

c2a89a8

Isolate test

d2c0e0f

avoid mutating global hparan

181e614

Lin added 2 commits January 28, 2020 15:45

Revert "Isolate test"

2e4f2f2

This reverts commit d2c0e0f.

Revert "Debug: Use NaiveEngine on CI"

5a69810

This reverts commit c2a89a8.

leezu reviewed Jan 29, 2020

leezu merged commit 5a776bf into dmlc:master Jan 29, 2020

eric-haibin-lin deleted the allow branch February 2, 2020 06:21
