Implemented hydra heads + adaptive kl #33

Dahoas · 2022-10-13T01:34:18Z

Implemented BranchModel class to support multi-headed hydra type models. Also added adaptive kl controller.

Achieves 4x speedup for training on GPT2-mediuma and 10x speedup for training on GPTj and halves memory footprint.

I also added unittests.

LouisCastricato · 2022-10-13T01:57:59Z

configs/ppo_config.yml

  model_type : "AcceleratePPOModel"  # Name of accelerate model type to load
  device : "cuda"  # Train device
-  num_layers_unfrozen : -1  # Number of bottom layers to freeze during training
+  num_layers_unfrozen : 2  # Number of bottom layers to freeze during training


Why are we changing this in the default config.

See comment below

LouisCastricato · 2022-10-13T01:58:07Z

configs/ppo_config.yml

  cliprange : 0.2  # clip range
  cliprange_value : 0.2  # clip range
-  vf_coef : 0.2  # value term weight
+  vf_coef : 2.3  # value term weight


Likewise here.

I found these parameters work a lot better for quickly checking whether reward is increasing

Got it. sounds good.

LouisCastricato · 2022-10-13T01:58:28Z

configs/test_config.yml

@@ -0,0 +1,52 @@
+model:


This config would be great for a CI.

LouisCastricato · 2022-10-13T02:04:19Z

trlx/model/nn/ppo_models.py

+
+# Cell
+
+class ModelBranch(PreTrainedModel):


Need a high level overview comment of how this class works.

LouisCastricato · 2022-10-13T02:06:15Z

unittests/test_ppo.py

@@ -0,0 +1,52 @@
+import unittest


Now this is a useful class but I think we should be handling unit in a separate PR....

I kept it in this merge because it tests the ModelBranch implementation

got it. It needs a lot of work. lets chat later.

Dahoas added 6 commits October 6, 2022 17:27

added frozen layers

9cc3249

implemented multi-branched ref model

fd677dd

added adaptive kl controller

e091789

verified gptj with hydra heads

96ec3f1

reverting ppo_sentiments.py

4c5fb6f

Merge branch 'master' into alex-working

033866f

Dahoas requested review from LouisCastricato and maxreciprocate October 13, 2022 01:34

LouisCastricato reviewed Oct 13, 2022

View reviewed changes

added description for ModelBranch class

15048a7

LouisCastricato merged commit d90dc88 into master Oct 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implemented hydra heads + adaptive kl #33

Implemented hydra heads + adaptive kl #33

Uh oh!

Dahoas commented Oct 13, 2022 •

edited

Loading

Uh oh!

LouisCastricato Oct 13, 2022

Uh oh!

Dahoas Oct 13, 2022

Uh oh!

LouisCastricato Oct 13, 2022

Uh oh!

Dahoas Oct 13, 2022

Uh oh!

LouisCastricato Oct 13, 2022

Uh oh!

LouisCastricato Oct 13, 2022

Uh oh!

LouisCastricato Oct 13, 2022

Uh oh!

LouisCastricato Oct 13, 2022

Uh oh!

Dahoas Oct 13, 2022

Uh oh!

LouisCastricato Oct 13, 2022

Uh oh!

Uh oh!

Implemented hydra heads + adaptive kl #33

Implemented hydra heads + adaptive kl #33

Uh oh!

Conversation

Dahoas commented Oct 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Dahoas commented Oct 13, 2022 •

edited

Loading