
Conversation

@jon-tow (Collaborator) commented Nov 21, 2022

This PR adds support for more CausalLMs from the Hugging Face Hub. Previously, only models that followed the gpt2 architecture's layer-naming convention were supported (except for ILQL, which also supports gpt-neox).
This will allow one to use models such as OPT and Pythia out of the box.

wandb reports:

Notes:

  • As discussed in the TRLX meeting (Dec. 5, 2022), I've left the BLOOM implementation out of hydra-style model branching. CC @LouisCastricato

Related Issue: #121
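For readers who want a concrete picture of what "layer-naming convention" means here: different CausalLM implementations keep their decoder blocks under different attribute paths, so code that reaches into the model has to resolve those paths per architecture. The mapping and helper below are purely illustrative and not the PR's code, though the attribute paths match the Hugging Face implementations:

```python
from transformers import AutoModelForCausalLM

# Where the stack of transformer blocks lives for a few architectures.
DECODER_BLOCK_PATHS = {
    "gpt2": "transformer.h",          # GPT2LMHeadModel
    "gpt_neox": "gpt_neox.layers",    # GPTNeoXForCausalLM
    "opt": "model.decoder.layers",    # OPTForCausalLM
}

def get_decoder_blocks(model):
    """Walk a dotted attribute path to reach the model's decoder blocks."""
    path = DECODER_BLOCK_PATHS[model.config.model_type]
    blocks = model
    for name in path.split("."):
        blocks = getattr(blocks, name)
    return blocks

# Example: the same call works for gpt2-style and OPT-style models.
model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
print(len(get_decoder_blocks(model)))  # number of decoder layers
```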



@pytest.mark.parametrize(
    "model_name",
Collaborator
Nice, this is great.
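For context, the @pytest.mark.parametrize fragment above is from a test parametrized over Hub model names. A minimal, self-contained sketch of that pattern (the model list and test body here are assumptions, not the PR's actual test):

```python
import pytest
from transformers import AutoConfig, AutoModelForCausalLM

# Illustrative set of architectures to exercise; the PR's real list may differ.
MODEL_NAMES = ["gpt2", "facebook/opt-125m"]

@pytest.mark.parametrize("model_name", MODEL_NAMES)
def test_causal_lm_instantiates(model_name):
    # Build from config only, so the test does not download full weights.
    config = AutoConfig.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_config(config)
    assert model.config.model_type in ("gpt2", "opt")
```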


**PPO**

.. autoclass:: trlx.model.accelerate_ppo_model.AcceleratePPOModel
Collaborator
You will probably have to merge with @reciprocated's latest commit (just FYI).

def _getattr(obj, attr):
    return getattr(obj, attr, *args)

return functools.reduce(_getattr, [obj] + attr.split("."))
Collaborator
Nice
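To complete the picture, the fragment above looks like the inner helper of a recursive getattr that resolves dotted attribute paths (e.g. "transformer.h" or "model.decoder.layers"). A self-contained sketch of that utility, with the function names assumed rather than copied from the PR:

```python
import functools

def rgetattr(obj, attr, *args):
    """getattr over a dotted path, e.g. rgetattr(model, "model.decoder.layers")."""
    def _getattr(obj, attr):
        return getattr(obj, attr, *args)

    # Fold over the path segments: obj -> obj.a -> obj.a.b -> ...
    return functools.reduce(_getattr, [obj] + attr.split("."))

def rhasattr(obj, attr):
    """hasattr over a dotted path, using a sentinel default."""
    sentinel = object()
    return rgetattr(obj, attr, sentinel) is not sentinel
```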

@Dahoas (Collaborator) commented Nov 21, 2022

Looks very nice! For some reason tests for the HydraHead are breaking (I'm trying to see why).

Edit: I think it's negligible. We should change the assertEqual to something like self.assertTrue(hs_diff <= 1e-5).

@jon-tow (Collaborator, Author) commented Nov 21, 2022

@Dahoas Thanks for the review! This is still in draft as I'll be adding more tests and running examples. I'll bother you again for another review later, if you don't mind 😄

@dongs0104 (Contributor) commented
Is OPTForCausalLM working well? I changed the PPO trainer code and trained with it, but the results were bad, not like gpt2.

The OPT model adds a special token, </s>, and the value function loss is not converging. Any ideas?

@jon-tow (Collaborator, Author) commented Nov 23, 2022

Hi @dongs0104! The default PPO trainer uses a GPT hydra-head modification, so supporting other models such as OPT will require reworking the internals. I plan to add hydras for OPT in this PR soon.

Thanks for flagging the possible special-token issues; we'll keep an eye on them 👍
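For readers unfamiliar with the hydra-head setup being referenced: the policy and a frozen reference branch share the lower transformer blocks, and only the top blocks are duplicated and frozen. A very rough, architecture-agnostic sketch of that branching step (purely illustrative; the actual trlx implementation differs in detail):

```python
import copy
import torch.nn as nn

def make_frozen_branch(decoder_blocks: nn.ModuleList, num_unfrozen: int) -> nn.ModuleList:
    """Deep-copy the top `num_unfrozen` blocks to serve as a frozen reference branch."""
    branch = copy.deepcopy(decoder_blocks[-num_unfrozen:])
    for param in branch.parameters():
        param.requires_grad_(False)
    return branch
```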

@danyang-rainbow commented
Good job!

@jon-tow marked this pull request as ready for review December 6, 2022 00:12
@LouisCastricato (Contributor) left a comment
Looks great! Let's merge.

  logit_mask=None,
- pad_token_id=50256,
- eos_token_id=50256,
+ pad_token_id=None,
+ eos_token_id=None,
Contributor
Why the change?

Collaborator Author
50256 is the default eos_token_id for gpt2-based tokenizers; this change just ensures there is no implicit default for the other, non-gpt2 models.
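One way to avoid a hard-coded 50256 is to resolve the ids from the tokenizer at setup time; a small illustrative sketch (not the PR's actual wiring):

```python
from transformers import AutoTokenizer

def resolve_generation_token_ids(model_name: str, pad_token_id=None, eos_token_id=None):
    """Fill in pad/eos ids from the tokenizer when the config leaves them as None."""
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    if eos_token_id is None:
        eos_token_id = tokenizer.eos_token_id
    if pad_token_id is None:
        pad_token_id = tokenizer.pad_token_id
        if pad_token_id is None:
            # gpt2-style tokenizers define no pad token; fall back to eos.
            pad_token_id = eos_token_id
    return pad_token_id, eos_token_id
```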

@LouisCastricato merged commit 803f8cf into CarperAI:main Dec 6, 2022
@Dahoas (Collaborator) commented Dec 6, 2022

This is excellent!

@jon-tow deleted the auto-convert-layers branch December 7, 2022 20:16