How to add custom models in NeMo? #13158
Unanswered
gagangayari
asked this question in
Q&A
Replies: 1 comment
-
Hi, thanks for your question! Many adjustments to the architecture can be made by tweaking the model's GPTConfig. Using MLA requires swapping out MCore's |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I wanted to play around with different model components. Eg Plugging in MLA instead of MHA. Or change the forward method somewhat. I guess we can specify it via MCoreSpec. Can somebody please help me with the complete flow on how to add such a customized model and use it for training. If there is any recipe that would be a great help. Thanks
Beta Was this translation helpful? Give feedback.
All reactions