-
Notifications
You must be signed in to change notification settings - Fork 920
Depparse peft #1344
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Depparse peft #1344
Changes from all commits
Commits
Show all changes
17 commits
Select commit
Hold shift + click to select a range
f0ce4d0
Don't create an optimizer for the transformer if there is no learning…
AngledLuffa 0a0f172
Add a flag to set a different sized batch for the 2nd optimizer. Let…
AngledLuffa eb8558b
Keep a map of dependency optimizers and iterate them in loops instead…
AngledLuffa 0dcf712
Make checkpoints less often - every 500 by default, with occasional o…
AngledLuffa 0687c37
Add a warmup scheduler to finetuning the depparse transformer
AngledLuffa 96fb54e
Fix --no_checkpoint option
AngledLuffa e75cb31
Oops, fix usage of weight_decay in the common function to build the b…
AngledLuffa 331d3ae
Add flags for using weight decay in the first round of optimizer in t…
AngledLuffa c087fcf
Move --use_peft and the checking of --use_peft vs --bert_finetune to …
AngledLuffa df37c25
Set the default number of hidden layers used from the transformer to 4
AngledLuffa 19fa8c0
Force bert saved when loading / saving a model
AngledLuffa b9ee08f
Add a PEFT wrapper to the dependency parser
AngledLuffa f99e7aa
Saved depparse models were missing the last score update
AngledLuffa 59085d3
Refactor the code which runs the predictions in depparse. Use this t…
AngledLuffa 4c7d46f
Add a linear warmup scheduler for the 2nd optimizer pass
AngledLuffa 72a3cd5
Continue training from the current global_step count rather than rewi…
AngledLuffa cd50fc6
Refactor a method that builds the LoRA wrapper around a bert model, s…
AngledLuffa File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wonder if this causes problems with PEFT
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The
detach
setting just controls whether or not the gradients of the transformer embedding are kept. I don't think it should affect peft in any way if the model is frozen