@AngledLuffa AngledLuffa commented Jul 20, 2022

Load the pretrained charlm and add it as an input to the POS model

This improves accuracy on almost all POS models

Doing the same thing for depparse would also make sense, but is
currently not done. However, the downstream scores of depparse don't
seem to be negatively affected by using the different (better) POS
tags produced by models using the pretrained charlm

Add a POS-specific charlm map for the medical EN datasets and for the one dataset which appears to be hurt by the charlm (tr_boun)

craft, genia -> None
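
A rough sketch of how such a POS-specific override might behave, to make the `craft, genia -> None` mapping concrete. Everything here is hypothetical: the table names, the function `choose_pos_charlm`, and the placeholder charlm names are illustrative assumptions, not the actual tables or identifiers in this PR. The `tr`/`boun` entry is also assumed to map to `None`, since the description only says that dataset gets its own entry.

```python
# Hypothetical sketch: all names and values are placeholders, not the real tables.

# Default charlm per language (placeholder values).
DEFAULT_CHARLMS = {
    "en": "example_en_charlm",
    "tr": "example_tr_charlm",
}

# POS-specific overrides, keyed by language and then dataset.
# None means "train and run this POS model without a charlm".
POS_CHARLM_OVERRIDES = {
    "en": {"craft": None, "genia": None},
    "tr": {"boun": None},  # assumption: the dataset hurt by the charlm opts out
}


def choose_pos_charlm(lang, dataset):
    """Return the charlm name for a POS model, or None if it should not use one."""
    overrides = POS_CHARLM_OVERRIDES.get(lang, {})
    # Check membership explicitly so that an override of None is honored
    # instead of falling through to the language default.
    if dataset in overrides:
        return overrides[dataset]
    return DEFAULT_CHARLMS.get(lang)
```

The explicit `in` check matters: using `overrides.get(dataset) or default` would silently turn a `None` override back into the default charlm.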

Produce resources.json with POS charlms

Make the Pipeline pass in charlm paths if present in resources.json
TODO: use the foundation_cache to load them
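
As an illustration of the "pass in charlm paths if present" step, here is a minimal sketch under stated assumptions. The layout of the resources dictionary, the key names `charlm_forward_file` and `charlm_backward_file`, and the helper `charlm_kwargs` are all hypothetical and chosen for the example; the real resources.json schema and Pipeline code may differ.

```python
import json

# Hypothetical key names for the forward and backward charlm paths.
CHARLM_KEYS = ("charlm_forward_file", "charlm_backward_file")


def charlm_kwargs(resources, lang, package):
    """Collect charlm paths for a POS model, skipping any that are absent.

    `resources` is assumed to be the parsed resources.json, laid out as
    resources[lang]["pos"][package] -> dict of settings (an assumption).
    """
    entry = resources.get(lang, {}).get("pos", {}).get(package, {})
    kwargs = {}
    for key in CHARLM_KEYS:
        path = entry.get(key)
        if path:  # only pass the argument when a path is actually listed
            kwargs[key] = path
    return kwargs


# Example with made-up paths: one model has charlms listed, one does not.
example = json.loads("""
{
  "en": {
    "pos": {
      "with_charlm": {
        "charlm_forward_file": "en/forward_charlm/example.pt",
        "charlm_backward_file": "en/backward_charlm/example.pt"
      },
      "without_charlm": {}
    }
  }
}
""")

with_paths = charlm_kwargs(example, "en", "with_charlm")
without_paths = charlm_kwargs(example, "en", "without_charlm")
```

Models with no charlm entry get an empty dictionary, so the caller can unpack the result into the processor's arguments without special-casing them.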

@AngledLuffa AngledLuffa merged commit b152169 into dev Jul 20, 2022
@AngledLuffa AngledLuffa deleted the pos_charlm branch July 20, 2022 21:14