Skip to content

Conversation

AngledLuffa
Copy link
Collaborator

Eliminate goeswith phrases from the lemmatizer training data. Doesn't do anything to the lemmatizer in the case of eval data (dev or test sets, and more importantly, Pipelines)

Addresses #1345 once the models are retrained

…tizer, since some treebanks have the standard of making the lemma the complete goeswith phrase, and that works pretty horribly for the separate word components
@AngledLuffa
Copy link
Collaborator Author

Had to update the previous caseless example to better match the new models. Should be good to go now. The new model is available in 1.7.0 already

@AngledLuffa AngledLuffa merged commit 191a05f into dev Feb 18, 2024
@AngledLuffa AngledLuffa deleted the lemma_goeswith branch February 18, 2024 09:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant