The spaCy tokenizer splits hyphenated words at the hyphen, treating the hyphen as a separate token. For example, "eye-opening" becomes "eye - opening". Is there a way to keep hyphenated words together, as the quanteda tokenizers can? (@JBGruber: any idea? :))
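One possible workaround, sketched here in Python, is to rejoin the pieces after tokenization. It assumes the tokens are already available as a plain list of strings in which the hyphen is its own token, as in the example above. The function name `merge_hyphenated` is hypothetical and not part of spaCy or quanteda. The sketch only post-processes the output; it does not change the tokenizer's rules.

```python
def merge_hyphenated(tokens):
    """Rejoin 'word', '-', 'word' runs into one hyphenated token.

    Hypothetical helper: works on plain strings, not on spaCy objects.
    """
    merged = []
    i = 0
    while i < len(tokens):
        # Merge only when the hyphen has an alphanumeric neighbour on both sides.
        if (
            tokens[i] == "-"
            and merged
            and i + 1 < len(tokens)
            and merged[-1][-1:].isalnum()
            and tokens[i + 1][:1].isalnum()
        ):
            merged[-1] = merged[-1] + "-" + tokens[i + 1]
            i += 2  # skip the hyphen and the word just attached
        else:
            merged.append(tokens[i])
            i += 1
    return merged


# Example input mirrors the split described above.
print(merge_hyphenated(["An", "eye", "-", "opening", "talk", "."]))
```

Because each merged token becomes the left-hand neighbour for the next hyphen, chains such as "state - of - the - art" also collapse into one token, while a hyphen without a word on both sides is left alone. spaCy's documentation also describes customizing the tokenizer's infix patterns, which may be the cleaner fix if the rules can be changed before tokenization.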