Commit 06667a2

datquocnguyen authored and elusenji committed
Update tokenization_bertweet.py (huggingface#16941)
The emoji version must be either 0.5.4 or 0.6.0. Newer emoji versions track newer versions of the Emoji Charts and are therefore inconsistent with the version used to pre-process the pre-training Tweet corpus (i.e. inconsistent with the vocab).
1 parent 25a19ac commit 06667a2

1 file changed, +1 -1 lines changed

src/transformers/models/bertweet/tokenization_bertweet.py

Lines changed: 1 addition & 1 deletion
@@ -152,7 +152,7 @@ def __init__(
             self.demojizer = demojize
         except ImportError:
             logger.warning(
-                "emoji is not installed, thus not converting emoticons or emojis into text. Please install emoji: pip3 install emoji"
+                "emoji is not installed, thus not converting emoticons or emojis into text. Install emoji: pip3 install emoji==0.6.0"
             )
             self.demojizer = None
 
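
The change above only updates the warning text. It does not check which emoji version is actually installed. As a minimal sketch of how a caller might verify that constraint, the snippet below compares the installed version against the two releases named in the commit message. The names `SUPPORTED_EMOJI_VERSIONS`, `is_supported_emoji_version`, and `load_demojizer` are hypothetical and are not part of transformers; only `emoji.demojize` and the standard-library `importlib.metadata` are assumed to exist.

```python
import logging
from importlib import metadata

logger = logging.getLogger(__name__)

# Releases the commit message names as consistent with the BERTweet vocab.
SUPPORTED_EMOJI_VERSIONS = ("0.5.4", "0.6.0")


def is_supported_emoji_version(version):
    """Return True if `version` is one of the releases named in the commit message."""
    return version in SUPPORTED_EMOJI_VERSIONS


def load_demojizer():
    """Hypothetical helper: return emoji.demojize, or None if emoji is not installed.

    Logs a warning when the installed version is not one of the supported releases.
    """
    try:
        from emoji import demojize
    except ImportError:
        logger.warning(
            "emoji is not installed, thus not converting emoticons or emojis into text. "
            "Install emoji: pip3 install emoji==0.6.0"
        )
        return None

    try:
        installed = metadata.version("emoji")
    except metadata.PackageNotFoundError:
        # The module is importable but has no distribution metadata; skip the check.
        return demojize

    if not is_supported_emoji_version(installed):
        logger.warning(
            "emoji %s is installed, but the vocab was built with 0.5.4 or 0.6.0. "
            "Install emoji: pip3 install emoji==0.6.0",
            installed,
        )
    return demojize
```

A caller could assign the result to `self.demojizer` in place of the bare import. Whether to warn or raise on a mismatched version is a design choice the commit leaves open.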
