@FlamingoPg
Contributor

PaddleNLP: improve TokenizerFast support

@paddle-bot

paddle-bot bot commented Nov 3, 2024

Your PR has been submitted. Thanks for your contribution!
Please check its format and content. For this, you can refer to Template and Demo.

@luotao1
Collaborator

luotao1 commented Nov 4, 2024

@KB-Ding @DrownFish19 this is ready for review.

@DrownFish19
Collaborator

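Please note the following:

  • Contributors from the 护航 (Escort) program are also updating TokenizerFast-related code, but the two efforts are complementary and run in parallel. The Hackathon work targets the TokenizerFast of individual models, with a focus on verifying accuracy; the 护航 work targets the TokenizerFast base class, with a focus on improving its core capabilities.
  • A PR for BertTokenizerFast already exists, so it can be verified directly; there is no need to add it again.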

PaddlePaddle/PaddleNLP#9215
PaddlePaddle/PaddleNLP#9353

Collaborator

@DrownFish19 DrownFish19 left a comment

LGTM

@luotao1 luotao1 merged commit 5aa7c84 into PaddlePaddle:master Nov 4, 2024
1 check passed

4 participants