Optimize FasterTokenizer #36701

joey12300 · 2021-10-25T10:34:52Z

PR types

Performance optimization

PR changes

OPs

Describe

optimize fast tokenizer by:

remove the usage of boost::split
create a local BertTokenizer object in stack instead of heap.

paddle-bot-old · 2021-10-25T10:34:55Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

wawltor

LGTM

optimize fast tokenizer

3227a88

ZeyuChen changed the title ~~optimize fast tokenizer~~ Optimize FasterTokenizer Oct 25, 2021

remove const_cast

2127f6d

wawltor approved these changes Oct 26, 2021

View reviewed changes

joey12300 requested a review from Steffy-zxf October 26, 2021 02:29

Steffy-zxf approved these changes Oct 26, 2021

View reviewed changes

Steffy-zxf merged commit 290ded7 into PaddlePaddle:develop Oct 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize FasterTokenizer #36701

Optimize FasterTokenizer #36701

Uh oh!

joey12300 commented Oct 25, 2021

Uh oh!

paddle-bot-old bot commented Oct 25, 2021

Uh oh!

wawltor left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Optimize FasterTokenizer #36701

Optimize FasterTokenizer #36701

Uh oh!

Conversation

joey12300 commented Oct 25, 2021

PR types

PR changes

Describe

Uh oh!

paddle-bot-old bot commented Oct 25, 2021

Uh oh!

wawltor left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants