Conversation

Contributor

@LiYuRio LiYuRio commented Jun 14, 2022

PR types

New features

PR changes

Others

Description

Adds parameter configurations for GPT-175B and GPT-89B, aligned with FasterTransformer and intended mainly for inference deployment.
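
For readers unfamiliar with what such a configuration entry looks like, here is a minimal sketch. It is not copied from this PR: the key names and the `GPT_175B_CONFIG` / `approx_param_count` identifiers are assumptions for illustration, and the values are the publicly documented GPT-3 175B hyperparameters rather than the exact numbers merged here (FasterTransformer-aligned configs may, for example, pad the vocabulary size). No GPT-89B values are shown because the PR text does not state them.

```python
# Hypothetical sketch of a large-GPT configuration entry.
# Key names and values are assumptions, not the contents of this PR.
GPT_175B_CONFIG = {
    "vocab_size": 50257,              # GPT-3 BPE vocabulary (may be padded in practice)
    "hidden_size": 12288,
    "num_hidden_layers": 96,
    "num_attention_heads": 96,
    "intermediate_size": 4 * 12288,   # feed-forward width, conventionally 4x hidden
    "max_position_embeddings": 2048,
}


def head_dim(cfg):
    """Per-head dimension; hidden_size must divide evenly across heads."""
    hidden, heads = cfg["hidden_size"], cfg["num_attention_heads"]
    if hidden % heads != 0:
        raise ValueError("hidden_size must be divisible by num_attention_heads")
    return hidden // heads


def approx_param_count(cfg):
    """Rough parameter estimate that ignores biases and layer norms."""
    h = cfg["hidden_size"]
    # Attention: Q, K, V and output projections (4 * h * h).
    # Feed-forward: two projections between h and intermediate_size.
    per_layer = 4 * h * h + 2 * h * cfg["intermediate_size"]
    embeddings = (cfg["vocab_size"] + cfg["max_position_embeddings"]) * h
    return cfg["num_hidden_layers"] * per_layer + embeddings


if __name__ == "__main__":
    print(f"head_dim = {head_dim(GPT_175B_CONFIG)}")
    print(f"approx params = {approx_param_count(GPT_175B_CONFIG) / 1e9:.1f}B")
```

A quick estimate like this is a convenient sanity check that a config's layer count, hidden size, and head count actually match the model size in its name.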

CLAassistant commented Jun 14, 2022

CLA assistant check
All committers have signed the CLA.

@LiYuRio LiYuRio force-pushed the dev_add_configs branch 2 times, most recently from 26f090e to 5eb4c11 Compare June 14, 2022 03:17
qingqing01 previously approved these changes Jun 14, 2022
Contributor

@qingqing01 qingqing01 left a comment

LGTM

Contributor

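This is for large-model inference deployment, right?

This file is the regular GPT model, so the configurations do not need to be added here.

Please add them to the dynamic-graph GPT-3 model instead:
examples/language_model/gpt-3/dygraph/modeling.py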

Contributor Author

Done, updated.

@ZHUI ZHUI self-assigned this Jun 14, 2022
Contributor

@ZHUI ZHUI left a comment

LGTM

@ZHUI ZHUI merged commit 1f446ff into PaddlePaddle:develop Jun 14, 2022
@LiYuRio LiYuRio deleted the dev_add_configs branch June 14, 2022 11:07
4 participants