Skip to content

[Typo]  #137

@Chandler-Bing

Description

@Chandler-Bing

Required prerequisites

Questions

基于上述的几个优化技术,我们在千卡 A800 显卡上达到了 7B 模型 182 TFLOPS 的吞吐,GPU 峰值算力利用率高达 58.3%。

吞吐Throughput 应该指的是训练速度,eg. 3000 token/s/gpu

Checklist

  • I have provided all relevant and necessary information above.
  • I have chosen a suitable title for this issue.

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions