Fix all-reduce memory usage #2151

WoosukKwon · 2023-12-17T07:41:33Z

Fixes #2150

Related issue: https://discuss.pytorch.org/t/cuda-allocation-lifetime-for-inputs-to-distributed-all-reduce/191573

Before:

After:

The peak memory usage was reduced from 25.5 GiB to 19.5 GiB for Llama-70B with TP=8.

zhuohan123

LGTM! Thanks for the fix!

WoosukKwon added 2 commits December 17, 2023 07:34

Fix all reduce memory usage

cf9f18b

Fix

d14bfb2

WoosukKwon requested a review from zhuohan123 December 17, 2023 07:41

WoosukKwon mentioned this pull request Dec 17, 2023

Remove dependency on CuPy #2152

Merged

WoosukKwon changed the title ~~Fix all reduce memory usage~~ Fix all-reduce memory usage Dec 17, 2023

zhuohan123 approved these changes Dec 17, 2023

View reviewed changes

Yard1 approved these changes Dec 17, 2023

View reviewed changes

WoosukKwon merged commit e1d5402 into main Dec 17, 2023

WoosukKwon deleted the fix-all-reduce branch December 17, 2023 09:44

WoosukKwon mentioned this pull request Dec 18, 2023

KV cache is low, memory profiling does not see the remaining VRAM #2136

Closed

xjpang pushed a commit to xjpang/vllm that referenced this pull request Dec 18, 2023

Fix all-reduce memory usage (vllm-project#2151)

11f7905

WoosukKwon mentioned this pull request Jan 3, 2024

Memory leak when using CUDA Graph with torch.distributed.all_reduce (vLLM default config) #2323

Closed

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

Fix all-reduce memory usage (vllm-project#2151)

e163a89

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix all-reduce memory usage #2151

Fix all-reduce memory usage #2151

Uh oh!

WoosukKwon commented Dec 17, 2023 •

edited

Loading

Uh oh!

zhuohan123 left a comment

Uh oh!

Uh oh!

Uh oh!

Fix all-reduce memory usage #2151

Fix all-reduce memory usage #2151

Uh oh!

Conversation

WoosukKwon commented Dec 17, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zhuohan123 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

WoosukKwon commented Dec 17, 2023 •

edited

Loading