[CI/Build] vLLM cache directory for images #6444

Merged · 8 commits into vllm-project:main on Jul 16, 2024

Conversation

@DarkLight1337 (Member) commented Jul 15, 2024

This PR adds a cache directory for vLLM. As with HuggingFace, the cache location is given by the environment variable VLLM_CACHE_ROOT, falling back to $XDG_CACHE_HOME/vllm when it is not set (i.e. ~/.cache/vllm by default). Under this directory, there are two subdirectories for different types of data:

  • VLLM_XLA_CACHE_PATH for XLA persistent cache directory. cc @WoosukKwon
  • VLLM_ASSETS_CACHE for downloaded assets.
    • This removes the need to use s3 or curl to download the images used in VLM examples and tests. cc @xwjiang2010

Accordingly, VLLM_CONFIG_ROOT has been updated to take the vllm directory into account. This means that:

  • The default has been changed from $XDG_CONFIG_HOME to $XDG_CONFIG_HOME/vllm.
  • Subpaths of VLLM_CONFIG_ROOT should no longer prepend vllm/ since that is included in the environment variable already.
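
A minimal sketch of the resolution logic described above, for illustration only (the helper name and the exact subdirectory names "xla_cache" and "assets" are assumptions, not necessarily vLLM's actual internals):

import os

def _cache_root() -> str:
    # VLLM_CACHE_ROOT wins if set; otherwise fall back to
    # $XDG_CACHE_HOME/vllm, which is ~/.cache/vllm by default.
    xdg = os.getenv("XDG_CACHE_HOME", os.path.expanduser("~/.cache"))
    return os.path.expanduser(
        os.getenv("VLLM_CACHE_ROOT", os.path.join(xdg, "vllm")))

VLLM_CACHE_ROOT = _cache_root()

# The two data-specific subdirectories, each overridable on its own:
VLLM_XLA_CACHE_PATH = os.path.expanduser(os.getenv(
    "VLLM_XLA_CACHE_PATH", os.path.join(VLLM_CACHE_ROOT, "xla_cache")))
VLLM_ASSETS_CACHE = os.path.expanduser(os.getenv(
    "VLLM_ASSETS_CACHE", os.path.join(VLLM_CACHE_ROOT, "assets")))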


👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, they only trigger fastcheck CI, which consists of a small and essential subset of tests to quickly catch errors, with the flexibility to run extra individual tests on top (you can do this by unblocking test steps in the Buildkite run).

A full CI run is still required to merge this PR, so once it is ready to go, please make sure to run it. If you need all test signals in between PR commits, you can trigger full CI as well.

To run full CI, you can do one of these:

  • Comment /ready on the PR
  • Add ready label to the PR
  • Enable auto-merge.

🚀

@DarkLight1337 DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Jul 15, 2024
@DarkLight1337 (Member, Author) commented Jul 15, 2024

@WoosukKwon do you think we can move the XLA cache under ~/.cache/vllm as well?


def get_cache_dir():
    """Get the path to the cache for storing downloaded assets."""
    path = Path(VLLM_ASSETS_CACHE)
Contributor:

need to expanduser here?

Member Author:

It is already done in vllm.envs, so there is no need to do it again here.
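
For context, a hedged sketch of what handling this at the env layer looks like (the accessor name here is hypothetical): expanduser is applied once where the variable is read, so consumers such as get_cache_dir() can use the value directly.

import os

# Hypothetical env-layer accessor, illustrative only: user expansion
# happens once here rather than in every consumer.
def _vllm_assets_cache() -> str:
    default = os.path.join("~", ".cache", "vllm", "assets")
    return os.path.expanduser(os.getenv("VLLM_ASSETS_CACHE", default))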

@xwjiang2010 (Contributor) left a comment

just a small comment.

@WoosukKwon (Collaborator) commented Jul 15, 2024

do you think we can move the XLA cache under ~/.cache/vllm as well?

@DarkLight1337 Yes. I think that's good for consistency.

@WoosukKwon (Collaborator)

BTW, we also have config-related cache under ~/.config/vllm/.

@DarkLight1337 (Member, Author)

BTW, we also have config-related cache under ~/.config/vllm/.

The only cache I found under that directory is gpu_p2p_access_cache_*. I have moved it under ~/.cache/vllm. cc @youkaichao
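
As a hedged illustration of that move (the gpu_p2p_access_cache_* name comes from the comment above; the exact suffix, file extension, and helper are assumptions), the per-device p2p cache now resolves under the cache root instead of the config root:

import os

VLLM_CACHE_ROOT = os.path.expanduser(
    os.getenv("VLLM_CACHE_ROOT", os.path.join("~", ".cache", "vllm")))

# Hypothetical reconstruction: the p2p access cache, keyed by the visible
# GPU list, now lives under ~/.cache/vllm rather than ~/.config/vllm.
def p2p_cache_path(visible_devices: str) -> str:
    return os.path.join(
        VLLM_CACHE_ROOT,
        f"gpu_p2p_access_cache_for_{visible_devices}.json")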

@youkaichao (Member)

LGTM

@simon-mo simon-mo merged commit d970115 into vllm-project:main Jul 16, 2024
71 of 73 checks passed
@DarkLight1337 DarkLight1337 deleted the image-assets branch July 16, 2024 06:14
fialhocoelho pushed a commit to opendatahub-io/vllm that referenced this pull request Jul 19, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024
Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024
LeiWang1999 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request Mar 26, 2025