[Model] use AutoWeightsLoader for bart #18299
Conversation
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs do not trigger a full CI run by default. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.
@lengrongfu PTAL
Thanks for working on this; I've left some comments.
@DarkLight1337 @noooop I've resolved it, and BART can now be successfully loaded.
Is `BartModel.load_weights` now unused, and can it be deleted, given that `BartForConditionalGeneration` already uses `AutoWeightsLoader`? Or can the `BartModel.load_weights` logic itself be simplified using `AutoWeightsLoader`? Then `BartForConditionalGeneration.load_weights` becomes:

```python
def load_weights(self, weights: Iterable[tuple[str, torch.Tensor]]):
    return self.model.load_weights(weights, mapper=self.hf_to_vllm_mapper)
```

The end goal of `AutoWeightsLoader` is to replace each model's hand-written `load_weights` function. Right now the `load_weights` logic is still not simplified.
@noooop Please correct me if I'm misunderstanding.
You have optimized the logic related to `lm_head` and `tie_word_embeddings`. Now try using `AutoWeightsLoader` to simplify the `BartModel.load_weights` logic.
@noooop Thank you for your reply, but I haven't found a way to optimize `BartModel.load_weights`; could you give me some hints?
Please refer to #20534
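The discussion above is about replacing hand-written `load_weights` methods with vLLM's `AutoWeightsLoader`, which routes incoming `(name, tensor)` pairs to the right submodule by prefix, optionally skipping weights such as `lm_head` when embeddings are tied. As a rough sketch of the idea (this is not vLLM's actual implementation; all class and parameter names below are invented for illustration):

```python
# Toy sketch of the AutoWeightsLoader pattern: a generic loader walks named
# submodules and dispatches each (name, tensor) pair by prefix, so individual
# models no longer need bespoke load_weights logic.

class ToyModule:
    """Minimal stand-in for an nn.Module tree (illustrative only)."""

    def __init__(self, **children):
        self.children = children  # name -> ToyModule
        self.params = {}          # leaf parameter storage

class ToyAutoWeightsLoader:
    """Prefix-dispatching loader, loosely mimicking AutoWeightsLoader."""

    def __init__(self, module, skip_prefixes=()):
        self.module = module
        self.skip_prefixes = tuple(skip_prefixes)

    def load_weights(self, weights, mapper=None):
        loaded = []
        for name, tensor in weights:
            if mapper is not None:
                # Here mapper is a plain dict renaming HF names to vLLM names;
                # the real WeightsMapper is richer than this.
                name = mapper.get(name, name)
            if self.skip_prefixes and name.startswith(self.skip_prefixes):
                continue  # e.g. skip lm_head.* when word embeddings are tied
            self._assign(self.module, name, tensor)
            loaded.append(name)
        return loaded

    def _assign(self, module, name, tensor):
        # Recurse into child modules while the leading name component matches.
        head, _, rest = name.partition(".")
        if rest and head in module.children:
            self._assign(module.children[head], rest, tensor)
        else:
            module.params[name] = tensor
```

With this pattern, a model's `load_weights` can collapse to a one-line delegation to the loader, which is the simplification being asked for in the comments above.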
@noooop Thanks for your hints; I've optimized the
Please see the message above.
Thanks for your contribution.
Signed-off-by: calvin chen <[email protected]>
Head branch was pushed to by a user without write access
v1/entrypoints/openai/test_multi_api_servers.py::test_single_completion[ibm-research/PowerMoE-3b]: I think this failure is not related to this PR, but it has failed twice already. The test passes locally, and I don't know why it fails in CI.
I found another failure related to ibm-research/PowerMoE-3b, so this failure is not related to this PR: https://buildkite.com/vllm/ci/builds/24219/steps/canvas?sid=019818b9-c1bf-4856-909d-04ed3afc8696. I'll try again tomorrow.
OK, thanks.
FIX (partial) #15697