Skip to content

Commit 2b04c20

Browse files
authored
[Bugfix] Allow shared_experts skip quantization for DeepSeekV2/V3 (#14100)
Signed-off-by: mgoin <[email protected]>
1 parent ae122b1 commit 2b04c20

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

vllm/model_executor/models/deepseek_v2.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -145,6 +145,7 @@ def __init__(
145145
hidden_act=config.hidden_act,
146146
quant_config=quant_config,
147147
reduce_results=False,
148+
prefix=f"{prefix}.shared_experts",
148149
)
149150

150151
def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:

0 commit comments

Comments
 (0)