-
Notifications
You must be signed in to change notification settings - Fork 195
Closed
Labels
bugSomething isn't workingSomething isn't working
Description
It seems that bloom is not support for quantization right now
Traceback (most recent call last):
File "/home/work/vllm-main/scripts/w8a8v2.py", line 40, in <module>
oneshot(
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/transformers/finetune/text_generation.py", line 76, in oneshot
main(model_args, data_args, training_args)
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/transformers/finetune/text_generation.py", line 364, in main
stage_runner.one_shot()
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/transformers/finetune/runner.py", line 171, in one_shot
self.trainer.one_shot(calibration_data=calib_data, stage=stage)
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/transformers/finetune/session_mixin.py", line 401, in one_shot
apply(
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/core/session_functions.py", line 184, in apply
return active_session().apply(
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/core/session.py", line 210, in apply
self.initialize(**kwargs)
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/core/session.py", line 156, in initialize
mod_data = self._lifecycle.initialize(
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/core/lifecycle.py", line 126, in initialize
data = mod.initialize(state=self.state, **extras)
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/modifiers/stage.py", line 124, in initialize
modifier.initialize(state, **kwargs)
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/modifiers/modifier.py", line 118, in initialize
initialized = self.on_initialize(state=state, **kwargs)
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/modifiers/smoothquant/base.py", line 127, in on_initialize
self.resolved_mappings_ = self._resolve_mappings(state.model)
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/modifiers/smoothquant/base.py", line 184, in _resolve_mappings
_, balance_layer = get_matching_layer(
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/utils/pytorch/module.py", line 311, in get_matching_layer
potential_matches = get_layers(target, module)
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/utils/pytorch/module.py", line 166, in get_layers
return match_layers_params(targets, module)
File "/home/bml/.local/lib/python3.9/site-packages/llmcompressor/utils/pytorch/module.py", line 160, in match_layers_params
raise ValueError(f"Could not find targets {missed} in module {module}")
ValueError: Could not find targets ['re:.*q_proj'] in module BloomForCausalLM
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't working