Error : NotImplementedError: offload_to_cpu=True and NO_SHARD is not supported yet #76

@ghost

Description

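I fine-tuned the model with alpacaLlava_llamaQformerv2Peft_QF_13B.sh. After the first epoch finished, an error occurred while saving the model. I was running on a single GPU. The script contents are as follows:
[screenshot of the script: 2ae814302c794ac8582b4991947af66]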

  File "/home/liwx/anaconda3/envs/accessory/lib/python3.10/contextlib.py", line 135, in __enter__
    return next(self.gen)
  File "/home/liwx/anaconda3/envs/accessory/lib/python3.10/site-packages/torch/distributed/fsdp/_unshard_param_utils.py", line 171, in _unshard_fsdp_state_params
    _validate_unshard_params_args(
  File "/home/liwx/anaconda3/envs/accessory/lib/python3.10/site-packages/torch/distributed/fsdp/_unshard_param_utils.py", line 140, in _validate_unshard_params_args
    raise NotImplementedError(
NotImplementedError: offload_to_cpu=True and NO_SHARD is not supported yet
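
For context, a possible workaround sketch. It assumes (not confirmed against the repository's save code) that the checkpoint is gathered through FSDP's full state dict with `offload_to_cpu=True`, and that on a single GPU FSDP falls back to `NO_SHARD`, which is the combination the traceback rejects. The helper name `should_offload_to_cpu` is hypothetical and only illustrates the decision.

```python
def should_offload_to_cpu(world_size: int, requested: bool = True) -> bool:
    """Pick the offload_to_cpu value to use when gathering a full state dict.

    Assumption: with a single process (world_size == 1) FSDP runs as NO_SHARD,
    and NO_SHARD raises NotImplementedError when offload_to_cpu=True.
    In that case offloading is disabled; otherwise the requested value is kept.
    """
    if world_size <= 1:
        # Single-process run: avoid the unsupported combination.
        return False
    return requested


if __name__ == "__main__":
    # Single GPU, as in this report: offloading is turned off.
    print(should_offload_to_cpu(1))
    # Multi-GPU run: the requested setting is kept.
    print(should_offload_to_cpu(8))
```

The result could then be passed as `FullStateDictConfig(rank0_only=True, offload_to_cpu=should_offload_to_cpu(world_size))` when entering `FSDP.state_dict_type(...)`. Whether this matches the actual save path in the script is an assumption.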
