Skip to content

模型融合代码跑不通merge_lora_params.py,求解答 #1187

Open
@fengyue20

Description

@fengyue20

执行以下代码:
python paddlemix/tools/merge_lora_params.py
--model_name_or_path paddlemix/examples/deepseek_vl2/deepseek-ai/deepseek-vl2-tiny
--lora_path work_dirs/deepseekvl2_tiny_lora_bs16_1e5/checkpoint-60
--merge_model_path paddlemix/tools/merge2

显示
LlavaConfig register success!!!!!
LLavaTokenizer register success!!!!
[2025-04-03 22:03:17,608] [ INFO] - Loading configuration file paddlemix/examples/deepseek_vl2/deepseek-ai/deepseek-vl2-tiny/config.json
Traceback (most recent call last):
File "/mnt/storage/jinming/miniconda3/envs/DeepSeek-paddle_fine_tune/lib/python3.10/site-packages/paddlenlp/transformers/auto/configuration.py", line 466, in from_pretrained
config_class = CONFIG_MAPPING[config_dict["model_type"]]
File "/mnt/storage/jinming/miniconda3/envs/DeepSeek-paddle_fine_tune/lib/python3.10/site-packages/paddlenlp/transformers/auto/configuration.py", line 255, in getitem
raise KeyError(key)
KeyError: 'deepseek_vl_v2'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "/mnt/storage/jinming/djm/deepseekvl2lora/paddle2/PaddleMIX/paddlemix/tools/merge_lora_params.py", line 63, in
merge()
File "/mnt/storage/jinming/djm/deepseekvl2lora/paddle2/PaddleMIX/paddlemix/tools/merge_lora_params.py", line 41, in merge
model_config = AutoConfigMIX.from_pretrained(args.model_name_or_path, dtype=dtype)
File "/mnt/storage/jinming/miniconda3/envs/DeepSeek-paddle_fine_tune/lib/python3.10/site-packages/paddlenlp/transformers/auto/configuration.py", line 468, in from_pretrained
raise ValueError(
ValueError: The checkpoint you are trying to load has model type deepseek_vl_v2 b

Image

求解答

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions