"""This LoRA model has all supported Mixtral target modules"""
-
-    if torch.cuda.device_count() < tp_size:
-        pytest.skip(f"Not enough GPUs for tensor parallelism {tp_size}")
-
-    prompts = [
-        "Question: What happens if you touch the eyes of a blind man?\nChoices:\nB: The man's sight will be restored.\nA: Nothing happens if you touch the eyes of a blind man.\nC: The man will get his eyesight back.\nD: The man will no longer be blind.\nAnswer:",  # noqa: E501
-        "Question: Which will increase the energy of motion of water molecules?\nChoices:\nA: add heat\nB: add ice\nC: remove some of the water\nD: place the water in the freezer\nAnswer:",  # noqa: E501
-        "Since Craig threw aluminum cans in the trash and Benjamin recycled, _ was environmentally irresponsible.\nChoices:\n1: Craig\n2: Benjamin\nAnswer:",  # noqa: E501
-    ]
-
-    llm = vllm.LLM(
-        MODEL_PATH,
-        enable_lora=True,
-        max_num_seqs=16,
-        max_loras=4,
-        distributed_executor_backend="ray",
-        tensor_parallel_size=tp_size,
-        fully_sharded_loras=fully_shard,
-        max_lora_rank=32,
-    )
-
-    expected_lora_output = [
-        "A: Nothing happens if you touch the eyes of a blind man.",