Commit fc91188

dsikka and ElizaWszola authored
[Kernel] Expand MoE weight loading + Add Fused Marlin MoE Kernel (#7766)
Co-authored-by: ElizaWszola <[email protected]>
1 parent ed6f002 commit fc91188

16 files changed: +2382 -85 lines changed

CMakeLists.txt

Lines changed: 5 additions & 0 deletions
@@ -296,6 +296,11 @@ set(VLLM_MOE_EXT_SRC
   "csrc/moe/torch_bindings.cpp"
   "csrc/moe/topk_softmax_kernels.cu")
 
+if(VLLM_GPU_LANG STREQUAL "CUDA")
+  list(APPEND VLLM_MOE_EXT_SRC
+       "csrc/moe/marlin_moe_ops.cu")
+endif()
+
 define_gpu_extension_target(
   _moe_C
   DESTINATION vllm
