vllm.model_executor.layers.fused_moe.shared_fused_moe
SharedFusedMoE
Bases: FusedMoE
A FusedMoE operation that also computes the results of the shared experts. When an all2all communicator is in use, the shared-expert computation can be interleaved with the fused all2all dispatch communication step.
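The interleaving idea can be illustrated with a minimal, framework-free sketch. This is not vLLM code: `dispatch`, `shared_experts`, `routed_experts`, and `shared_fused_moe_sketch` are hypothetical stand-ins, and a background thread plays the role of the asynchronous all2all dispatch. The point is only the ordering: the communication step is started first, and the shared experts run while it is still in flight.

```python
from concurrent.futures import ThreadPoolExecutor


def dispatch(tokens):
    # Hypothetical stand-in for the all2all dispatch (communication) step.
    return list(tokens)


def shared_experts(tokens):
    # Hypothetical stand-in for the shared experts, applied to every token.
    return [2.0 * t for t in tokens]


def routed_experts(tokens):
    # Hypothetical stand-in for the routed (fused) expert computation.
    return [t + 1.0 for t in tokens]


def shared_fused_moe_sketch(tokens):
    with ThreadPoolExecutor(max_workers=1) as pool:
        # Start the dispatch "communication" in the background.
        in_flight = pool.submit(dispatch, tokens)
        # Compute the shared experts while the dispatch is in flight.
        shared_out = shared_experts(tokens)
        # Routed experts need the dispatched tokens, so wait for them here.
        routed_out = routed_experts(in_flight.result())
    # Returning both outputs separately is an assumption of this sketch.
    return shared_out, routed_out


shared_out, routed_out = shared_fused_moe_sketch([1.0, 2.0, 3.0])
```

The shared experts depend only on the local input tokens, not on the dispatched ones, which is why their computation can overlap with the communication step without changing the result.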