Speculative MoE: Communication Efficient Parallel MoE Inference with Speculative Token and Expert Pre-scheduling • Paper 2503.04398 • Published Mar 6, 2025