Long-context MoE training guidance for Megatron Bridge. Covers context parallelism (CP) sizing, selective recompute, dispatcher choices, and practical patterns from DeepSeek-V3 (DSV3), Qwen3, and Qwen3-Next long-context experiments.