StepZen Open-Sources the Full Step3.5Flash Stack: 196-Billion-Parameter MoE Architecture, Inference Volume Ranks Second to OpenClaw
StepZen has open-sourced the full stack of its Step3.5Flash model, including the pre-training and mid-training weights and the training framework. The model is designed for agents and uses a sparse MoE (mixture-of-experts) architecture with 196 billion total parameters, of which approximately 11 billion are activated during inference. It is highly energy-efficient, and its peak per-request inference speed on code tasks reaches 350 TPS (tokens per second).
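To put the sparsity in perspective, the two parameter counts above can be turned into an active fraction. The short sketch below uses only the rounded figures from this article (196 billion total, roughly 11 billion active); the variable and function names are illustrative, and the result is an approximation rather than an official figure.

```python
# Rounded figures quoted in the article; the active count is approximate.
TOTAL_PARAMS = 196e9   # total parameters in the MoE model
ACTIVE_PARAMS = 11e9   # parameters activated per token during inference


def active_fraction(total: float, active: float) -> float:
    """Return the share of parameters used on each forward pass."""
    return active / total


fraction = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
ratio = TOTAL_PARAMS / ACTIVE_PARAMS

print(f"Active share per token: {fraction:.1%}")
print(f"Total-to-active ratio:  {ratio:.1f}x")
```

By these figures, each token touches only about one-eighteenth of the model's weights, which is the usual reason a sparse MoE model costs far less compute per token than a dense model of the same total size.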