Stability AI has released Stable Diffusion 3.5 Medium, a text-to-image model aimed at general users. It is free to use, including commercially, under the terms of the Stability AI Community License, and it is designed to balance output quality against hardware requirements. It uses an improved multimodal diffusion transformer architecture (MMDiT-X) with 2.5 billion parameters, which lowers the hardware threshold for ordinary users: according to Stability AI, it needs only 9.9 GB of VRAM (excluding text encoders), so it runs on most consumer-grade graphics cards.
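To put the parameter count in context, here is a rough back-of-the-envelope sketch of how much memory the weights alone would occupy at common precisions. It assumes weight memory is simply parameter count times bytes per parameter. The function name `weight_memory_gb` and the dtype table are illustrative, not part of any Stability AI tooling, and the estimate ignores activations, the VAE, and the text encoders.

```python
# Rough weight-memory estimate for a 2.5B-parameter model.
# Assumption: weight memory = parameter count x bytes per parameter.
# Activations, the VAE, and text encoders are NOT included, so treat
# the result as a lower bound on real VRAM use.

PARAMS = 2_500_000_000  # 2.5 billion parameters, as stated above

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2}


def weight_memory_gb(num_params: int, dtype: str) -> float:
    """Return the size of the weights in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: {weight_memory_gb(PARAMS, dtype):.1f} GB")
```

Under these assumptions the weights take about 5 GB at half precision and about 10 GB at full precision, so half-precision inference is likely the setting in which the quoted VRAM figure leaves headroom for activations and other components.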
ckpt
A text-to-image generation model built on an improved Multimodal Diffusion Transformer (MMDiT-X), with significant improvements in image quality, typography, complex prompt understanding, and resource efficiency.
stabilityai