Best MMDiT AI Tools & Models - Premium MMDiT News

AI News

Qwen-Image Launches with a 20B-parameter MMDiT Model, Setting New SOTA in Image Generation

No description available

Alibaba Tongyi Qianwen Open Sources New Text-to-Image Model Qwen-Image

Tongyi Qianwen series has open-sourced a 2 billion parameter multimodal diffusion transformer (MMDiT) image generation foundation model named Qwen-Image for the first time. This innovative achievement has made breakthroughs in complex text rendering and precise image editing, and has demonstrated excellent performance on multiple public benchmark tests, becoming a rising star in the field of image generation and editing. Qwen-Image stands out with its strong text rendering capabilities, supporting multi-line layout, paragraph-level text generation, and fine-grained detail presentation, whether in English or Chinese.

10.9k 5 hours ago

Alibaba Tongyi Qianwen Open Sources New Text-to-Image Model Qwen-Image

JUMPSTAR Releases Image Generation Model Step-1X-Medium with New Features such as Image-to-Image Generation

Shanghai JUMPSTAR Intelligent Technology Co., Ltd. recently announced a major upgrade to its Step-1X series of image generation models with the launch of the improved Step-1X-Medium version. This upgraded version has achieved significant enhancements in several areas: based on the MMDit architecture, the generation speed has increased by over 30%; through targeted training, the new version exhibits stronger understanding capability and text-image consistency, resulting in more natural details in the generated images.

16.1k yesterday

JUMPSTAR Releases Image Generation Model Step-1X-Medium with New Features such as Image-to-Image Generation

Free for Commercial Use! Stability AI Launches Lightweight AI Art Tool Stable Diffusion 3.5 Medium Model

Stability AI has once again broken through technical barriers by launching the new Stable Diffusion 3.5 Medium model. This AI art tool aimed at the general public is not only completely free for commercial use but also achieves a perfect balance between high performance and accessibility. With its multimodal diffusion transformer (MMDiT-X) architecture and a streamlined design of 2.5 billion parameters, this model cleverly addresses the hardware threshold for ordinary users, requiring only 9.9GB of VRAM to run smoothly on most consumer-grade graphics cards.

18.3k 3 hours ago