Alibaba Tongyi Qianwen Open Sources New Text-to-Image Model Qwen-Image
Tongyi Qianwen series has open-sourced a 2 billion parameter multimodal diffusion transformer (MMDiT) image generation foundation model named Qwen-Image for the first time. This innovative achievement has made breakthroughs in complex text rendering and precise image editing, and has demonstrated excellent performance on multiple public benchmark tests, becoming a rising star in the field of image generation and editing. Qwen-Image stands out with its strong text rendering capabilities, supporting multi-line layout, paragraph-level text generation, and fine-grained detail presentation, whether in English or Chinese.