PixArt-α: A Transformer-based Image Generation Model Competing with Midjourney and SDXL

站长之家

Published inAI News · 1 min read · Nov 10, 2023

102

The data to be translated: PixArt-α is a Transformer-based text-to-image generation model, whose competitive image generation quality and significantly reduced training costs allow it to rival Midjourney and SDXL. With a training strategy decomposition, an efficient T2I Transformer, and high-information-density data training, PixArt-α excels in high-resolution image synthesis and complex text prompts, achieving a training speed that is only 10.8% of Stable Diffusion v1.5. PixArt supports high-resolution image synthesis up to 1024 pixels, reduces training costs by 90%, and offers the AIGC community and startups a new perspective on low-cost, high-quality generative models.

Artificial Intelligence Image Generation PixArt

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Report: ByteDance's Pico is Developing a Lightweight MR Glasses, Directly Targeting Meta's Next-Generation Product

ByteDance's Pico is developing a lightweight MR glasses, weighing only 127 grams. It uses a split design to transfer computing tasks to an external processing unit and is developing a dedicated chip to reduce latency. This move directly targets Meta's MR strategy, as the latter has paused the development of Quest 4 and shifted towards lightweight devices. Both major companies are betting on split-style MR glasses, indicating that the industry is transitioning from traditional VR headsets to more compact forms.

Jul 15, 2025

Meitu RoboNeo Launch: One Sentence to Complete Photo Editing and Website Building, AI Image Processing Enters the All-Round Era

Meitu launches RoboNeo, an AI tool for image editing, brand design, and web creation via natural language commands. It automates tasks like wedding photo retouching, brand kits, and e-commerce content, reducing visual production barriers for SMEs. Initial tests show rapid multi-format output generation within 5 minutes, though lighting details need refinement.....

Jul 15, 2025

110

PixVerse AI Video Creation Platform Launches Multi-Keyframe Generation Feature

On July 11, PixVerse AI video creation platform, which has surpassed 60 million global users, announced a major feature upgrade — the addition of the 'Multi-Keyframe Generation' function in the Start-End Frame module. This marks a new stage in AI video creation, transitioning from the generation of single segments to narrative expression. Users can now upload up to 7 images as keyframes via the web version's start-end frame feature, and the AI will automatically analyze the semantic relationships between frames, intelligently building smooth action and scene transition paths. This technological breakthrough enables static images to be presented dynamically.

Jul 14, 2025

110

The Ministry of Industry and Information Technology will release the 'International Artificial Intelligence Open Source Cooperation Initiative' at the 2025 World Artificial Intelligence Conference

The 2025 World AI Conference, themed 'Intelligent Era, Global Collaboration', will be held in Shanghai from July 26-28. It will launch an international AI open-source initiative and showcase latest AI technologies, building on its success since 2018 (300k+ visitors in 2024). China also plans a BRICS AI cooperation center.....

Jul 14, 2025

Study Warns of Major Risks in Using Artificial Intelligence to Treat Chatbots

Stanford study warns of risks in AI therapy chatbots, showing stigmatization of mental conditions and inadequate crisis responses. Some AIs failed to detect dangers, giving mechanical replies. Researchers recommend auxiliary roles over replacing therapists.....

Jul 14, 2025

New Breakthrough in Real-Time Video Generation: Meta StreamDiT Can Generate High-Quality Videos Frame by Frame with a Single GPU

Meta and Berkeley developed StreamDiT for real-time AI video generation: 1) 16fps 512p on single GPU, 4B-param model creates 1-min videos with live edits; 2) Novel buffer enables parallel processing (2 frames/0.5s) with 8-step optimization; 3) Trained on 3K HD videos + 2.6M dataset; 4) Outperforms rivals in motion smoothness, 30B-param version shows quality potential; 5) Enables interactive video despite transition flaws.....

Jul 14, 2025

200

AI Daily: Zhipu Launches PPT Generation Function AI Slides; Ke Ling AI Releases Ketur 2.1 Model

1. Zhipu launches free AI Slides for PPT generation. 2. Keling AI introduces KeTu 2.1 with 180 styles. 3. NVIDIA's DiffusionRenderer enables 3D scene editing. 4. Modao AI offers 30-second prototype generation. 5. Higgsfield creates avatars from 10 photos. 6. Google open-sources GenAI Processors. 7. Google Veo3 adds image-to-video. 8. Mistral AI releases Devstral2507 for code generation.....

Jul 11, 2025

130

Google Announces the Latest Class of Students at the American Artificial Intelligence Infrastructure Institute

Jul 11, 2025

150

Zhipei has launched a PPT generation feature similar to Manus AI Slides, free to use without limitations

Jul 11, 2025

220

Google Veo3 Adds Image-to-Video Feature, Users Create Over 40 Million Videos Within Seven Weeks

Jul 11, 2025

Product Finder

Product Submit

AI Models Finder

MCP Servers

MCP Client

MCP Inspector

Case Tutorials

Latest AI News

AI Daily Brief

PixArt-α: A Transformer-based Image Generation Model Competing with Midjourney and SDXL

站长之家

This article is from AIbase Daily

AI News Recommendations

Report: ByteDance's Pico is Developing a Lightweight MR Glasses, Directly Targeting Meta's Next-Generation Product

Meitu RoboNeo Launch: One Sentence to Complete Photo Editing and Website Building, AI Image Processing Enters the All-Round Era

PixVerse AI Video Creation Platform Launches Multi-Keyframe Generation Feature

The Ministry of Industry and Information Technology will release the 'International Artificial Intelligence Open Source Cooperation Initiative' at the 2025 World Artificial Intelligence Conference

Study Warns of Major Risks in Using Artificial Intelligence to Treat Chatbots

New Breakthrough in Real-Time Video Generation: Meta StreamDiT Can Generate High-Quality Videos Frame by Frame with a Single GPU

AI Daily: Zhipu Launches PPT Generation Function AI Slides; Ke Ling AI Releases Ketur 2.1 Model

Google Announces the Latest Class of Students at the American Artificial Intelligence Infrastructure Institute

Zhipei has launched a PPT generation feature similar to Manus AI Slides, free to use without limitations

Google Veo3 Adds Image-to-Video Feature, Users Create Over 40 Million Videos Within Seven Weeks