Google's new image generation model, Instruct-Imagen, generates images by following multimodal instructions. Experiments show that the model matches or surpasses previous methods in both in-domain and zero-shot evaluations, handling complex instructions and generalizing well to unseen tasks. The approach improves not only image quality but also alignment between the generated image and the text.