3DTopia is a two-stage text-to-3D generation model. In the first stage, a diffusion model quickly generates candidate items. In the second stage, the selected assets from the first stage are optimized. This model can achieve high-quality text-to-3D generation within 5 minutes.