No description available
A 2D video generation system based on diffusion models, achieving human-object interaction animations.