AnimateDiff is an effective framework for animating personalized text-to-image models. It adds a new initialization animation modeling module to a frozen base text-to-image model and trains on video clips to extract reasonable animation priors. Once trained, injecting this animation modeling module enables all personalized versions derived from the same base model to generate diverse and personalized animated images. This framework saves the effort of fine-tuning for specific models.