OmniHuman-1
OmniHuman-1 is a multimodal framework that generates human videos based on a single portrait and motion signals.
Common Product, Video, Artificial Intelligence, Video Generation
OmniHuman-1 is an end-to-end, multimodal-conditioned human video generation framework that creates human videos from a single portrait and motion signals (audio, video, or a combination of both). It addresses the scarcity of high-quality training data through a mixed training strategy and accepts images of arbitrary aspect ratios, producing realistic human videos. It handles weak signal inputs, particularly audio, especially well, which makes it suitable for scenarios such as virtual streaming and video production.
OmniHuman-1 Visits Over Time
Monthly Visits: 80,272
Bounce Rate: 40.83%
Pages per Visit: 1.5
Visit Duration: 00:00:10