Virtual Try-On Tool Voost Can Reproduce Fabric Texture and Wrinkle Details

AIbase基地

Published inAI News · 4 min read · Aug 11, 2025

Recently, researchers have proposed an innovative framework called Voost, aimed at improving the performance of virtual try-on and try-off technologies. Virtual try-on refers to generating a realistic image of a person wearing a target outfit. However, accurately modeling the correspondence between clothing and the body has always been a major challenge due to changes in posture and appearance. The introduction of Voost provides a new solution to this problem.

Voost is a unified and scalable model that jointly learns virtual try-on and try-off tasks through a single diffusion transformer (DiT). Unlike traditional methods, Voost enables bidirectional supervision for each pair of clothing and person, thereby enhancing the reasoning of the relationship between clothing and the body, without relying on task-specific networks, auxiliary losses, or additional labels. This feature makes Voost excel in task flexibility and generation diversity.

Additionally, the research team introduced two techniques for inference to enhance the model's robustness. One is attention temperature scaling, which maintains model stability under changes in resolution or masks; the other is self-correcting sampling, which further optimizes the generation results by utilizing the bidirectional consistency between tasks. These innovative techniques enable Voost to adapt to different input conditions during inference.

In extensive experiments, Voost performed excellently, achieving the latest level in virtual try-on and try-off benchmark tests. Research results show that Voost significantly outperforms many strong baseline models in multiple aspects, including alignment accuracy, visual realism, and generalization ability. This achievement not only provides a new direction for the development of virtual try-on and try-off technology but also lays the foundation for future research in related fields.

Voost's success demonstrates the potential of deep learning technology in the clothing try-on experience, signaling that we may witness new changes in the digital fashion and online shopping fields.

Project: https://nxnai.github.io/Voost/

Key Points:
🌟 Voost is a new framework that enables joint learning of virtual try-on and try-off through a single diffusion transformer.
🔍 Voost excels in task flexibility and generation diversity, without requiring specific networks or additional labels.
🚀 Experimental results show that Voost outperforms various strong baseline models in accuracy and visual quality.

Google Gemini Faces Large-Scale Model Distillation Attack, With Over 100,000 Prompts Leaking Core Logic in a Single Instance

Google's AI chatbot Gemini faced a large-scale 'distillation attack,' where attackers used over 100,000 repeated queries to extract its internal mechanisms, aiming to clone or enhance their own AI systems. Google attributed the attack to commercial motives, raising industry-wide concerns over large model security.....

AI Daily: ByteDance Launches Seedream 5.0 Lite; Xiaohongshu Will Limit Traffic if AI Is Not Marked; Meitu's First Batch of Shots Are Integrated with Seedance 2.0 Large Model

Welcome to the [AI Daily] section! Here is your guide to exploring the world of artificial intelligence every day. Every day, we bring you the latest content in the AI field, focusing on developers, helping you understand technological trends and innovative AI product applications. Click to learn more about new AI products: https://app.aibase.com/zh1. ByteDance Launches Seedream 5.0 Lite: A new benchmark for image creation with 'visual reasoning' and real-time internet connectivity capabilities. The Seed team of ByteDance has launched Seedream

Only 7 People Can Beat It! New Gemini 3 Deep Think Released: Dominating Programming and Research Rankings

Google's Gemini 3 Deep Think model has been significantly upgraded, excelling in programming, research, and engineering. Its key highlight is achieving a high score of 3455 on Codeforces, surpassing most human players, with only 7 globally able to beat it, marking a new stage in AI reasoning capabilities.....

Highlighting Ultra-Low Latency! Mistral Launches a New Speech-to-Text AI Model

French AI company Mistral AI has released two speech-to-text models, Voxtral Mini Transcribe V2 and Voxtral Realtime, with high-speed transcription, privacy protection, and cost-effectiveness as their main features. The models offer high-precision transcription, speaker identification, and low-latency characteristics, suitable for commercial applications such as virtual assistants, call centers, and compliance records.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Brand Visibility

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

Virtual Try-On Tool Voost Can Reproduce Fabric Texture and Wrinkle Details

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Google Gemini Faces Large-Scale Model Distillation Attack, With Over 100,000 Prompts Leaking Core Logic in a Single Instance

Chinese-style Webtoon Enters the AI Era! Hengdian Film and Television's 'Nine Provinces: The Story of the Clouds' Launches Today, Pioneering a New Model for Oriental Aesthetics

AI Daily: ByteDance Launches Seedream 5.0 Lite; Xiaohongshu Will Limit Traffic if AI Is Not Marked; Meitu's First Batch of Shots Are Integrated with Seedance 2.0 Large Model

Only 7 People Can Beat It! New Gemini 3 Deep Think Released: Dominating Programming and Research Rankings

AI Funding Records Broken Again! Anthropic Secures $30 Billion in Major Funding, Valuation Surges to $380 Billion, Approaching OpenAI

Google DeepMind CEO Hassabis: I try to sleep 6 hours a day, and I usually feel really energetic around 1 a.m.

Another Breakthrough in Domestic Computing Infrastructure! Moortu MTT S5000 Completes Full-Process Compatibility with Zhipu GLM-5 Large Model

AI Inference Track Valuation Surges: Modal Labs Discusses New Funding Round, Valuation May Reach $2.5 Billion

Highlighting Ultra-Low Latency! Mistral Launches a New Speech-to-Text AI Model

Has the Robot Evolution Singularity Arrived? Alibaba Releases RynnBrain Large Model: Equipping Machines with Thinking Brains, Performance Exceeds Google Gemini

AI News Recommendations

Google Gemini Faces Large-Scale Model Distillation Attack, With Over 100,000 Prompts Leaking Core Logic in a Single Instance

Chinese-style Webtoon Enters the AI Era! Hengdian Film and Television's 'Nine Provinces: The Story of the Clouds' Launches Today, Pioneering a New Model for Oriental Aesthetics

AI Daily: ByteDance Launches Seedream 5.0 Lite; Xiaohongshu Will Limit Traffic if AI Is Not Marked; Meitu's First Batch of Shots Are Integrated with Seedance 2.0 Large Model

Only 7 People Can Beat It! New Gemini 3 Deep Think Released: Dominating Programming and Research Rankings

AI Funding Records Broken Again! Anthropic Secures $30 Billion in Major Funding, Valuation Surges to $380 Billion, Approaching OpenAI

Google DeepMind CEO Hassabis: I try to sleep 6 hours a day, and I usually feel really energetic around 1 a.m.

Another Breakthrough in Domestic Computing Infrastructure! Moortu MTT S5000 Completes Full-Process Compatibility with Zhipu GLM-5 Large Model

AI Inference Track Valuation Surges: Modal Labs Discusses New Funding Round, Valuation May Reach $2.5 Billion

Highlighting Ultra-Low Latency! Mistral Launches a New Speech-to-Text AI Model

Has the Robot Evolution Singularity Arrived? Alibaba Releases RynnBrain Large Model: Equipping Machines with Thinking Brains, Performance Exceeds Google Gemini

GEO Services